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METHODS OF DIAGNOSIS OF LUNG CANCER, COMPOSITIONS AND METHODS 
OF SCREENING FOR MODULATORS OF LUNG CANCER 

5 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application is related to USSN 60/284,770, filed April 18, 2001; USSN 
60/290,492, filed May 10, 2001; USSN 60/334,370, filed November 29, 2001; USSN 
60/339,245, filed November 9, 2001; USSN 60/350,666, filed November 13, 2001; and 
10 USSN 60/xxx,xxx, filed April 12, 2002 (Docket OMNI-002P); each of which is incorporated 
herein by reference in its entirety. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein expression 
15 profiles and nucleic acids, products, and antibodies thereto that are involved in lung cancer; 
and to the use of such expression profiles and compositions in diagnosis and therapy of lung 
cancer. The invention further relates to methods for identifying and using agents and/or 
targets that inhibit lung cancer or related conditions. 

20 BACKGROUND OF THE INVENTION 

Lung cancer is the second most commonly occurring cancer in the United States and 
is the leading cause of cancer-related death. It is estimated that there are over 160,000 new 
cases of lung cancer in the United States every year. Of those who are diagnosed with lung 
cancer, 86 percent will die within five years. Lung cancer is the most common visceral 

25 cancer in men and accounts for nearly one third of all cancer deaths in both men and women. 
In fact, lung cancer accounts for 7% of all deaths, due to any cause, in both men and women. 

Smoking is the primary cause of lung cancer, with more than 80% of lung cancers 
resulting from smoking. About 400 to 500 separate gaseous substances are present in the 
smoke of a non-filter cigarette. The most noteworthy substances include nitrogen oxides, 

30 hydrogen cyanide, formaldehyde, benzene, and toluene. The particles present in cigarette 
smoke contain at least 3,500 individual compounds such as nicotine, tobacco alkaloids 
(nornicotine, anatabine, anabasine), polycyclic aromatic hydrocarbons (e.g., benzo(a)pyrene, 
B(a)P), naphthalenes, aromatic amines, phenols, and tobacco-specific nitrosamines. 
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Tobacco-specific nitrosamines are formed during tobacco curing and processing, and 

are suspected of causing lung cancer in humans. In rodent studies, regardless of the where or 
how it is applied, the tobacco-specific nitrosamine known as NNK produces lung adenomas 
and lung adenocarcinomas. The tobacco-specific nitrosamine known as NNAL also produces 
5 lung adenocarcinomas in rodents. 

Many of the chemicals found in cigarette smoke also affect the nonsmoker inhaling 
"secondhand" or sidestream smoke. Indeed, the smoke inhaled by non-smokers has a 
chemical composition similar to the smoke inhaled by smokers, but, importantly, the 
concentrations of the carcinogenic tobacco-specific nitrosamines are present in higher 
10 concentrations in second hand smoke. For this and other reasons, "passive smoking" is an 
important cause of lung cancer, causing as many as 3,000 lung cancer deaths in nonsmokers 
each year. 

In addition to smoking, other factors thought to be causes of lung cancer include on- 
the-job exposure to carcinogens such as asbestos and uranium, exposure to chemical hazards 

15 such as radon, polycyclic aromatic hydrocarbons, chromium, nickel, and inorganic arsenic, 
genetic factors, and diet. 

Histological classification of various lung cancers define the types of cancer that 
begin in the lung. See, e.g., Travis, et al. (1999) Histological Typing of L ung and Pleural 
Tumours (International Histological Classification of Tumours, No 1 . Four major cell types 

20 make up more than 88% of all primary lung neoplasms. These are: squamous or epidermoid 
carcinoma, small cell (also called oat cell) carcinoma, adenocarcinoma, and large cell (also 
called large cell anaplastic) carcinoma. The remainder include undifferentiated carcinomas, 
carcinoids, bronchial gland tumors, and other rarer types. The various cell types have 
different natural histories and responses to therapy, and, thus, a correct histologic diagnosis is 

25 the first step of effective treatment. 

Small cell lung cancer (SCLC) accounts for 18-25% of all lung cancers, and occurs 
less frequently than non-small cell lung cancers, and generally spread to distant organs more 
rapidly than non-small cell lung cancer. In general, at the time of presentation small cell lung 
cancers have already spread beyond the beyond the bounds where surgery and curative intent 

30 can be undertaken. Hoever, if identified early enough, these cancers are often responsive to 
chemotherapy and thoracic radiation treatment. 

Non-small cell lung cancers (NSCLC) are the more frequently occurring form of lung 
cancer. They comprise squamous cell carcinoma, adenocarcinoma, and large cell carcinoma 
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and account for more than 75% of all lung cancers. Non-small cell tumors that are localized 

at the time of presentation can sometimes be cured with surgery and/or radiotherapy, but 

usually are not identified until significant metastasis has occurred, which are typically not 

very responsive to surgical, chemotherapy, or radiation treatment.. 

5 The screening of asymptomatic persons at high risk for lung cancer has often proven 

ineffective. In general, only 5 to 15 percent of lung cancer patients have their disease 

detected while they are asymptomatic. Of course, early detection and treatment are critical 

factors in the fight against lung cancer. The average survival rate is 49% for those whose 

cancer is detected early, before the cancer has spread from the lung. Lung cancer often 

1 0 spreads outside of the lung, and it may have spread to the bones or brain by the time it is 
diagnosed. While the prognosis may be better for lung cancers that are detected early, 
because of the lack ofv effective curative treatments, early detection does not necessarily alter 
the total death rate from lung cancer. 

Thus, methods for diagnosis and prognosis of lung cancer and effective treatment of 

1 5 lung cancer would be desirable. Accordingly, provided herein are methods that can be used 
in diagnosis and prognosis of lung cancer. Further provided are methods that can be used to 
screen candidate therapeutic agents for the ability to modulate, e.g., treat, lung cancer. 
Additionally, provided herein are molecular targets and compositions for therapeutic 
intervention in lung disease and other metastatic cancers. 

20 

SUMMARY OF THE INVENTION 
The present invention provides nucleotide sequences of genes that are up- and down- 
regulated in lung cancer cells. Such genes are useful for diagnostic purposes, and also as 
targets for screening for therapeutic compounds that modulate lung cancer, such as 

25 antibodies. The methods of detecting nucleic acids of the invention or their encoded proteins 
can be used for a number of purposes. Examples include early detection of lung cancers, 
monitoring and early detection of relapse following treatment of lung cancers, monitoring 
response to therapy of lung cancers, determining prognosis of lung cancers, directing therapy 
of lung cancers, selecting patients for postoperative chemotherapy or radiation therapy, 

30 selecting therapy, determining tumor prognosis, treatment, or response to treatment, and early 
detection of precancerous lesions of the lung. Examples of benign or precancerous lesions 
include: atelectasis, emphysema, brochitis, chronic obstructive pulmonary disease, fibrosis, 
hypersensitivity pneumonitis (HP), interstitial pulmonary fibrosis (IPF), asthma, and 
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bronchiectasis. Other aspects of the invention will become apparent to the skilled artisan by 
the following description of the invention. 

In one aspect, the present invention provides a method of detecting a lung cancer- 
associated transcript in a cell from a patient, the method comprising contacting a biological 
5 sample from the patient with a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1A-16. Alternatively, the sample may 
be contacted with a specific binding reagent, e.g., antibody. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at least 
95% identical to a sequence as shown in Tables 1A-16. In another embodiment, the 
10 polynucleotide comprises a sequence as shown in Tables 1A-16. 

In one embodiment, the biological sample is a tissue sample, or a body fluid. In 
another embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label. In 
one embodiment, the polynucleotide is immobilized on a solid surface. In one embodiment, 
15 the patient is undergoing a therapeutic regimen to treat lung cancer. In another embodiment, 
the patient is suspected of having lung cancer. In one embodiment, the patient is a primate, 
e.g., a human. 

In one embodiment, the method further comprises the step of amplifying nucleic acids 
before the step of contacting the biological sample with the polynucleotide. 

20 In another aspect, the present invention provides a method of monitoring the efficacy 

of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a lung cancer-associated transcript in the biological sample by contacting the 
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80% 

25 identical to a sequence as shown in Tables 1 A- 16, thereby monitoring the efficacy of the 
therapy. Or the sample may be evaluated for protein, e.g., contacting the sample with an 
antibody. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated transcript to a level of the lung cancer-associated 
30 transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. Or the sample may be evalated for comparison of protein. 

In another aspect, the present invention provides a method of monitoring the efficacy 
of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
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biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 

the level of a lung cancer-associated antibody in the biological sample by contacting the 

biological sample with a polypeptide encoded by a polynucleotide that selectively hybridizes 

to a sequence at least 80% identical to a sequence as shown in Tables .1 A-16, wherein the 

5 polypeptide specifically binds to the lung cancer-associated antibody, thereby monitoring the 

efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing the 

level of the lung cancer-associated antibody to a level of the lung cancer-associated antibody 

in a biological sample from the patient prior to, or earlier in, the therapeutic treatment. 

10 In another aspect, the present invention provides a method of monitoring the efficacy 

of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a lung cancer-associated polypeptide in the biological sample by contacting the 
biological sample with an antibody, wherein the antibody specifically binds to a polypeptide 

15 encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical 
to a sequence as shown in Tables 1 A- 16, thereby monitoring the efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated polypeptide to a level of the lung cancer-associated 
polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic 

20 treatment. In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1 A- 16. In one embodiment, an 
expression vector or cell comprises the isolated nucleic acid. In one aspect, the present 
invention provides an isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1 A- 16. 

25 In another aspect, the present invention provides an antibody that specifically binds to 

an isolated polypeptide which is encoded by a nucleic acid molecule having polynucleotide 
sequence as shown in Tables 1 A-16. In one embodiment, the antibody is conjugated to an 
effector component, e.g., a fluorescent label, a radioisotope or a cytotoxic chemical. In one 
embodiment, the antibody is an antibody fragment. In another embodiment, the antibody is 

30 humanized. 

In one aspect, the present invention provides a method of detecting lung cancer in a a 
patient, the method comprising contacting a biological sample from the patient with an 
antibody or protein as described herein. 
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In another aspect, the present invention provides a method of detecting antibodies 
specific to a lung cancer gene in a patient, the method comprising contacting a biological 
sample from the patient with a polypeptide encoded by a nucleic acid comprises a sequence 
from Tables 1 A-16. 

5 In another aspect, the present invention provides a method for identifying a compound 

that modulates a lung cancer-associated polypeptide, the method comprising the steps of: (i) 
contacting the compound with a lung cancer-associated polypeptide, the polypeptide encoded 
by a polynucleotide that selectively hybridizes to a sequence at least 80% identical to a 
sequence as shown in Tables 1 A-16; and (ii) determining the functional effect of the 

1 0 compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant. In one 
embodiment, the functional effect is determined by measuring ligand binding to the 

15 polypeptide. 

In another aspect, the present invention provides a method of inhibiting proliferation 
or another critical process of a lung cancer-associated cell to treat lung cancer in a patient, the 
method comprising the step of administering to the subject a therapeutically effective amount 
of a compound identified as described herein. In one embodiment, the compound is an 
20 antibody. 

In another aspect, the present invention provides a drug screening assay comprising 
the steps of: (i) administering a test compound to a mammal having lung cancer or a cell 
isolated therefrom; (ii) comparing the level of gene expression of a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
25 1 A-16 in a treated cell or mammal with the level of gene expression of the polynucleotide in 
a control cell or mammal, wherein a test compound that modulates the level of expression of 
the polynucleotide is a candidate for the treatment of lung cancer. 

In one embodiment, the control is a mammal with lung cancer or a cell therefrom that 
has not been treated with the test compound. In another embodiment, the control is a normal 
30 cell or mammal, or a non-malignant lung disease. 

In another aspect, the present invention provides a method for treating a mammal 
having lung cancer comprising administering a compound identified by the assay described 
herein. 
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In another aspect, the present invention provides a pharmaceutical composition for 

treating a mammal having lung cancer, the composition comprising a compound identified by 

the assay described herein and a physiologically acceptable excipient. 

5 DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the objects outlined above, the present invention provides novel 
methods for diagnosis and treatment of lung disease or cancer, as well as methods for 
screening for compositions which modulate lung cancer. "Treatment, monitoring, detection 
or modulation of lung disease or cancer" includes treatment, monitoring, detection, or 

10 modulation of lung disease in those patients who have lung disease (whether malignant or 
non-malignant, e.g., emphysema, bronchitis, or fibrosis) as well as patients with lung cancers 
in which gene expression from a gene in Tables 1A-16 is increased or decreased, indicating 
that the subject is more likely to have disease. In particular,while these targets are identified 
primarily from lung cancer samples, these same targets are likely to be similarly found in 

15 analyses of other medical conditions. These other conditions may result from similar 
pathological processes which affect similar tissues, e.g., lung cancer, small cell lung 
carcinoma (oat cell carcinoma), non-small cell carcinomas (e.g., squamous cell carcinoma, 
adenocarcinoma, large cell lung carcinoma, carcinoid, granulomatous), fibrosis (idiopathic 
pulmonary fibrosis (IPF), hypersensitivity pneumonitis (HP), interstitial pneumonitis, 

20 nonspecific idiopathic pneumonitis (NSIP)), chronic obstructive pulmonary disease (COPD, 
e.g., emphysema, chronic bronchitis), asthma, bronchiectasis, and esophageal cancer. See, 
e.g., the NCI webpage and USSN 60/347,349 and USSN 60/xxx,xxx (docket LFBR-001-1P, 
filed March 29, 2002), each of which is incorporated herein by reference. The treatment may 
be of lung cancer or related condition itself, or treatment of metastasis. 

25 In particular, identification of markers selectively expressed on these cancers allows 

for use of that expression in diagnostic, prognostic, or therapeutic methods. As such, the 
invention defines various compositions, e.g., nucleic acids, polypeptides, antibodies, and 
small molecule agonists/antagonists, which will be useful to selectively identify those 
markers. For example, therapeutic methods may take the form of protein therapeutics which 

30 use the marker expression for selective localization or modulation of function (for those 
markers which have a causative disease effect), for vaccines, identification of binding 
partners, or antagonism, e.g., using antisense or RNAi. The markers may be useful for 
molecular characterization of subsets of lung diseases, which subsets may actually require 
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very different treatments. Moreover, the markers may also be important in related diseases to 
the specific cancers, e.g., which affect similar tissues in non-malignant diseases, or have 
similar mechanisms of induction/maintenance. Metastatic processes or characteristics may 
also be targeted. Diagnostic and prognostic uses are made available, e.g., to subset related 
5 but distinct diseases, or to determine treatment strategy. The detection methods may be based 
upon nucleic acid, e.g., PCR or hybridization techniques, or protein, e.g., ELISA, imaging, 
IHC, etc. The diagnosis may be qualitative or quantitative, and may detect increases or 
decreases in expression levels. 

Tables 1 A- 16 provide unigene cluster identification numbers for the nucleotide 

10 sequence of genes that exhibit increased or decreased expression in lung cancer samples. The 
tables also provide an exemplar accession number that provides a nucleotide sequence that is 
part of the unigene cluster. In Table 1A, genes marked as 'target 1" or "target 2" are 
particularly useful as therapeutic targets. Genes marked as "target 3" are particularly useful 
as diagnostic markers. Genes marked as "chron" are upregulated in chronically diseased lung 

15 (e.g., emphysema, bronchitis, fibrosis) relative to lung tumors and normal tissue. In certain 
analyses, the ratio for the "chron" category was determined using the 70th percentile of 
chronically diseases lung samples divided by the 90th percentile of normal lung samples. 
The ratio for the targets was determined using the 70th percentile of lung tumor samples 
divided by the 90th percentile of normal lung samples. 

20 

Definitions 

The term "lung cancer protein" or "lung cancer polynucleotide" or "lung cancer- 
associated transcript" refers to nucleic acid and polypeptide polymorphic variants, alleles, 
mutants, and interspecies homologs that: (1) have a nucleotide sequence that has greater than 

25 about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or greater nucleotide sequence identity, 
preferably over a region of over a region of at least about 25, 50, 100, 200, 500, 1000, or 
more nucleotides, to a nucleotide sequence of or associated with a unigene cluster of Tables 
1 A-16; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen 

30 comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a 
unigene cluster of Tables 1A-16, and conservatively modified variants thereof; (3) 
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or 
the complement thereof of Tables 1A-16 and conservatively modified variants thereof; or (4) 
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have an amino acid sequence that has greater than about 60% amino acid sequence identity, 
65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 
or 99% or greater amino sequence identity, preferably over a region of over a region of at 
least about 25, 50, 100, 200, 500, 1000, or more amino acid, to an amino acid sequence 
5 encoded by a nucleotide sequence of or associated with a unigene cluster of Tables 1A-16. A 
polynucleotide or polypeptide sequence is typically from a mammal including, but not 
limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or 
other mammal. A "lung cancer polypeptide" and a "lung cancer polynucleotide," include 
both naturally occurring or recombinant forms. 

10 A "full length" lung cancer protein or nucleic acid refers to a lung cancer polypeptide 

or polynucleotide sequence, or a variant thereof, that contains the elements normally 
contained in one or more naturally occurring, wild type lung cancer polynucleotide or 
polypeptide sequences. The "full length" may be prior to, or after, various stages of post- 
translational processing or splicing, including alternative splicing. 

15 "Biological sample" as used herein is a sample of biological tissue or fluid that 

contains nucleic acids or polypeptides, e.g., of a lung cancer protein, polynucleotide, or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of 
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 

20 archival materials, blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. 
Biological samples also include explants and primary and/or transformed cell cultures 
derived from patient tissues. A biological sample is typically obtained from a eukaryotic 
organism, most preferably a mammal such as a primate, e.g., chimpanzee or human; cow; 
dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or other mammal; or a bird; reptile; 

25 fish. Livestock and domestic animals are of interest. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 

30 methods of the invention in vivo. Archival tissues or materials, having treatment or outcome 
history, will be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic 
acids or polypeptide sequences, refer to two or more sequences or subsequences that are the 
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same or have a specified percentage of amino acid residues or nucleotides that are the same 
(e.g., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
5 measured using, e.g., a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI 
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 
be "substantially identical." This definition also refers to, or may be applied to, the 
complement of a test sequence. The definition also includes sequences that have deletions 
10 and/or insertions, substitutions, and naturally occurring, e.g., polymorphic or allelic variants, 
and man-made variants. As described below, the preferred algorithms can account for gaps 
and the like. Preferably, identity exists over a region that is at least about 25 amino acids or 
nucleotides in length, or more preferably over a region that is 50-100 amino acids or 
nucleotides in length. 

15 For sequence comparison, typically one sequence acts as a reference sequence, to 

which test sequences are compared. When using a sequence comparison algorithm, test and 
reference sequences are entered into a computer, subsequence coordinates are designated, if 
necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 

20 comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
contiguous positions selected from the group consisting typically of from 20 to 600, usually 
about 50 to about 200, more usually about 100 to about 150 in which a sequence may be 

25 compared to a reference sequence of the same number of contiguous positions after the two 
sequences are optimally aligned. Methods of alignment of sequences for comparison are 
well-known in the art. Optimal alignment of sequences for comparison can be conducted, 
e.g., by the local homology algorithm of Smith and Waterman (1981) Adv. Anol. Math. 
2:482, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 

30 48:443, by the search for similarity method of Pearson and Lipman (1 988) Proc. Nat'l. Acad. 
Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
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Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see, 
e.g., Ausubel, et al. (eds. 1995 and supplements) Current Protocols in Molecular Biology . 

Preferred examples of algorithms that are suitable for determining percent sequence 
identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are 
5 described in Altschul, et al. (1977) Nuc. Acids Res. 25:3389-3402 and Altschul, et al. (1990) 
J. Mol. Biol. 215:403-410. BLAST and BLAST 2.0 are used, with the parameters described 
herein, to determine percent sequence identity for the nucleic acids and proteins of the 
invention. Software for performing BLAST analyses is publicly available through the 
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 

10 algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul, et al., supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 

1 5 them. The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 

20 hits in each direction are halted when: the cumulative alignment score falls off by the 

quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 

25 uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M=5, N— 4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff and Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

30 The BLAST algorithm also performs a statistical analysis of the similarity between 

two sequences (see, e.g., Karlin and Altschul (1993) Proc. Nat'l. Acad. Sci. USA 90:5873- 
5787). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match between 
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two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a reference sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably 
less than about 0.01, and most preferably less than about 0.001. Log values may be negative 
5 large numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc. 

An indication that two nucleic acid sequences are substantially identical is that the 
polypeptide encoded by the first nucleic acid is immunologically cross reactive with the 
antibodies raised against the polypeptide encoded by the second nucleic acid. Thus, a 
polypeptide is typically substantially identical to a second polypeptide, e.g., where the two 
10 peptides differ only by conservative substitutions. Another indication that two nucleic acid 
sequences are substantially identical is that the two molecules or their complements hybridize 
to each other under stringent conditions. Yet another indication that two nucleic acid 
sequences are substantially identical is that the same primers can be used to amplify the 
sequences. 

15 A "host cell" is a naturally occurring cell or a transformed cell that contains an 

expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 

20 Collection catalog or web site, www.atcc.org). 

The terms "isolated," "purified," or "biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 

25 chromatography. A protein or nucleic acid that is the predominant species present in a 

preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term '^purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 

30 that the nucleic acid or protein is at least about 85% pure, more preferably at least 95% pure, 
and most preferably at least 99% pure. "Purify" or purification" in other embodiments 
means removing at least one contaminant or component from the composition to be purified. 
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In this sense, purification does not require that the purified compound be homogeneous, e.g., 
100% pure. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which 
5 one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally 
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing 
modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, as well 
as amino acid analogs and amino acid mimetics that function similarly to the naturally 

10 occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and Ophosphoserine. Amino acid analogs refer to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, e.g., an a carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 

1 5 norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 
modified R groups (e.g., norleucine) or modified peptide backbones, but retain some basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refer to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that function similarly to another amino acid. 

20 Amino acids may be referred to herein by either their commonly known three letter 

symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic acid 

25 sequences. With respect to particular nucleic acid sequences, conservatively modified 

variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 

30 most proteins. For instance, the codons GCA, GCC, GCG, and GCU each encode the amino 
acid alanine. Thus, at each position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 
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conservatively modified variations. Every nucleic acid sequence herein which encodes a 

polypeptide also describes silent variations of the nucleic acid. In certain contexts each 

codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and 

TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a 

5 functionally similar molecule. Accordingly, a silent variation of a nucleic acid which 

encodes a polypeptide is implicit in a described sequence with respect to the expression 

product, but not necessarily with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual substitutions, 

deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which 

10 alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded 
sequence is a "conservatively modified variant" where the alteration results in the substitution 
of an amino acid with a chemically similar amino acid. Conservative substitution tables 
providing functionally similar amino acids are well known in the art. Such conservatively 
modified variants are in addition to and do not exclude polymorphic variants, interspecies 

15 homologs, and alleles of the invention. Typically conservative substitutions include for one 
another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine 
(N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine 
(M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine (S), 
Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins (1984)). 

20 Macromolecular structures such as polypeptide structures can be described in terms of 

various levels of organization. For a general discussion of this organization, see, e.g., 
Alberts, et al. (1994) Molecular Biology of the Cell (3 rd ed.) and Cantor and Schimmel (1980) 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules . "Primary 
structure" refers to the amino acid sequence of a particular peptide. "Secondary structure" 

25 refers to locally ordered, three dimensional structures within a polypeptide. These structures 
are commonly known as domains. Domains are portions of a polypeptide that often form a 
compact unit of the polypeptide and are typically 25 to approximately 500 amino acids long. 
Typical domains are made up of sections of lesser organization such as stretches of P-sheet 
and ot-helices. "Tertiary structure" refers to the complete three dimensional structure of a 

30 polypeptide monomer. "Quaternary structure" refers to the three dimensional structure 
formed, usually by the noncovalent association of independent tertiary units. Anisotropic 
terms are also known as energy terms. 
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'"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents 
used herein means at least two nucleotides covalently linked together. Oligonucleotides are 
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up 
to about 100 nucleotides in length. Nucleic acids and polynucleotides are a polymers of any 
5 length, including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, 
etc. A nucleic acid of the present invention will generally contain phosphodiester bonds, 
although in some cases, nucleic acid analogs are included that may have at least one different 
linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O- 
methylphophoroamidite linkages (see Eckstein (1992) Oligonucleotides and Analogues: A 

1 0 Practical Approach Oxford University Press); and peptide nucleic acid backbones and 
linkages. Other analog nucleic acids include those with positive backbones; non-ionic 
backbones, and non-ribose backbones, including those described in U.S. Patent Nos. 
5,235,033 and 5,034,506, and Chapters 6 and 7, in Sanghui and Cook, eds. Carbohydrate 
Modifications in Antisense Research, ASC Symposium Series 580. Nucleic acids containing 

15 one or more carbocyclic sugars are also included within one definition of nucleic acids. 

Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to 
increase the stability and half-life of such molecules in physiological environments or as 
probes on a biochip. Mixtures of naturally occurring nucleic acids and analogs can be made; 
alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring 

20 nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide nucleic 
acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 

25 kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4° C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9° C. 
Similarly, due to their non-ionic nature, hybridization of the bases attached to these 
backbones is relatively insensitive to salt concentration. In addition, PNAs are not degraded 

30 by cellular enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded or single stranded sequence. As will be appreciated by those 
in the art, the depiction of a single strand also defines the sequence of the complementary 
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strand; thus the sequences described herein also provide the complement of the sequence. 

The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the 

nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations 

of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine 

5 hypoxanthine, isocytosine, isoguanine, etc. 'Transcript* ' typically refers to a naturally 

occurring RNA, e.g., a pre-mRNA, hnRNA, or mKNA. As used herein, the term 

"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 

nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 

naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic 

10 acid, each containing a base, are referred to herein as a nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, physiological, chemical, or other physical 
means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 

15 or other entities which can be made detectable, e.g., by incorporating a radiolabel into the 
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be 
incorporated into the cancer nucleic acids, proteins, and antibodies. Many methods known in 
the art for conjugating the antibody to the label may be employed, including those methods 
described by Hunter, et al. (1962) Nature 144:945; David, et al. (1974) Biochemistry 

20 13:1014-1021; Pain, et al. (1981) J. Immunol. Meth., 40:219-230; and Nygren (1982) 1 
Histochem. and Cvtochem. 30:407-412. 

An "effector" or "effector moiety" or "effector component" is a molecule that is 
bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 

25 The "effector" can be a variety of molecules including, e.g., detection moieties including 

radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope 
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a 
radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 

30 covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 

Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 
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using high affinity interactions may achieve the same results where one of a pair of binding 

partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is a nucleic acid capable of 

binding to a target nucleic acid of complementary sequence through one or more types of 

5 chemical bonds, usually through complementary base pairing, e.g., through hydrogen bond 

formation. As used herein, a probe may include natural (i.e., A, G, C, or T) or modified bases 

(7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a 

linkage other than a phosphodiester bond, preferably one that does not functionally interfere 

with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the constituent 

1 0 bases are joined by peptide bonds rather than phosphodiester linkages. Probes may bind 

target sequences lacking complete complementarity with the probe sequence depending upon 
the stringency of the hybridization conditions. The probes are preferably directly labeled, 
e.g., with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled, e.g., with 
biotin to which a streptavidin complex may later bind By assaying for the presence or 

15 absence of the probe, one can detect the presence or absence of the select sequence or 

subsequence. Diagnosis or prognosis may be based at the genomic level, or at the level of 
RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by 

20 the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant 
cells express genes that are not found within the native (non-recombinant) form of the cell or 
express native genes that are otherwise abnormally expressed, under expressed or not 
expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 

25 originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 

polymerases and endonucleases, in a form not normally found in nature. In this manner, 
operably linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 

30 understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
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recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term heterologous" when used with reference to portions of a nucleic acid 
5 indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two 
10 or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is typically an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 
sequences near the start site of transcription, such as, in the case of a polymerase II type 

15 promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 

20 to a functional linkage between a nucleic acid expression control sequence (such as a 

promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
e.g., wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 

25 synthetically, with a series of specified nucleic acid elements that permit transcription of a 

particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed in operable linkage to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 

30 duplexing, or hybridizing of a molecule selectively to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
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The phrase "stringent hybridization conditions" refers to conditions under which a 

probe will hybridize to its target subsequence, typically in a complex mixture of nucleic 

acids, but to essentially no other sequences. Stringent conditions are sequence-dependent and 

will be different in different circumstances. Longer sequences hybridize specifically at 

5 higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 

"Overview of principles of hybridization and the strategy of nucleic acid assays" in Tijssen 

(1993) Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic 

Probes (vol. 24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C 

lower than the thermal melting point (T m ) for the specific sequence at a defined ionic strength 

10 pH. The T m is the temperature (under defined ionic strength, pH, and nucleic concentration) 
at which 50% of the probes complementary to the target hybridize to the target sequence at 
equilibrium (as the target sequences are present in excess, at T^ 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is 
less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or 

15 other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for short probes (e.g., 
10 to 50 nucleotides) and at least about 60° C for long probes (e.g., greater than 50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. For selective or specific hybridization, a positive signal is 
typically at least two times background, preferably 10 times background hybridization. 

20 Exemplary stringent hybridization conditions are often: 50% formamide, 5x SSC, and 1% 
SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at 65° C, with wash in 0.2x SSC, 
and 0.1% SDS at 65° C. For PCR, a temperature of about 36° C is typical for low stringency 
amplification, although annealing temperatures may vary between about 32° C and 48° C 
depending on primer length. For high stringency PCR amplification, a temperature of about 

25 62° C is typical, although high stringency annealing temperatures can range from about 50° C 
to about 65° C, depending on the priijier length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90° C - 95° C for 
0.5 - 2 min., an annealing phase lasting 0.5 - 2 min., and an extension phase of about 72° C 
for 1 - 2 min. Protocols and guidelines for low and high stringency amplification reactions 

30 are provided, e.g., in Innis, et al.(1990) PCR Protoco ls. A Guide to Methods and 
Applications. 

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
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occurs, e.g., when a copy of a nucleic acid is created using the max im um codon degeneracy 

permitted by the genetic code. In such cases, the nucleic acids typically hybridize under 

moderately stringent hybridization conditions. Exemplary "moderately stringent 

hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 

5 1 % SDS at 37° C, and a wash in IX SSC at 45° C. A positive hybridization is at least twice 

background. Alternative hybridization and wash conditions can be utilized to provide 

conditions of similar stringency. Additional-guidelines for determining hybridization 

parameters are provided in numerous reference, e.g., Ausubel, et al. (ed.) Current Pr otocols in 

Molecular Biology Lippincott. 

1 o The phrase "functional effects" in the context of assays for testing compounds that 

modulate activity of a lung cancer protein includes the determination of a parameter that is 
indirectly or directly under the influence of the lung cancer protein or nucleic acid, e.g., a 
physiological, enzymatic, functional, physical, or chemical effect, such as the ability to 
decrease lung cancer. It includes ligand binding activity; cell viability, cell growth on soft 

15 agar; anchorage dependence; contact inhibition and density limitation of growth; cellular 
proliferation; cellular transformation; growth factor or serum dependence; tumor specific 
marker levels; invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and 
protein expression in cells undergoing metastasis, and other characteristics of lung cancer 
cells. "Functional effects" include in vitro, in vivo, and ex vivo activities. 

20 By "determining the functional effect" is meant assaying for a compound that 

increases or decreases a parameter that is indirectly or directly under the influence of a lung 
cancer protein sequence, e.g., physiological, functional, enzymatic, physical, or chemical 
effects. Such functional effects can be measured by many means known to those skilled in 
the art, e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance, 

25 refractive index), hydrodynamic (e.g., shape), chromatographic, or solubility properties for 
the protein, measuring inducible markers or transcriptional activation of the lung cancer 
protein; measuring binding activity or binding assays, e.g., binding to antibodies or other 
ligands, and measuring cellular proliferation. Determination of the functional effect of a 
compound on lung cancer can also be performed using lung cancer assays known to those of 

30 skill in the art such as an in vitro assays, e.g., cell growth on soft agar; anchorage 

dependence; contact inhibition and density limitation of growth; cellular proliferation; 
cellular transformation; growth factor or serum dependence; tumor specific marker levels; 
invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein 
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expression in cells undergoing metastasis, and other characteristics of lung cancer cells. The 
functional effects can be evaluated by many means known to those skilled in the art, e.g., 
microscopy for quantitative or qualitative measures of alterations in morphological features, 
measurement of changes in RNA or protein levels for lung cancer-associated sequences, 
5 measurement of RNA stability, identification of downstream or reporter gene expression 
(CAT, luciferase, p-gal, GFP, and the like), e.g., via chemiluminescence, fluorescence, 
colorimetric reactions, antibody binding, inducible markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of lung cancer polynucleotide and 
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or 

10 compounds identified using in vitro and in vivo assays of lung cancer polynucleotide and 

polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of lung cancer proteins, e.g., antagonists. Antisense or inhibitory 
nucleic acids may seem to inhibit expression and subsequent function of the protein. 

15 "Activators" are compounds that increase, open, activate, facilitate, enhance activation, 
sensitize, agonize, or up regulate lung cancer protein activity. Inhibitors, activators, or 
modulators also include genetically modified versions of lung cancer proteins, e.g., versions 
with altered activity, as well as naturally occurring and synthetic ligands, antagonists, 
agonists, antibodies, small chemical molecules and the like. Such assays for inhibitors and 

20 activators include, e.g., expressing the lung cancer protein in vitro, in cells, or cell 

membranes, applying putative modulator compounds, and then determining the functional 
effects on activity, as described above. Activators and inhibitors of lung cancer can also be 
identified by incubating lung cancer cells with the test compound and determining increases 
or decreases in the expression of 1 or more lung cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 

25 25, 30, 40, 50 or more lung cancer proteins, such as lung cancer proteins encoded by the 
sequences set out in Tables 1A-16. 

Samples or assays comprising lung cancer proteins that are treated with a potential 
activator, inhibitor, or modulator are compared to control samples without the inhibitor, 
activator, or modulator to examine the extent of inhibition. Control samples (untreated with 

30 inhibitors) are assigned a relative protein activity value of 100%. Inhibition of a polypeptide 
is achieved when the activity value relative to the control is about 80%, preferably 50%, more 
preferably 25-0%. Activation of a lung cancer polypeptide is achieved when the activity 
value relative to the control (untreated with activators) is 1 10%, more preferably 150%, more 
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preferably 200-500% (i.e., two to five fold higher relative to the control), more preferably 
1000-3000% higher. 

The phrase "changes in cell growth" refers to any change in cell growth and 
proliferation characteristics in vitro or in vivo, such as cell viability, formation of foci, 
5 anchorage independence, semi-solid or soft agar growth, changes in contact inhibition and 
density limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 
ability to form or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell. See, e.g., Freshney (1994) Culture of Animal Cell s a Manual of 

10 Basic Technique pp. 231-241 (3 rd ed.). 

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," 'transformed" cells, or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise from infection with a transforming virus 

15 and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 

spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 
aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney 
(1994) Culture of Animal Cells a Manual of Basic Technique (3 rd ed.)). 

20 "Antibody refers to a polypeptide comprising a framework region from an 

immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 

25 gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD, and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 
Fundamental Immunology . 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each 

30 tetramer is composed of two identical pairs of polypeptide chains, each pair having one 
"light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each 
chain defines a variable region of about 100 to 110 or more amino acids primarily responsible 
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for antigen recognition. The terms variable light chain (Vl) and variable heavy chain (V h) 
refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized 
fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an 
5 antibody below the disulfide linkages in the hinge region to produce F(ab)' 2 , a dimer of Fab 
which itself is a light chain joined to V H -C H 1 by a disulfide bond. The F(ab)' 2 may be 
reduced under mild conditions to break the disulfide linkage in the hinge region, thereby 
converting the F(ab)' 2 dimer into an Fab' monomer. The Fab' monomer is essentially Fab 
with part of the hinge region (see Paul (ed. 1999) Fundamental Immunology (4th ed.). While 

10 various antibody fragments are defined in terms of the digestion of an intact antibody, one of 
skill will appreciate that such fragments may be synthesized de novo either chemically or by 
using recombinant DNA methodology. Thus, the term antibody, as used herein, also includes 
antibody fragments either produced by the modification of whole antibodies, or those 
synthesized de novo using recombinant DNA methodologies (e.g., single chain Fv) or those 

15 identified using phage display libraries (see, e.g., McCafferty, et al. (1990) Nature 348:552- 
554). 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many technique known in the art can be used (see, e.g., Kohler and Milstein 
(1975) Nature 256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al. 

20 (1985), pp. 77-96 in Monoclonal Antibodies and Cancer Therapy : Coligan (1991 and 

supplements) Current Protocols in Immunology: Harlow and Lane (1988) Antibodies, A 
Laboratory Manual : and Goding (1986) Monoclonal Antibodies: Principl es and Practice (2d 
ed.)). Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can 
be adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or 

25 other organisms such as other mammals, may be used to express humanized antibodies. 

Alternatively, phage display technology can be used to identify antibodies and heteromeric 
Fab fragments that specifically bind to selected antigens (see, e.g., McCafferty, et al. (1990) 
Nature 348:552-554; Marks, et al. (1992) Biotechnology 10:779-783). 

A "chimeric antibody*' is an antibody molecule in which, e.g, (a) the constant region, 

30 or a portion thereof, is altered, replaced, or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector 
function, and/or species, or an entirely different molecule which confers new properties to the 
chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the 
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variable region, or a portion thereof, is altered, replaced, or exchanged with a variable region 

having a different or altered antigen specificity. 



Identification of lung cancer-associated sequences 

5 In one aspect, the expression levels of genes are determined in different patient 

samples for which diagnosis information is desired, to provide expression profiles. An 
expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 

10 characteristic of the state of the cell. That is, normal tissue may be distinguished from 
cancerous or metastatic cancerous tissue, or metastatic cancerous tissue can be compared 
with tissue from surviving cancer patients. By comparing expression profiles of tissue in 
known different lung cancer states, information regarding which genes are important 
(including both up- and down-regulation of genes) in each of these states is obtained. 

1 5 Molecular profiling may distinguish subtypes of a currently collective disease designation, 
e.g., different forms of lung cancer (chronic disease, adenocarcinoma, etc.) 

The identification of sequences that are differentially expressed in lung cancer versus 
non-lung cancer tissue allows the use of this information in a number of ways. For example, 
a particular treatment regime may be evaluated: does a chemotherapeutic drug act to down- 

20 regulate lung cancer, and thus tumor growth or recurrence, in a particular patient. 

Alternatively, a treatment step may induce other markers which may be used as targets to 
destroy tumor cells. Similarly, diagnosis and treatment outcomes may be done or confirmed 
by comparing patient samples with the known expression profiles. Malignant diseasemay be 
compared to non-malignant conditions. Metastatic tissue can also be analyzed to determine 

25 the stage of lung cancer in the tissue, or origin of primary tumor, e.g., metastasis from a 

remote primary site. Furthermore, these gene expression profiles (or individual genes) allow 
screening of drug candidates with an eye to mimicking or altering a particular expression 
profile; e.g., screening can be done for drugs that suppress the lung cancer expression profile. 
This may be done by making biochips comprising sets of the important lung cancer genes, 

30 which can then be used in these screens. PCR methods may be applied with selected primer 
pairs, and analysis may be of RNA or of genomic sequences. These methods can also be 
done on the protein basis; that is, protein expression levels of the lung cancer proteins can be 
evaluated for diagnostic purposes or to screen candidate agents. In addition, the lung cancer 
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nucleic acid sequences can be administered for gene therapy purposes, including the 
administration of antisense nucleic acids, or the lung cancer proteins (including antibodies 
and other modulators thereof) administered as therapeutic drugs or as protein or DNA 
vaccines. 

5 Thus the present invention provides nucleic acid and protein sequences that are 

differentially expressed in lung cancer relative to normal tissues and/or non-malignant lung 
disease, or in different types of lung disease, herein termed "lung cancer sequences." As 
outlined below, lung cancer sequences include those that are up-regulated (i.e., expressed at a 
higher level) in lung cancer, as well as those that are down-regulated (i.e., expressed at a 

10 lower level). In a preferred embodiment, the lung cancer sequences are from humans; 
however, as will be appreciated by those in the art, lung cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, other lung 
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats, 
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 

15 horses, etc.) and pets (dogs, cats, etc.). Lung cancer sequences from other organisms may be 
obtained using the techniques outlined below. 

Lung cancer sequences can include both nucleic acid and amino acid sequences. As 
will be appreciated by those in the art and is more fully outlined below, lung cancer nucleic 
acid sequences are useful in a variety of applications, including diagnostic applications, 

20 which will detect naturally occurring nucleic acids, as well as screening applications; e.g., 
biochips comprising nucleic acid probes or PCR microtiter plates with selected probes to the 
lung cancer sequences can be generated. 

A lung cancer sequence can be initially identified by substantial nucleic acid and/or 
amino acid sequence homology to the lung cancer sequences outlined herein. Such 

25 homology can be based upon the overall nucleic acid or amino acid sequence, and is 

generally determined as outlined below, e.g., using homology programs or hybridization 
conditions. 

For identifying lung cancer-associated sequences, the lung cancer screen typically 
includes comparing genes identified in different tissues, e.g., normal and cancerous tissues, 
30 cancer and non-malignant conditions, non-malignant conditions and normal tissues, or tumor 
tissue samples from patients who have metastatic disease vs. non metastatic tissue. Other 
suitable tissue comparisons include comparing lung cancer samples with metastatic cancer 
samples from other cancers, such as, breast, other gastrointestinal cancers, prostate, ovarian, 
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etc. Samples of, non metastatic disease tissue and tissue undergoing metastasis are applied to 

biochips comprising nucleic acid probes. The samples are first microdissected, if applicable, 
and treated as is known in the art for the preparation of mRNA. Suitable biochips are 
commercially available, e.g., from Affymetrix, Santa Clara, CA. Gene expression profiles as 
5 described herein are generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, preferably normal 
lung, but also including, and not limited to colon, heart, brain, liver, breast, kidney, muscle, 
prostate, small intestine, large intestine, spleen, bone, and/or placenta. In a preferred 
1 0 embodiment, those genes identified during the lung cancer screen that are expressed in 
significant amounts in other tissues (e.g., essential organs) are removed from the profile, 
although in some embodiments, this is not necessary (e.g., where organs may be dispensible 
at a later stage of life). That is, when screening for drugs, it is usually preferable that the 
target expression be disease specific, to minimize possible side effects on other organs. 
15 In a preferred embodiment, lung cancer sequences are those that are up-regulated in 

lung cancer; that is, the expression of these genes is higher in cancerous tissue than in normal 
lung or other tissue. "Up-regulation" as used herein means, when the ratio is presented as a 
number greater than one, that the ratio is greater than one, preferably 1 .5 or greater, more 
preferably 2.0 or greater. Another embodiment is directed to sequences up-regulated in non- 
20 malignant conditions relative to normal. Unigene cluster identification numbers and 

accession numbers herein are for the GenBank sequence database and the sequences of the 
accession numbers are hereby expressly incorporated by reference. GenBank is known in the 
art, see, e.g., Benson, DA, et al (1998) Nucleic Acids Research 26:1-7 and 
http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g., 
25 European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
Another embodiment is directed to sequences up-regulated in non-malignant conditions 
relative to normal. In some situations, the sequences may be derived from assembly of 
available sequences or be predicted from genomic DNA using exon prediction algorithms, 
such as FGENESH (Salamov and Solovyev (2000) Genome Res. 10:516-522). In other 
30 situations, sequences have been derived from cloning and sequencing of isolated nucleic 
acids. 

In another preferred embodiment, lung cancer sequences are those that are dojvn- 
regulated in the lung cancer; that is, the expression of these genes is lower in cancerous tissue 
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or normal lung or other tissue. *T>own-regulation" as used herein means, when the ratio is 

presented as a number greater than one, that the ratio is greater than one, preferably 1 .5 or 

greater, more preferably 2.0 or greater, or, when the ratio is presented as a number less than 

one, that the ratio is less than one, preferably 0.5 or less, more preferably 0.25 or less. 

5 

Informatics 

The ability to identify genes that are over or under expressed in lung cancer can 
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas 
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 

1 0 biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with lung cancer. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation (see Anderson (1998) Pharmaceutical Proteomics: Targets, 
Mechanism, and Function, paper presented at the EBC Proteomics conference, Coronado, CA 

15 (June 1 1-12, 1998)). Subcellular toxicological information can also be utilized in a biological 
sensor device to predict the likely toxicological effect of chemical exposures and likely 
tolerable exposure thresholds (see U.S. Patent No. 5,8 1 1,23 1). Similar advantages accrue 
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 
saccharides, lipids, drugs, and the like). 

20 Thus, in another embodiment, the present invention provides a database that includes 

at least one set of assay data. The data contained in the database is acquired, e.g., using array 
analysis either singly or in a library format. The database can be in a form in which data can 
be maintained and transmitted, but is preferably an electronic database. The electronic 
database of the invention can be maintained on any electronic device allowing for the storage 

25 of and access to the database, such as a personal computer, but is preferably distributed on a 
wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent to those of skill in the art that similar databases 
can be assembled for assay data acquired using an assay of the invention. 

30 The compositions and methods for identifying and/or quantitating the relative and/or 

absolute abundance of a variety of molecular and macromolecular species from a biological 
sample representing lung cancer, i.e., the identification of lung cancer-associated sequences 
described herein, provide an abundance of information, which can be correlated with 
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pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, gene- 
disease causal linkages, identification of correlates of immunity and physiological status, 
among others. Although the data generated from the assays of the invention is suited for 
manual review and analysis, in a preferred embodiment, data processing using high-speed 
5 computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is known 
in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database 
system for storing biomolecular sequence information in a manner that allows sequences to 
be catalogued and searched according to one or more protein function hierarchies. U.S. 

1 0 Patent 5,953,727 discloses a relational database having sequence records containing 
information in a format that allows a collection of partial-length DNA sequences to be 
catalogued and searched according to association with one or more sequencing projects for 
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent 
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 

1 5 sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 

20 dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails die consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 

25 fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 

See also Mount, et al. (2001) Bioinformatics; Durbin, et al. (eds., 1999) Biological 
Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids (; Baxevanis and 
Oeullette (eds., 1998) Bioinformatics: A Practical Guide to the Analysi s of Genes and 

30 Proteins^ : Rashidi and Buehler (1999) Bioinformatics: Basic Applicatio ns in Biological 
Science and Medicine : Setubal, et al. (eds 1997) Introduction to Compu tational Molecular 
Biology : Misener and Krawetz (eds, 2000) Bioinformatics: Metho ds and Protocols: Higgins 
and Taylor (eds., 2000) Bioinformatics: Sequence. Structure, and Datab anks: A Practical 
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Approach: Brown (2001) Bioinfonnatics: A Biologist's Guide to Biocomputing and the 
Internet : Han and Kamber (2000) Data Mining: Concepts and Techniques (2000); and 
Waterman (1995) Introduction to Computational Biology: Maps. Sequences, and Genomes . 
The present invention provides a computer database comprising a computer and 
5 software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample 
is from a control tissue sample known to be free of pathological disorders. In a variation, at 

10 least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or 
another tissue specimen to be analyzed for lung cancer. In another variation, the assay 
records cross-tabulate one or more of the following parameters for each target species in a 
sample: (1) a unique identification code, which can include, e.g., a target molecular structure 
and/or characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample 

15 source; and (3) absolute and/or relative quantity of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of target data 
in a computer data storage apparatus, which can include magnetic disks, optical disks, 
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic 
bubble memory devices, and other data storage devices, including CPU registers and on-CPU 

20 data storage arrays. Typically, the target data records are stored as a bit pattern in an array of 
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate 
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor 
and a charge storage area, which may be on the transistor). In one embodiment, the invention 
provides such storage devices, and computer systems built therewith, comprising a bit pattern 

25 encoding a protein expression fingerprint record comprising unique identifiers for at least 10 
target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a 
method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 

30 or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
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be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 

determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible 

(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux, 

5 SunOS, Solaris, ADC, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed, 

Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention 

in a file format suitable for retrieval and processing in a computerized sequence analysis, 

comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices 

10 linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN 

line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at 
least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic 
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells) 
composing a bit pattern encoding data acquired from an assay of the invention. 

15 The invention also provides a method for transmitting assay data that includes 

generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

20 In a preferred embodiment, the invention provides a computer system for comparing a 

query target to a database containing an array of data structures, such as an assay result 
obtained by the method of the invention, and ranking database targets based on the degree of 
identity and gap weight to the target data. A central processor is preferably initialized to load 
and execute the computer program for alignment and/or comparison of the assay results. 

25 Data for a query target is entered into the central processor via an I/O device. Execution of 
the computer program results in the central processor retrieving the assay data from the data 
file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary 
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 

30 SDRAM). Targets are ranked according to the degree of correspondence between a selected 
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of 
the query target and results are output via an I/O device. For example, a central processor 
can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC, 
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MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain 

molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a 

data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM, 

SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can 

5 be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal 

adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O 

device. 

The invention also preferably provides the use of a computer system, such as that 
described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
10 collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

1 5 Characteristics of lung cancer-associated proteins 

Lung cancer proteins of the present invention may be classified as secreted proteins, 
transmembrane proteins or intracellular proteins. In one embodiment, the lung cancer protein 
is an intracellular protein. Intracellular proteins may be found in the cytoplasm and/or in the 
nucleus. Intracellular proteins are involved in all aspects of cellular function and replication 

20 (including, e.g., signaling pathways); aberrant expression of such proteins often results in 
unregulated or disregulated cellular processes (see, e.g., Alberts (ed. 1994) Molecular 
Biology of the Cell (3d ed.). For example, many intracellular proteins have enzymatic 
activity such as protein kinase activity, protein phosphatase activity, protease activity, 
nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins also serve 

25 as docking proteins that are involved in organizing complexes of proteins, or targeting 

proteins to various subcellular localizations, and are involved in maintaining the structural 
integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the 
proteins of one or more structural motifs for which defined functions have been attributed. In 

30 addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
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domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 
targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
5 one of ordinary skill in the art, these motifs can be identified on the basis of amino acid 
sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of multiple 
sequence alignments and hidden Markov models covering many common protein domains. 
10 Versions are available via the internet from Washington University in St. Louis, the Sanger 
Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman, et al (2000) 
Nuc. Acids Res. 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al. 
(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320- 
322). 

15 In another embodiment, the lung cancer sequences are transmembrane proteins. 

Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may 
have an intracellular domain, an extracellular domain, or both. The intracellular domains of 
such proteins may have a number of functions including those already described for 
intracellular proteins. For example, the intracellular domain may have enzymatic activity 

20 and/or may serve as a binding site for additional proteins. Frequently the intracellular 

domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

25 Transmembrane proteins may contain from one to many transmembrane domains. 

For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases 
and receptor serine/threonine protein kinases contain a single transmembrane domain. 
However, various other proteins including channels, pumps, and adenylyl cyclases contain 
numerous transmembrane domains. Many important cell surface receptors such as G protein 

30 coupled receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they 
contain 7 membrane spanning regions. Characteristics of transmembrane domains include 
approximately 17 consecutive hydrophobic amino acids that may be followed by charged 
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the 
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localization and number of transmembrane domains within the protein may be predicted (see, 

e.g., PSORT web site http://psort.nibb.ac.jp/). 

The extracellular domains of transmembrane proteins are diverse; however, conserved 
motifs are found repeatedly among various extracellular domains. Conserved structure 
5 and/or functions have been ascribed to different extracellular motifs. Many extracellular 
domains are involved in binding to other molecules. In one aspect, extracellular domains are 
found on receptors. Factors that bind the receptor domain include circulating ligands, which 
may be peptides, proteins, or small molecules such as adenosine and the like. For example, 
growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their 

10 cognate receptors to initiate a variety of cellular responses. Other factors include cytokines, 
mitogenic factors, hormones, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they may mediate cell-cell interactions. 
Cell-associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol 
(GPT) anchor, or may themselves be transmembrane proteins. Extracellular domains may 

1 5 also associate with the extracellular matrix and contribute to the maintenance of the cell 
structure. 

Lung cancer proteins that are transmembrane are particularly preferred in the present 
invention as they are readily accessible targets for extracellular immunotherapeutics, as are 
described herein. In addition, as outlined below, transmembrane proteins can be also useful 

20 in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
situ or in histological analysis. Alternatively, antibodies can also label intracellular proteins, 
in which case analytical samples are typically permeablized to provide access to intracellular 
proteins. In addition, some membrane proteins can be processed to release a soluble protein, 
or to expose a residual fragment. Released soluble proteins may be useful diagnostic 

25 markers, processed residual protein fragments may be useful lung markers of disease. 

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

30 In another embodiment, the lung cancer proteins are secreted proteins; the secretion of 

which can be either constitutive or regulated. These proteins may have a signal peptide or 
signal sequence that targets the molecule to the secretory pathway. Secreted proteins are 
involved in numerous physiological events; e.g., if circulating, they often serve to transmit 
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signals to various other cell types. The secreted protein may function in an autocrine manner 

(acting on the cell that secreted the factor), a paracrine manner (acting on cells in close 

proximity to the cell that secreted the factor), an endocrine manner (acting on cells at a 

distance, e.g., secretion into the blood stream), or exocrine (secretion, e.g., through a duct or 

5 to adjacent epithelial surface as sweat glands, sebaceous glands, pancreatic ducts, lacrimal 

glands, mammary glands, sax producing glands of the ear, etc.). Thus secreted molecules 

often find use in modulating or altering numerous aspects of physiology. Lung cancer 

proteins that are secreted proteins are particularly preferred in the present invention as they 

serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, or stool tests. 

10 Those which are enzymes may be antibody or small molecule targets. Others may be useful 

as vaccine targets, e.g., via CTL mechanisms. 

Use of lung cancer nucleic acids 

As described above, lung cancer sequence is initially identified by substantial nucleic 
15 acid and/or amino acid sequence homology or linkage to the lung cancer sequences outlined 
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence, 
and is generally determined as outlined below, using either homology programs or 
hybridization conditions. Typically, linked sequences on a mRNA are found on the same 
molecule. 

20 The lung cancer nucleic acid sequences of the invention, e.g., the sequences in Tables 

1A-16, can be fragments of larger genes, i.e., they are nucleic acid segments. "Genes" in this 
context includes coding regions, non-coding regions, and mixtures of coding and non-coding 
regions. Accordingly, as will be appreciated by those in the art, using the sequences provided 
herein, extended sequences, in either direction, of the lung cancer genes can be obtained, 

25 using techniques well known in the art for cloning either longer sequences or the full length 
sequences; see Ausubel, et al., supra. Much can be done by informatics and many sequences 
can be clustered to include multiple sequences corresponding to a single gene, e.g., systems 
such as UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/). 

Once a lung cancer nucleic acid is identified, it can be cloned and, if necessary, its 

30 constituent parts recombined to form the entire lung cancer nucleic acid coding regions or the 
entire mRNA sequence. Once isolated from its natural source, e.g., contained within a 
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the 
recombinant lung cancer nucleic acid can be further-used as a probe to identify and isolate 
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other lung cancer nucleic acids, e.g., extended coding regions. It can also be used as a 

precursor" nucleic acid to make modified or variant lung cancer nucleic acids and proteins. 

The lung cancer nucleic acids of the present invention are used in several ways. In a 

first embodiment, nucleic acid probes to the lung cancer nucleic acids are made and attached 

5 to biochips to be used in screening and diagnostic methods, as outlined below, or for 

administration, e.g., for gene therapy, RNAi, vaccine, and/or antisense applications. 

Alternatively, the lung cancer nucleic acids that include coding regions of lung cancer 

proteins can be put into expression vectors for the expression of lung cancer proteins, again 

for screening purposes or for administration to a patient. 

10 In a preferred embodiment, nucleic acid probes to lung cancer nucleic acids (both the 

nucleic acid sequences outlined in the figures and/or the complements thereof) are made. 
The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the lung cancer nucleic acids, i.e., the target sequence (either the target 
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 

15 hybridization of the target sequence and the probes of the present invention occurs. As 

outlined below, this complementarity need not be perfect; there may be any number of base 
pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 

20 conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under appropriate reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single and 

25 partially double stranded. The strandedness of the probe is dictated by the structure, 

composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally 
complements of ORFs or whole genes are not used. In some embodiments, nucleic acids of 

30 lengths up to hundreds of bases can be used. 

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, 
three, four or more probes, with three being preferred, are used to build in a redundancy for a 
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particular target. The probes can be overlapping (i.e., have some sequence in common), or 

separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 

immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 

5 equivalents herein is meant the association or binding between the nucleic acid probe and the 

solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 

removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 

covalent binding" and grammatical equivalents herein is typically meant one or more of 

electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is 

10 the covalent attachment of a molecule, such as, streptavidin to the support and the non- 
covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and 
grammatical equivalents herein is meant that the two moieties, the solid support and the 
probe, are attached by at least one bond, including sigma bonds, pi bonds and coordination 
bonds. Covalent bonds can be formed directly between the probe and the solid support or can 

15 be formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to a biochip in a wide variety of ways, as will be 
appreciated by those in the art. As described herein, the nucleic acids can either be 

20 synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or 
other grammatical equivalents herein is meant a material that can be modified for the 
attachment or association of the nucleic acid probes and is amenable to at least one detection 

25 method. Often the substrate may contain discrete individual sites appropriate for ndivitual 
partitioning and identification. As will be appreciated by those in the art, the number of 
possible substrates are very large, and include, but are not limited to, glass and modified or 
functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and 
other materials, polypropylene, polyethylene, polybutylene, polyurethanes, Teflon, etc.), 

30 polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including 
silicon and modified silicon, carbon, metals, inorganic glasses, plastics, etc. In general, the 
substrates allow optical detection and do not appreciably fluoresce. A preferred substrate is 
described in US application entitled Reusable Low Fluorescent Plastic Biochip, U.S. 



36 



WO 02/086443 PCT/US02/12476 
Application Serial No. 09/270,214, filed March 15, 1999, herein incorporated by reference in 

its entirety. 

Generally the substrate is planar, although as will be appreciated by those in the art, 
other configurations of substrates may be used as well. For example, the probes may be 
5 placed on the inside surface of a tube, for flow-through sample analysis to minimize sample 
volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed 
cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 

10 the biochip is derivatized with a chemical functional group including, but not limited to, 
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g., 

15 homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company 
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized, and then attached to the surface 
20 of the solid support. Either the 5' or 3* terminus may be attached to the solid support, or 

attachment may be via linkage to an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very strong, 

yet non-coValent. For example, biotinylated oligonucleotides can be made, which bind to 

surfaces covalently coated with streptavidin, resulting in attachment. 
25 Alternatively, the oligonucleotides may be synthesized on the surface, as is known in 

the art. For example, photoactivation techniques utilizing photopolymerization compounds 

and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in 

situ, using known photolithographic techniques, such as those described in WO 95/251 16; 

WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited within, all of 
30 which are expressly incorporated by reference; these methods of attachment form the basis of 

the Affymetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression level of 

lung cancer-associated sequences. These assays are typically performed in conjunction with 



37 



WO 02/086443 PCT/US02/12476 

reverse transcription. In such assays, a lung cancer-associated nucleic acid sequence acts as a 
template in an amplification reaction (e.g., Polymerase Chain Reaction, or PCR). In a 
quantitative amplification, the amount of amplification product will be proportional to the 
amount of template in the original sample. Comparison to appropriate controls provides a 
5 measure of the amount of lung cancer-associated RNA. Methods of quantitative 

amplification are well known to those of skill in the art. Detailed protocols for quantitative 
PCR are provided, e.g., in Innis, et al. (1990) PCR Protoco ls. A Guide to Methods and 
Applications . 

In some embodiments, a TaqMan based assay is used to measure expression. 

10 TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3 ' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 

1 5 quenching agent, thereby resulting in an increase in fluorescence as a function of 

amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase chain 
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560, Landegren, et al. (1988) 
Science 241:1077, and Barringer, et al. (1990) Gene 89:117), transcription amplification 

20 (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1 173), self-sustained sequence 

replication (Guatelli, et al. (1990) Proc. Nat. Acad. Sci. USA 87:1874), dot PCR, and linker 
adapter PCR, etc. 

Expression of lung cancer proteins from nucleic acids 

25 In a preferred embodiment, lung cancer nucleic acids, e.g., encoding lung cancer 

proteins, are used to make a variety of expression vectors to express lung cancer proteins 
which can then be used in screening assays, as described below. Expression vectors and 
recombinant DNA technology are well known to those of skill in the art (see, e.g., Ausubel, 
supra, and Fernandez and Hoeffler (eds 1999) Gene Expression Systems) and are used to 

30 express proteins. The expression vectors may be either self-replicating extrachromosomal 
vectors or vectors which integrate into a host genome. Generally, these expression vectors 
include transcriptional and translational regulatory nucleic acid operably linked to the nucleic 
acid encoding the lung cancer protein. The term "control sequences" refers to DNA 
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sequences used for the expression of an operably linked coding sequence in a particular host 

organism. Control sequences that are suitable for prokaryotes, e.g., include a promoter, 

optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known to 

utilize promoters, polyadenylation signals, and enhancers. 

5 Nucleic acid is "operably linked" when it is placed into a functional relationship with 

another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 

operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in 

the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding 

sequence if it affects the transcription of the sequence; or a ribosome binding site is operably 

10 linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 

"operably linked" means that the DNA sequences being linked are contiguous, and, in the 
case of a secretory leader, contiguous and in reading phase. However, enhancers do not have 
to be contiguous. Linking is typically accomplished by ligation at convenient restriction 
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in 

1 5 accordance with conventional practice. Transcriptional and translational regulatory nucleic 
acid will generally be appropriate to the host cell used to express the lung cancer protein. 
Numerous types of appropriate expression vectors, and suitable regulatory sequences are 
known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, but are 

20 not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 
sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start 
and stop sequences. 

Promoter sequences may be either constitutive or inducible promoters. The promoters 
25 may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which 
combine elements of more than one promoter, are also known in the art, and are useful in the 
present invention. 

In addition, an expression vector may comprise additional elements. For example, the 
expression vector may have two replication systems, thus allowing it to be maintained in two 
30 organisms, e.g., in mammalian or insect cells for expression and in a prokaryotic host for 
cloning and amplification. Furthermore, for integrating expression vectors, the expression 
vector often contains at least one sequence homologous to the host cell genome, and 
preferably two homologous sequences which flank the expression construct. The integrating 
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vector may be directed to a specific locus in the host cell by selecting the appropriate 
homologous sequence for inclusion in the vector. Constructs for integrating vectors are well 
known in the art (e.g., Fernandez and Hoeffler, supra). 

In addition, in a preferred embodiment, the expression vector contains a selectable 
5 marker gene to allow the selection of transformed host cells. Selection genes are well known 
in the art and will vary with the host cell used. 

The lung cancer proteins of the present invention are usually produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding a lung 
cancer protein, under the appropriate conditions to induce or cause expression of the lung 

10 cancer protein. Conditions appropriate for lung cancer protein expression will vary with the 
choice of the expression vector and the host cell, and will be easily ascertained by one skilled 
in the art through routine experimentation or optimization. For example, the use of 
constitutive promoters in the expression vector will require optimizing the growth and 
proliferation of the host cell, while the use of an inducible promoter requires the appropriate 

15 growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and 
animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae 

20 and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK, 
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a 
macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the lung cancer proteins are expressed in mammalian 
cells. Mammalian expression systems are also known in the art, and include retroviral and 

25 adenoviral systems. Of particular use as mammalian promoters are the promoters from 
mammalian viral genes, since the viral genes are often highly expressed and have a broad 
host range. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter, herpes simplex virus promoter, and the CMV 
promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription termination and 

30 polyadenylation sequences recognized by mammalian cells are regulatory regions located 3' 
to the translation stop codon and thus, together with the promoter elements, flank the coding 
sequence. Examples of transcription terminator and polyadenylation signals include those 
derived form SV40. 
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The methods of introducing exogenous nucleic acid into mammalian hosts, as well as 
other hosts, is well known in the art, and will vary with the host cell used. Techniques 
include dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated 
transfection, protoplast fusion, electroporation, viral infection, encapsulation of the 
5 polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

In a preferred embodiment, lung cancer proteins are expressed in bacterial systems. 
Promoters from bacteriophage may also be used and are known in the art. In addition, 
synthetic promoters and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of 
the tip and lac promoter sequences. Furthermore, a bacterial promoter can include naturally 

1 0 occurring promoters of non-bacterial origin that have the ability to bind bacterial RNA 
polymerase and initiate transcription. In addition to a functioning promoter sequence, an 
efficient ribosome binding site is desirable. The expression vector may also include a signal 
peptide sequence that provides for secretion of the lung cancer protein in bacteria. The 
protein is either secreted into the growth media (gram-positive bacteria) or into the 

1 5 periplasmic space, located between the inner and outer membrane of the cell (gram-negative 
bacteria). The bacterial expression vector may also include a selectable marker gene to allow 
for the selection of bacterial strains that have been transformed. Suitable selection genes 
include genes which render the bacteria resistant to drugs such as ampicillin, 
chloramphenicol, erythromycin, kanamycin, neomycin and tetracycline. Selectable markers 

10 also include biosynthetic genes, such as those in the histidine, tryptophan and leucine 

biosynthetic pathways. These components are assembled into expression vectors. Expression 
vectors for bacteria are well known in the art, and include vectors for Bacillus subtilis, E. 
coli, Streptococcus cremoris, and Streptococcus Hvidans, among others (e.g., Fernandez and 
Hoeffler, supra). The bacterial expression vectors are transformed into bacterial host cells 

15 using techniques well known in the art, such as calcium chloride treatment, electroporation, 
and others. 

In one embodiment, lung cancer proteins are produced in insect cells. Expression 
vectors for the transformation of insect cells, and in particular, baculovirus-based expression 
vectors, are well known in the art. 
50 In a preferred embodiment, lung cancer protein is produced in yeast cells. Yeast 

expression systems are well known in the art, and include expression vectors for 
Saccltaromyces cerevisiae, Candida albicans and C. maltosa, Hansenula potymorpha, 
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Kluyveromyces fragilis and K. lactis, Pichia guillerimondii, and P. pastoris, 

Schizosaccharomyces pombe, and Yarrowia lipolytica. 

The lung cancer protein may also be made as a fusion protein, using techniques well 

known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired epitope 

5 is small, the lung cancer protein may be fused to a carrier protein to form an immunogen. 

Alternatively, the lung cancer protein may be made as a fusion protein to increase expression 

for affinity purification purposes, or for other reasons. For example, when the lung cancer 

protein is a lung cancer peptide, the nucleic acid encoding the peptide may be linked to other 

nucleic acid for expression purposes. 

10 In a preferred embodiment, the lung cancer protein is purified or isolated after 

expression. Lung cancer proteins may be isolated or purified in a variety of appropriate 
ways. Standard purification methods include electrophoretic, molecular, immunological and 
chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusing. For example, the lung cancer protein 

1 5 may be purified using a standard anti-lung cancer protein antibody column. Ultrafiltration 
and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
general guidance in suitable purification techniques, see Scopes (1982) Protein Purification. 
The degree of purification necessary will vary depending on the use of the lung cancer 
protein. In some instances no purification will be necessary. 

20 Once expressed and purified if necessary, the lung cancer proteins and nucleic acids 

are useful in a number of applications. They may be used as immunoselection reagents, as 
vaccine reagents, as screening agents, therapeutic entities, for production of antibodies, as 
transcription or translation inhibitors, etc. 

25 Variants of lung cancer proteins 

In one embodiment, the lung cancer proteins are derivative or variant lung cancer 
proteins as compared to the wild-type sequence. That is, as outlined more fully below, the 
derivative lung cancer peptide will often contain at least one amino acid substitution, deletion 
or insertion, with amino acid substitutions being particularly preferred. The amino acid 
30 substitution, insertion or deletion may occur at a particular residue within the lung cancer 
peptide. 

Also included within one embodiment of lung cancer proteins of the present invention 
are amino acid sequence variants. These variants typically fall into one or more of three 
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classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the lung cancer 
protein, using cassette or PCR mutagenesis or other techniques, to produce DNA encoding 
the variant, and thereafter expressing the DNA in recombinant cell culture as outlined above. 
5 However, variant lung cancer protein fragments having up to about 100-150 residues may be 
prepared by in vitro synthesis. Amino acid sequence variants are characterized by the 
predetermined nature of the variation, a feature that sets them apart from naturally occurring 
allelic or interspecies variation of the lung cancer protein amino acid sequence. The variants 
typically exhibit a similar qualitative biological activity as the naturally occurring analogue, 

10 although variants can also be selected which have modified characteristics as will be more 
fully outlined below. 

While the site or region for introducing an amino acid sequence variation is often 
predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 

1 5 conducted at the target codon or region and the expressed lung cancer variants screened for 
the optimal combination of desired activity. Techniques exist for making substitution 
mutations at predetermined sites in DNA having a known sequence, e.g., M13 primer 
mutagenesis and PCR mutagenesis. Screening of mutants is often done using assays of lung 
cancer protein activities. 

20 Amino acid substitutions are typically of single residues; insertions usually will be on 

the order of from about 1 to 20 amino acids, although considerably larger insertions may be 
occasionally tolerated. Deletions generally range from about 1 to about 20 residues, although 
in some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to arrive 

25 at a final derivative. Generally these changes are done on a few amino acids to minimize the 
alteration of the molecule. Larger changes may be tolerated in certain circumstances. When 
small alterations in the characteristics of a lung cancer protein are desired, substitutions are 
generally made in accordance with the amino acid substitution chart provided in the 
definition section. 

30 Variants typically exhibit essentially the same qualitative biological activity and will 

elicit the same immune response as a naturally-occurring analog, although variants also are 
selected to modify the characteristics of lung cancer proteins as needed. Alternatively, the 



43 



WO 02/086443 PCT/US02/12476 
variant may be designed or reorganized such that a biological activity of the lung cancer 

protein is altered. For example, glycosylation sites may be added, altered, or removed. 

Covalent modifications of lung cancer polypeptides are included within the scope of 

this invention. One type of covalent modification includes reacting targeted amino acid 

5 residues of a lung cancer polypeptide with an organic derivatizing agent that is capable of 

reacting with selected side chains or the N-or C-terminal residues of a lung cancer 

polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking 

lung cancer polypeptides to a water-insoluble support matrix or surface for use in a method 

for purifying anti-lung cancer polypeptide antibodies or screening assays, as is more fully 

10 described below. Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2- 
phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic 
acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'- 
dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido-1,8- 
octane and agents such as methyl-3-((p-azidophenyl)dithio)propipiniidate. 

15 Other modifications include deamidation of glutaminyl and asparaginyl residues to 

the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and 
lysine, phosphorylation of hydroxyl groups of serinyl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, arginine, and histidine side chains (Creighton 
(1983) Proteins: Structure and Molecular Properties, pp. 79-86), acetylation of the N-tejminal 

20 amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the lung cancer polypeptide encompassed by 
this invention is an altered native glycosylation pattern of the polypeptide. "Altering the 
native glycosylation pattern" is intended herein to mean adding to or deleting one or more 
carbohydrate moieties of a native sequence lung cancer polypeptide. Glycosylation patterns 

25 can be altered in many ways. For example the use of different cell types to express lung 
cancer-associated sequences can result in different glycosylation patterns. 

Addition of glycosylation sites to lung cancer polypeptides may also be accomplished 
by altering the amino acid sequence thereof. The alteration may be made, e.g., by the 
addition of, or substitution by, one or more serine or threonine residues to the native sequence 

30 lung cancer polypeptide (for O-linked glycosylation sites). The lung cancer amino acid 
sequence may optionally be altered through changes at the DNA level, particularly by 
mutating the DNA encoding the lung cancer polypeptide at preselected bases such that 
codons are generated that will translate into the desired amino acids. 



44 



WO 02/086443 PCT/US02/12476 
Another means of increasing the number of carbohydrate moieties on the lung cancer 

polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such 

methods are described in the art, e.g., in WO 87/05330, and in Aplin and Wriston (1981) 

CRC Crit. Rev. Biochem.. pp. 259-306. 

5 Removal of carbohydrate moieties present on the lung cancer polypeptide may be 

accomplished chemically or enzymatically or by mutational substitution of codons encoding 

for amino acid residues that serve as targets for glycosylation. Chemical deglycosylation 

techniques are known in the art and described, for instance, by Hakimuddin, et al. (1987) 

Arch. Biochem. Bionhvs., 259:52 and by Edge, et al. (1981) Anal. Biochem. . 118:131. 

10 Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a 

variety of endo-and exo-glycosidases as described by Thotakura, et al. (1987) Meth. 

Enzvmol., 138:350. 

Another type of covalent modification of lung cancer comprises linking the lung 
cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene 

15 glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent 
Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192, or 4,179,337. 

Lung cancer polypeptides of the present invention may also be modified in a way to 
form chimeric molecules comprising a lung cancer polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 

20 molecule comprises a fusion of a lung cancer polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the lung cancer polypeptide. The 
presence of such epitope-tagged forms of a lung cancer polypeptide can be detected using an 
antibody against the tag polypeptide. Also, provision of the epitope tag enables the lung 

25 cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a lung cancer polypeptide with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

30 Various tag polypeptides and their respective antibodies are well known and examples 

include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS 6 and metal 
chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. (1988) Mol. 
Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies 
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thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616); and the Herpes 

Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. (1990) Protein 

Engineering 3(6):547-553). Other tag polypeptides include the Flag-peptide (Hopp, et al. 

(1988) BioTechnologv 6:1204-1210); the KT3 epitope peptide (Martin, et al. (1992) Science 

5 255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 266:15163- 

15 166); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) Proc. Nat'l 

Acad. Sci. USA 87:6393-6397). 

Also included are other lung cancer proteins of the lung cancer family, and lung 

cancer proteins from other organisms, which are cloned and expressed as outlined below. 

10 Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be used to 
find other related lung cancer proteins from primates or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include unique areas of the lung cancer nucleic acid sequence. As is generally known in the 
art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from 

15 about 20 to about 30 being preferred, and may contain inosine as needed. PCR reaction 
conditions are well known in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to lung cancer proteins 

In a preferred embodiment, when a lung cancer protein is to be used to generate 
20 antibodies, e.g., for immunotherapy or immunodiagnosis, the lung cancer protein should 
share at least one epitope or determinant with the fiill length protein. By "epitope" or 
"determinant*' herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller lung cancer protein will be able to bind to the full-length protein, 
25 particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. 

Methods of preparing polyclonal antibodies are well known (e.g., Coligan, supra; and 
Harlow and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or 
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the 
30 immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous 
or intraperitoneal injections. The immunizing agent may include a protein encoded by a 
nucleic acid of Tables 1 A- 16 or fragment thereof or a fusion protein thereof. It may be useful 
to conjugate the immunizing agent to a protein known to be immunogenic in the mammal 
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being immunized. Immunogenic proteins include, e.g., keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Adjuvants include, e.g., 
Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicorynomycolate). The immunization protocol may be selected by one skilled in 
5 the art. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies 
may be prepared using hybridoma methods, such as those described by Kohler and Milstein 
(1975) Nature 256:495. In a hybridoma method, a mouse, hamster, or other appropriate host 
animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce 
10 or are capable of producing antibodies that will specifically bind to the immunizing agent. 
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will 
typically include a polypeptide encoded by a nucleic acid of the tables, or fragment thereof, 
or a fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are 
used if cells of human origin are desired, or spleen cells or lymph node cells are used if non- 
15 human mammalian sources are desired. The lymphocytes are then fused with an 

immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell (Goding (1986) Monoclonal Antibodies: Principles and Practice, pp. 59-103 ). 
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells 
of rodent, bovin, or primate origin. Usually, rat or mouse myeloma cell lines are employed. 
20 The hybridoma cells may be cultured in a suitable culture medium that preferably contains 
one or more substances that inhibit the growth or survival of the unfused, immortalized cells. 
For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl 
transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include 
hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the 
25 growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are 
typically monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
30 protein encoded by a nucleic acid of the tables or a fragment thereof, the other one is for any 
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 
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In a preferred embodiment, the antibodies to lung cancer protein are capable of 
reducing or eliminating a biological function of a lung cancer protein, in a naked form or 
conjugated to an effector moiety. That is, the addition of anti-lung cancer protein antibodies 
(either polyclonal or preferably monoclonal) to lung cancer tissue (or cells containing lung 
5 cancer) may reduce or eliminate the lung cancer. Generally, at least a 25% decrease in 
activity, growth, size or the like is preferred, with at least about 50% being particularly 
preferred and about a 95-100% decrease being especially preferred. 

In a preferred embodiment the antibodies to the lung cancer proteins are humanized 
antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein Design Labs, 

10 Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric molecules of 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab\ 
F(ab*)2 or other antigen-binding subsequences of antibodies) which contain minimal 
sequence derived from non-human immunoglobulin. Humanized antibodies include human 
immunoglobulins (recipient antibody) in which residues from a complementary deteimining 

1 5 region (CDR) of the recipient are replaced by residues from a CDR of a non-human species 
(donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 
capacity. In some instances, Fv framework residues of a human immunoglobulin are 
replaced by corresponding non-human residues. Humanized antibodies may also comprise 
residues which are found neither in the recipient antibody nor in the imported CDR or 

20 framework sequences. In general, a humanized antibody will comprise substantially all of at 
least one, and typically two, variable domains, in which all or substantially all of the CDR 
regions correspond to those of a non-human immunoglobulin and all or substantially all of 
the framework (FR) regions are those of a human immunoglobulin consensus sequence. A 
humanized antibody optimally also will typically comprise at least a portion of an 

25 immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones, et 
al. (1986) Nature 321 :522-525; Riechmann, et al. (1988) Nature 332:323-329; and Presta 
(1992) Curr. Op. Struct. Biol. 2:593-596), Humanization can be performed following the 
method of Winter and co-workers (Jones, et al. (1986) Nature 321 :522-525; Riechmann, et al. 
(1988) Nature 332:323-327; Verhoeyen, et al. (1988) Science 239:1534-1536), by 

30 substituting rodent CDRs or CDR sequences for corresponding sequences of a human 

antibody. Accordingly, such humanized antibodies are chimeric antibodies (U.S. Patent No. 
4,816,567), wherein substantially less than an intact human variable domain has been 
substituted by corresponding sequence from a non-human species. 



48 



WO 02/086443 PCT/US02/12476 
Human-like antibodies can also be produced using various techniques known in the 

art, including phage display libraries (Hoogenboom and Winter (1991) J. Mol. Biol. 227:381; 

Marks, et al. (1991) J. Mol. Biol. 222:581). The techniques of Cole, et al. and Boerner, et al. 

are also available for the preparation of human monoclonal antibodies (Cole, et al. (1985) 

5 Monoclonal Antibodies and Cancer Therapy, p. 77 and Boerner, et al. (1991) J. Immunol. 

147(l):86-95). Similarly, human antibodies can be made by introducing human 

immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 

immunoglobulin genes have been partially or completely inactivated. Upon challenge, 

human antibody production is observed, which closely resembles that seen in humans in 

10 nearly all respects, including gene rearrangement, assembly, and antibody repertoire. This 
approach is described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in the following scientific publications: Marks, et al. (1992) 
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994) 
Nature 368:812-13; Fishwild, et al. (1996) Nature Biotechnology 14:845-51; Neuberger 

15 (1996) Nature Biotechnology 14:826; and Lonberg and Huszar (1995) Intern. Rev. Immunol. 
13:65-93. 

By immunotherapy is meant treatment of lung cancer with an antibody raised against 
a lung cancer proteins. As used herein, immunotherapy can be passive or active. Passive 
immunotherapy as defined herein is the passive transfer of antibody to a recipient (patient). 

20 Active immunization is the induction of antibody and/or T-cell responses in a recipient 

(patient). Induction of an immune response is the result of providing the recipient with an 
antigen to which antibodies are raised. The antigen may be provided by injecting a 
polypeptide against which antibodies are desired to be raised into a recipient, or contacting 
the recipient with a nucleic acid capable of expressing the antigen and under conditions for 

25 expression of the antigen, leading to an immune response. 

In a preferred embodiment the lung cancer proteins against which antibodies are 
raised are secreted proteins as described above. Without being bound by theory, antibodies 
used for treatment, may bind and prevent the secreted protein from binding to its receptor, 
thereby inactivating the secreted lung cancer protein. 

30 In another preferred embodiment, the lung cancer protein to which antibodies are 

raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment may bind the extracellular domain of the lung cancer protein and prevent it from 
binding to other proteins, such as circulating ligands or cell-associated molecules. The 
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antibody may cause down-regulation of the transmembrane lung cancer protein. The 

antibody may be a competitive, non-competitive or uncompetitive inhibitor of protein binding 

to the extracellular domain of the lung cancer protein. The antibody may be an antagonist of 

the lung cancer protein or may prevent activation of a transmembrane lung cancer protein, or 

5 may induce or suppress a particular cellular pathway. In some embodiments, when the 

antibody prevents the binding of other molecules to the lung cancer protein, the antibody 

prevents growth of the cell. The antibody may also be used to target or sensitize the cell to 

cytotoxic agents, including, but not limited to TNF-ct, TNF-p, EL-1, INF-y, and EL-2, or 

chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate, 

10 and the like. In some instances the antibody may belong to a sub-type that activates serum 
complement when complexed with the transmembrane protein thereby mediating cytotoxicity 
or antigen-dependent cytotoxicity (ADCC). Thus, lung cancer may be treated by 
administering to a patient antibodies directed against the transmembrane lung cancer protein. 
Antibody-labeling may activate a co-toxin, localize a toxin payload, or otherwise provide 

1 5 means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector moiety. 
The effector moiety can be various molecules, including labeling moieties such as radioactive 
labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the therapeutic 
moiety is a small molecule that modulates the activity of a lung cancer protein. In another 

20 aspect the therapeutic moiety may modulate an activity of molecules associated with or in 
close proximity to a lung cancer protein. The therapeutic moiety may inhibit enzymatic or 
signaling activity such as protease or collagenase activity associated with lung cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In 
this method, targeting the cytotoxic agent to lung cancer tissue or cells results in a reduction 

25 in the number of afflicted cells, thereby reducing symptoms associated with lung cancer. 

Cytotoxic agents are numerous and varied and include, but are not limited to, cytotoxic drugs 
or toxins or active fragments of such toxins. Suitable toxins and their corresponding 
fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A chain, curcin, 
crotin, phenomycin, enomycin, saporin, auristatin, and the like. Cytotoxic agents also include 

30 radiochemicals made by conjugating radioisotopes to antibodies raised against lung cancer 

proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to. 
the antibody. Targeting the therapeutic moiety to transmembrane lung cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the lung cancer 
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afflicted area, but also serves to reduce deleterious side effects that may be associated with 

the untargeted therapeutic moiety. 

In another preferred embodiment, the lung cancer protein against which the antibodies 

are raised is an intracellular protein. In this case, the antibody may be conjugated to a protein 

5 or other entity which facilitates entry into the cell. In one case, the antibody enters the cell by 

endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 

the individual or cell. Moreover, wherein the lung cancer protein can be targeted within a 

cell, i.e., the nucleus, an antibody theretomay contain a signal for that target localization, i.e., 

a nuclear localization signal. 

1 0 The lung cancer antibodies of the invention specifically bind to lung cancer proteins. 

By "specifically bind" herein is meant that the antibodies bind to the protein with a Ka of at 

least about 0.1 mM, more usually at least about 1 pM, preferably at least about 0.1 pM or 

better, and most preferably, 0.01 \jM or better. Selectivity of binding to the specific target 

and not to related other sequences is also important. 

15 

Detection of lung cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the lung cancer phenotype. Expression levels of genes in normal tissue (e.g., 
not undergoing lung cancer), in lung cancer tissue (and in some cases, for varying severities 

20 of lung cancer that relate to prognosis, as outlined below), or in non-malignant disease are 
evaluated to provide expression profiles. A gene expression profile of a particular cell state 
or point of development is essentially a "fingerprint" of the state of the cell. While two states 
may have a particular gene similarly expressed, the evaluation of a number of genes 
simultaneously allows the generation of a gene expression profile that is reflective of the state 

25 of the cell. By comparing expression profiles of cells in different states, information 

regarding which genes are important (including both up- and down-regulation of genes) in 
each of these states is obtained. Then, diagnosis may be performed or confirmed to 
determine whether a tissue sample has the gene expression profile of normal or cancerous 
tissue. This will provide for molecular diagnosis of related conditions. 

30 "Differential expression," or grammatical equivalents as used herein, refers to 

qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
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normal versus lung cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
5 not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Asymetrix 

10 GeneChip™ expression arrays, Lockhart (1996) Nature Biotechnology 14:1675-1680, hereby 
expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is typically 
at least about 50%, more preferably at least about 100%, more preferably at least about 

1 5 1 50%, more preferably at least about 200%, with from 300 to at least 1 000% being especially 
preferred. 

Evaluation may be at the gene transcript or the protein level. The amount of gene 
expression may be monitored using nucleic acid probes to the RNA or DNA equivalent of the 
gene transcript, and the quantification of gene expression levels, or, alternatively, the final 

20 gene product itself (protein) can be monitored, e.g., with antibodies to the lung cancer protein 
and standard immunoassays (ELISAs, etc.) or other techniques, including mass spectroscopy 
assays, 2D gel electrophoresis assays, etc. Proteins corresponding to lung cancer genes, e.g., 
those identified as being important in a lung cancer or disease phenotype, can be evaluated in 
a lung cancer diagnostic test. In a preferred embodiment, gene expression monitoring is 

25 performed simultaneously on a number of genes. 

The lung cancer nucleic acid probes may be attached to biochips as outlined herein for 
the detection and quantification of lung cancer sequences in a particular cell. The assays are 
further described below in the example. PCR techniques can be used to provide greater 
sensitivity. Multiple protein expression monitoring can be performed as well. Similarly, 

30 these assays may be performed on an individual basis as well. 

In a preferred embodiment nucleic acids encoding the lung cancer protein are 
detected. Although DNA or RNA encoding the lung cancer protein may be detected, of 
particular interest are methods wherein an mRNA encoding a lung cancer protein is detected. 
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Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is complementary to 

and hybridizes with the mRNA and includes, but is not limited to, oligonucleotides, cDNA or 

KNA. Probes also should contain a detectable label, as defined herein. In one method the 

mRNA is detected after immobilizing the nucleic acid to be examined on a solid support such 

5 as nylon membranes and hybridizing the probe with the sample. Following washing to 

remove the non-specifically bound probe, the label is detected. In another method detection 

of the mRNA is performed in situ. In this method permeabilized cells or tissue samples are 

contacted with a detectably labeled nucleic acid probe for sufficient time to allow the probe 

to hybridize with the target mRNA. Following washing to remove the non-specifically bound 

10 probe, the label is detected. For example a digoxygenin labeled riboprobe (RNA probe) that 
is complementary to the mRNA encoding a lung cancer protein is detected by binding the 
digoxygenin with an anti-digoxygenin secondary antibody and developed with nitro blue 
tetrazolium and 5-bromo-4-chloro-3-indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins as 

15 described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The lung cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing lung cancer sequences are used in diagnostic assays. This can be performed on an 
individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 

20 techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, lung cancer proteins, including intracellular, 
transmembrane, or secreted proteins, find use as markers of lung cancer, e.g., for prognostic 
or diagnostic purposes. Detection of these proteins in putative lung cancer tissue allows for 

25 detection, prognosis, or diagnosis of lung cancer or similar disease, and perhaps for selection 
of therapeutic strategy. In one embodiment, antibodies are used to detect lung cancer 
proteins. A preferred method separates proteins from a sample by electrophoresis on a gel 
(typically a denaturing and reducing protein gel, but may be another type of gel, including 
isoelectric focusing gels and the like). Following separation of proteins, the lung cancer 

30 protein is detected, e.g., by immunoblotting with antibodies raised against the lung cancer 
protein. Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the lung cancer protein find use in in situ 
imaging techniques, e.g., in histology (e.g., Asai (ed. 1993) Methods in Cell Biology: 
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Antibodies in Cell Biology, volume 37. In this method cells are contacted with from one to 
many antibodies to the lung cancer protein(s). Following washing to remove non-specific 
antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 
the antibody is detected by incubating with a secondary antibody that contains a detectable 
5 label, e.g., multicolor fluorescence or confocal imaging. In another method the primary 

antibody to the lung cancer protein(s) contains a detectable label, e.g., an enzyme marker that 
can act on a substrate. In another preferred embodiment each one of multiple primary 
antibodies contains a distinct and detectable label. This method finds particular use in 
simultaneous screening for a plurality of lung cancer proteins. Many other histological 

1 0 imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability 
to detect and distinguish emissions of different wavelengths. In addition, a fluorescence 
activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing lung cancer from 

IS blood, serum, plasma, stool, and other samples. Such samples, therefore, are useful as 

samples to be probed or tested for the presence of lung cancer proteins. Antibodies can be 
used to detect a lung cancer protein by previously described immunoassay techniques 
including ELIS A, immunoblotting (western blotting), immunoprecipitation, BIACORE 
technology and the like. Conversely, the presence of antibodies may indicate an immune 

20 response against an endogenous lung cancer protein or vaccine. 

In a preferred embodiment, in situ hybridization of labeled lung cancer nucleic acid 
probes to tissue arrays is done. For example, arrays of tissue samples, including lung cancer 
tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) is then 
performed. When comparing the fingerprints between an individual and a standard, the 

25 skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It is 
further understood that the genes which indicate the diagnosis may differ from those which 
indicate the prognosis and molecular profiling of the condition of the cells may lead to 
distinctions between responsive or refractory conditions or may be predictive of outcomes. 
In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids, 

30 modified proteins and cells containing lung cancer sequences are used in prognosis assays. 
As above, gene expression profiles can be generated that correlate to lung cancer, clinical, 
pathological, or other information, in terms of long term prognosis. Again, this may be done 
on either a protein or gene level, with the use of genes being preferred. Single or multiple 
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genes may be useful in various combinations. As above, lung cancer probes may be attached 
to biochips for the detection and quantification of lung cancer sequences in a tissue or patient. 
The assays proceed as outlined above for diagnosis. PCR method may provide more 
sensitive and accurate quantification. 

5 

Assays for therapeutic compounds 

In a preferred embodiment, the proteins, nucleic acids, and antibodies as described 
herein are used in drug screening assays. The lung cancer proteins, antibodies, nucleic acids, 
modified proteins and cells containing lung cancer sequences are used in drug screening 

10 assays or by evaluating the effect of drug candidates on a "gene expression profile" or 

expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
Zlokamik, et al. (1998) Science 279:84-8; Heid (1996) Genome Res. 6:986-94. 

15 In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids, 

modified proteins and cells containing the native or modified lung cancer proteins are used in 
screening assays. That is, the present invention provides novel methods for screening for 
compositions which modulate the lung cancer phenotype or an identified physiological 
function of a lung cancer protein. As above, this can be done on an individual gene level or 

20 by evaluating the effect of drug candidates on a "gene expression profile*'. In a preferred 

embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 

Having identified differentially expressed genes herein, a variety of assays may be 

25 performed. In a preferred embodiment, assays may be run on an individual gene or protein 
level. That is, having identified a particular gene with altered regulation in lung cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
lung cancer protein. "Modulation" thus includes an increase or a decrease in gene 
expression. The preferred amount of modulation will depend on the original change of the 

30 gene expression in normal versus tissue undergoing lung cancer, with changes of at least 

10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in lung cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in lung 
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cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in 

expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes and the 

quantification of gene expression levels, or, alternatively, the gene product itself can be 

5 monitored, e.g., through the use of antibodies to the lung cancer protein and standard 

immunoassays. Proteomics and separation techniques may also allow quantification of 

expression. 

In a preferred embodiment, gene or protein expression monitoring of a number of 
entities, i.e., an expression profile, is monitored simultaneously. Such profiles will typically 
1 0 involve a plurality of those entities described herein. 

In this embodiment, the lung cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of lung cancer sequences in a particular 
cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, may be used 
with dispensed primers in desired wells. A PCR reaction can then be performed and analyzed 
15 for each well. 

Expression monitoring can be performed to identify compounds that modify the 
expression of one or more lung cancer-associated sequences, e.g., a polynucleotide sequence 
set out in the tables. Generally, in a preferred embodiment, a test compound is added to the 
cells prior to analysis. Moreover, screens are also provided to identify agents that modulate 

20 lung cancer, modulate lung cancer proteins, bind to a lung cancer protein, or interfere with 
the binding of a lung cancer protein and an antibody, substrate, or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or^ 

25 indirectly alter the lung cancer phenotype or the expression of a lung cancer sequence, e.g., a 
nucleic acid or protein sequence. In preferred embodiments, modulators alter expression 
profiles of nucleic acids or proteins provided herein. In one embodiment, the modulator 
suppresses a lung cancer phenotype, e.g., to a normal or non-malignant tissue fingerprint. In 
another embodiment, a modulator induces a lung cancer phenotype. Generally, a plurality of 

30 assay mixtures are run in parallel with different agent concentrations to obtain a differential 
' response to the various concentrations. Typically, one of these concentrations serves as a 
negative control, i.e., at zero concentration or below the level of detection. 
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In one aspect, a modulator will neutralize the effect of a lung cancer protein. By 
''neutralize" is meant that activity of a protein and the consequent effect on the cell is 
inhibited or blocked. 

In certain embodiments, combinatorial libraries of potential modulators will be 
5 screened for an ability to bind to a lung cancer polypeptide or to modulate activity. 

Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 

10 are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 

15 display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical 

10 library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compound length (i.e., 
the number of amino acids in a polypeptide compound). Millions of chemical compounds 
can be synthesized through such combinatorial mixing of chemical building blocks (Gallop, 
et al. (1994) J. Med. Chem. 37(9):1233-1251). 

15 Preparation and screening of combinatorial chemical libraries is well known to those 

of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Pept. Prot Res. 37:487- 
493, Houghton, et al. (1991) Nature. 354:84-88), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 

\0 Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,5 14), diversomers such as 
hydantoins, benzodiazepines and dipeptides (Hobbs, et al. (1993) Proc. Nat. Acad. Sci. USA 
90:6909-6913), vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc. 
1 14:6568), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding (Hirschmann, et 
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al. (1992) J. Amer. Chem. Soc. 114:9217-9218), analogous organic syntheses of small 

compound libraries (Chen, et al. (1994) J. Amer. Chem. Soc. 116:2661), oligocarbamates 

(Cho, et al. (1993) Science 261:1303), and/or peptidyl phosphonates (Campbell, et al. (1994) 

J. Org. Chem. 59:658). See, generally, Gordon, et al. (1994) J. Med. Chem. 37:1385, nucleic 

5 acid libraries (see, e.g., Stratagene, Corp.), peptide nucleic acid libraries (see, e.g., U.S. 

Patent 5,539,083), antibody libraries (see, e.g., Vaughn, et al. (1996) Nature Biotechnology 

14(3):309-314, and PC17US96/10287), carbohydrate libraries (see, e.g., Liang, et al. (1996) 

Science 274:1520-1522, and U.S. Patent No. 5,593,853), and small organic molecule libraries 

(see, e.g., benzodiazepines, Baum (1993) C&EN, Jan 18, page 33; isoprenoids, U.S. Patent 

10 No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, 
U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, U.S. Patent No. 
5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are commercially available (see, 
e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, Rainin, 

15 Woburn, MA, 433 A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, Bedford, 
MA). 

A number of well known robotic systems have also been developed for solution phase 
chemistries. These systems include automated workstations like the automated synthesis 
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic 

20 systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, Mass.; Orca, 

Hewlett-Packard, Palo Alto, Calif.), which mimic the manual synthetic operations performed 
by a chemist. The above devices, with appropriate modification, are suitable for use with the 
present invention. In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N. J., Asinex, Moscow, Ru, Tripos, 

25 Inc., St. Louis, MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek 
Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect modulation of lung cancer gene transcription, polypeptide 
expression, and polypeptide activity. 

30 High throughput assays for evaluating the presence, absence, quantification, or other 

properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
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U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available (see, e.g., 
5 Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 

Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
typically automate procedures, including sample and reagent pipetting, liquid dispensing, 
timed incubations, and final readings of the microplate in detector(s) appropriate for the 
assay. These configurable systems provide high throughput and rapid start up as well as a 

1 0 high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 

1 5 fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or 
random or directed digests of proteinaceous cellular extracts, may be used. In this way 
libraries of proteins may be made for screening in the methods of the invention. Particularly 
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins, 
with the latter being preferred, and human proteins being especially preferred. Particularly 

20 useful test compound will be directed to the class of proteins to which the target belongs, e.g., 
substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 30 
amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to 
about 15 being particularly preferred. The peptides may be digests of naturally occurring 

25 proteins, random peptides, or "biased" random peptides. By "randomized" or grammatical 
equivalents herein is meant that the nucleic acid or peptide consists of essentially random 
sequences of nucleotides and amino acids, respectively. Since these random peptides (or 
nucleic acids, discussed below) are often chemically synthesized, they may incorporate a 
nucleotide or amino acid at any position. The synthetic process can be designed to generate 

30 randomized proteins or nucleic acids, to allow the formation of all or most of the possible 

combinations over the length of the sequence, thus forming a library of randomized candidate 
bioactive proteinaceous agents. 

59 



WO 02/086443 PCT/US02/12476 

In one embodiment, the library is fully randomized, with no sequence preferences or 
J constants at any position. In a preferred embodiment, the library is biased. That is, some 
positions within the sequence are either held constant, or are selected from a limited number 
of possibilities. In a preferred embodiment, the nucleotides or amino acid residues are 
5 randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues, 
sterically biased (either small or large) residues, towards the creation of nucleic acid binding 
domains, the creation of cysteines, for cross-linking, prolines for SH-3 domains, serines, 
threonines, tyrosines or histidines for phosphorylation sites, etc. 

Modulators of lung cancer can also be nucleic acids, as defined above. 
10 As described above generally for proteins, nucleic acid modulating agents may be 

naturally occurring nucleic acids, random nucleic acids, or <c biased" random nucleic acids. 
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical moieties, a 
wide variety of which are available in the literature. 
15 After a candidate agent has been added and the cells allowed to incubate for some 

period of time, the sample containing a target sequence is analyzed. If required, the target 
sequence is prepared using known techniques. For example, the sample may be treated to 
lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or 
amplification such as PCR performed as appropriate. For example, an in vitro transcription 
20 with labels covalently attached to the nucleotides is performed. Generally, the nucleic acids 
are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the 
target sequence's specific binding to a probe. The label also can be an enzyme, such as, 
25 alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate 
substrate produces a product that can be detected. Alternatively, the label can be a labeled 
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or 
altered by the enzyme. The label also can be a moiety or compound, such as, an epitope tag 
or biotin which specifically binds to streptavidin. For the example of biotin, the streptavidin 
30 is labeled as described above, thereby, providing a detectable signal for the bound target 
sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

Nucleic acid assays can be direct hybridization assays or can comprise "sandwich 
assays", which include the use of multiple probes, as is generally outlined in U.S. Patent Nos. 
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5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 

5,591,584, 5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of 

which are hereby incorporated by reference. In this embodiment, in general, the target nucleic 

acid is prepared as outlined above, and then added to the biochip comprising a plurality of 

5 nucleic acid probes, under conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including 

high, moderate and low stringency conditions as outlined above. The assays are generally 

run under stringency conditions which allow formation of the label probe hybridization 

complex only in the presence of target. Stringency can be controlled by altering a step 

10 parameter that is a thermodynamic variable, including, but not limited to, temperature, 
formamide concentration, salt concentration, chaotropic salt concentration, pH, organic 
solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 
outlined in U.S. Patent No. 5,681,697: Thus it may be desirable to perform certain steps at 

1 5 higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. Components 
of the reaction may be added simultaneously, or sequentially, in different orders, with 
preferred embodiments outlined below. In addition, the reaction may include a variety of 
other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc. 

20 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 
The assay data are analyzed to determine the expression levels, and changes in 

25 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the lung cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
particular expression profile, thus preferably generating the associated phenotype. In another 
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes 

30 important in a particular state, screens can be performed to identify modulators that alter 

expression of individual genes. In an another embodiment, screening is performed to identify 
modulators that alter a biological function of the expression product of a differentially 
expressed gene. Again, having identified the importance of a gene in a particular state, 
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screens are performed to identify agents that bind and/or modulate the biological activity of 

the gene product, or evaluate genetic polymorphisms. 

Genes can be screened for those that are induced in response to a candidate agent. 

After identifying a modulator based upon its ability to suppress a lung cancer expression 

5 pattern leading to a normal expression pattern, or to modulate a single lung cancer gene 

expression profile so as to mimic the expression of the gene from normal tissue, a screen as 

described above can be performed to identify genes that are specifically modulated in 

response to the agent. Comparing expression profiles between normal tissue and agent 

treated lung cancer tissue reveals genes that are not expressed in normal tissue or lung cancer 

10 tissue, but are expressed in agent treated tissue. These agent-specific sequences can be 
identified and used by methods described herein for lung cancer genes or proteins. In 
particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated lung cancer tissue sample. 

15 Thus, in one embodiment, a test compound is administered to a population of lung 

cancer cells, that have an associated lung cancer expression profile. By "administration" or 
"contacting" herein is meant that the candidate agent is added to the cells in such a manner as 
to allow the agent to act upon the cell, whether by uptake and intracellular action, or by 
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous 

20 candidate agent (i.e., a peptide) may be put into a viral construct such as an adenoviral or 
retroviral construct, and added to the cell, such that expression of the peptide agent is 
accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems can also be used. 

Once a test compound has been administered to the cells, the cells can be washed if 
desired and are allowed to incubate under preferably physiological conditions for some 

25 period of time. The cells are then harvested and a new gene expression profile is generated, 
as outlined herein. 

Thus, e.g., lung cancer or non-malignant tissue may be screened for agents that 
modulate, e.g., induce or suppress a lung cancer phenotype. A change in at least one gene, 
preferably many, of the expression profile indicates that the agent has an effect on lung 
30 cancer activity. By defining such a signature for the lung cancer phenotype, screens for new 
drugs that alter the phenotype can be devised. With this approach, the drug target need not be 
known and need not be represented in the original expression screening platform, nor does 
the level of transcript for the target protein need to change. 
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Measure of lung cancer polypeptide activity, or of lung cancer or the lung cancer 

phenotype can be performed using a variety of assays. For example, the effects of the test 

compounds upon the function of the metastatic polypeptides can be measured by exa min i n g 

parameters described above. A suitable physiological change that affects activity can be used 

5 to assess the influence of a test compound on the polypeptides of this invention. When the 

functional consequences are determined using intact cells or animals, one can also measure a 

variety of effects such as, in the case of lung cancer associated with tumors, tumor growth, 

tumor metastasis, neovascularization, hormone release, transcriptional changes to both known 

and uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as 

10 cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In 
the assays of the invention, mammalian lung cancer polypeptide is typically used, e.g., 
mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in vitro. 
For example, a lung cancer polypeptide is first contacted with a potential modulator and 

15 incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, the 
lung cancer polypeptide levels are determined in vitro by measuring the level of protein or 
mRNA. The level of protein is typically measured using immunoassays such as western 
blotting, ELISA and the like with an antibody that selectively binds to the lung cancer 
polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using 

20 PCR, LCR, or hybridization assays, e.g., northern hybridization, RNAse protection, dot 

blotting, are preferred. The level of protein or mRNA is typically detected using directly or 
indirectly labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids, 
radioactively or enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using a lung cancer protein 

25 promoter operably linked to a reporter gene such as luciferase, green fluorescent protein, 
CAT, or p-gal. The reporter construct is typically transfected into a cell. After treatment 
with a potential modulator, the amount of reporter gene transcription, translation, or activity 
is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on individual 

30 genes and gene products (proteins). That is, having identified a particular differentially 

expressed gene as important in a particular state, screening of modulators of the expression of 
the gene or the gene product itself can be done. The gene products of differentially expressed 
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genes are sometimes referred to herein as "lung cancer proteins." The lung cancer protein 
may be a fragment, or alternatively, be the full length protein to a fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes is 
performed. Typically, the expression of only one or a few genes are evaluated. In another 
5 embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated 
10 gene product is used; that is, the gene products of one or more differentially expressed 

nucleic acids are made. For example, antibodies are generated to the protein gene products, 
and standard immunoassays are run to determine the amount of protein present. Alternatively, 
cells comprising the lung cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a lung cancer 
15 protein and a candidate compound, and determining the binding of the compound to the lung 
cancer protein. Preferred embodiments utilize the human lung cancer protein, although other 
mammalian proteins may also be used, e.g., for the development of animal models of human 
disease. In some embodiments, as outlined herein, variant or derivative lung cancer proteins 
may be used. 

20 Generally, in a preferred embodiment of the methods herein, the lung cancer protein 

or the candidate agent is non-diffusably bound to an insoluble support, preferably having 
isolated sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble 
supports may be made of a composition to which the compositions can be bound, is readily 
separated from soluble material, and is otherwise compatible with the overall method of 

25 screening. The surface of such supports may be solid or porous and of a convenient shape. 
Examples of suitable insoluble supports include microtiter plates, arrays, membranes and 
beads. These are typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon 
or nitrocellulose, teflon™, etc. Microtiter plates and arrays are especially convenient because 
a large number of assays can be carried out simultaneously, using small amounts of reagents 

30 and samples. The particular manner of binding of the composition is typically not crucial so 
long as it is compatible with the reagents and overall methods of the invention, maintains the 
activity of the composition, and is nondiffiisable. Preferred methods of binding include the 
use of antibodies (which do not sterically block either the ligand binding site or activation 
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sequence when the protein is bound to the support), direct binding to "sticky" or ionic 

supports, chemical crosslinking, the synthesis of the protein or agent on the surface, etc. 
Following binding of the protein or agent, excess unbound material is removed by washing. 
The sample receiving areas may then be blocked through incubation with bovine serum 
5 albumin (BSA), casein or other innocuous protein or other moiety. 

In a preferred embodiment, the lung cancer protein is bound to the support, and a test 
compound is added to the assay. Alternatively, the candidate agent is bound to the support 
and the lung cancer protein is added. Novel binding agents include specific antibodies, non- 
natural binding agents identified in screens of chemical libraries, peptide analogs, etc/Of 

10 particular interest are screening assays for agents that have a low toxicity for human cells. A 
wide variety of assays may be used for this purpose, including labeled in vitro protein-protein 
binding assays, electrophoretic mobility shift assays, immunoassays for protein binding, 
functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the lung cancer 

1 5 protein may be done in a number of ways. In a preferred embodiment, the compound is 

labeled, and binding determined directly, e.g., by attaching all or a portion of the lung cancer 
protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label), washing 
off excess reagent, and determining whether the label is present on the solid support. Various 
blocking and washing steps may be utilized as appropriate. 

20 In some embodiments, only one of the components is labeled, e.g., the proteins (or 

proteinaceous candidate compounds) can be labeled. Alternatively, more than one 
component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor for 
the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

25 In one embodiment, the binding of the test compound is determined by competitive 

binding assay. The competitor may be a binding moiety known to bind to the target molecule 
(i.e., a lung cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under 
certain circumstances, there may be competitive binding between the compound and the 
binding moiety, with the binding moiety displacing the compound. In one embodiment, the 

30 test compound is labeled. Either the compound, or the competitor, or both, is added first to 
the protein for a time sufficient to allow binding, if present. Incubations may be performed at 
a temperature which facilitates optimal activity, typically between 4 and 40° C. Incubation 
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically 
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between 0.1 and 1 hour will be sufficient. Excess reagent is generally removed or washed 

away. The second component is then added, and the presence or absence of the labeled 

component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by a test 

5 compound. Displacement of the competitor is an indication that the test compound is binding 

to the lung cancer protein and thus is capable of binding to, and potentially modulating, the 

activity of the lung cancer protein. In this embodiment, either component can be labeled. 

Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 

displacement by the agent. Alternatively, if the test compound is labeled, the presence of the 

1 0 label on the support indicates displacement. 

In an alternative embodiment, the test compound is added first, with incubation and 
washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the test compound is bound to the lung cancer protein with a higher affinity. Thus, if the 
test compound is labeled, the presence of the label on the support, coupled with a lack of • 

15 competitor binding, may indicate that the test compound is capable of binding to the lung 
cancer protein. 

In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of the lung cancer proteins. In one 
embodiment, the methods comprise combining a lung cancer protein and a competitor in a 

20 first sample. A second sample comprises a test compound, a lung cancer protein, and a 

competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the lung cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

25 agent is capable of binding to the lung cancer protein. 

Alternatively, differential screening is used to identify drug candidates that bind to the 
native lung cancer protein, but cannot bind to modified lung cancer proteins. The structure of 
the lung cancer protein may be modeled, and used in rational drug design to synthesize agents 
that interact with that site. Drug candidates that affect the activity of a lung cancer protein 

30 are also identified by screening drugs for the ability to either enhance or reduce the activity of 
the protein. 

Positive controls and negative controls may be used in the assays. Preferably control 
and test samples are performed in at least triplicate to obtain statistically significant results. 
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Incubation of all samples is for a time sufficient for the binding of the agent to the protein. 
Following incubation, samples are washed free of non-specifically bound material and the 
amount of bound, generally labeled agent determined. For example, where a radiolabel is 
employed, the samples may be counted in a scintillation counter to determine the amount of 
5 bound compound. 

A variety of other reagents may be included in the screening assays. These include 
reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used to 
facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
10 protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a lung cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising lung cancer 
1 5 proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes a lung cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous or 
subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, antigens, 
20 cytokines, growth factors, action potentials, pharmacological agents including 

chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate lung cancer agents are identified. Compounds 
with pharmacological activity are able to enhance or interfere with the activity of the lung 
25 cancer protein. Once identified, similar structures are evaluated to identify critical structural 
feature of the compound. 

In one embodiment, a method of inhibiting lung cancer cell division is provided. The 
method comprises administration of a lung cancer inhibitor. In another embodiment, a 
method of inhibiting lung cancer is provided. The method may comprise administration of a 
30 lung cancer inhibitor. In a further embodiment, methods of treating cells or individuals with 
lung cancer are provided, e.g., comprising administration of a lung cancer inhibitor. 

In one embodiment, a lung cancer inhibitor is an antibody as discussed above. In 
another embodiment, the lung cancer inhibitor is an antisense molecule. 
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A variety of cell growth, proliferation, viability, and metastasis assays are known to 
those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of lung cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate 
the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney (1994) Culture of Animal Cells a Manual of Basic Technique (3 rd ed.), 
herein incorporated by reference. See also, the methods section of Garkavtsev, et al. (1996), 
supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they 
touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
25 higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
30 normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a preferred 
method of measuring density limitation of growth. Transformed host cells are transfected 
with a lung cancer-associated sequence and are grown for 24 hours at saturation density in 
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non-limiting medium conditions. The percentage of cells labeling with ( 3 H)-thymidine is 

determined autoradiographically. See, Freshney (1994), supra. 



Growth factor or serum dependence 
5 Transformed cells typically have a lower serum dependence than their normal 

counterparts (see, e.g., Temin (1966) J. Natl. Cancer Insti. 37:167-175; Eagle, et al. (1970) L 
Exp. Med. 13 1:836-879); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

10 

Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 

15 Gullino, "Angiogenesis, tumor vascularization, and potential interference with tumor growth" 
in Mihich (ed. 1985) Biological Responses in Cancer, pp. 178-184). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman (1992) "Angiogenesis and Cancer" in Sem Cancer Biol.) . 
Various techniques which measure the release of these factors are described in 

20 Freshney (1994), supra. Also, see, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305; 

Strickland and Beers (1976) J. Biol. Chem . 251:5694-5702; Whur, et al. (1980) Br. J. Cancer 
42:305-312; Gullino, "Angiogenesis, tumor vascularization, and potential interference with 
tumor growth" in Mihich (ed. 1985) Biological Responses in Cancer , pp. 178-184; Freshney 
Anticancer Res. 5:111-130(1 985). 

25 

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate lung cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
30 invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 
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Techniques described in Freshney (1994), supra, can be used. Briefly, the level of 

invasion of host cells can be measured by using filters coated with Matrigel or some other 

extracellular matrix constituent. Penetration into the gel, or through to the distal side of the 

filter, is rated as invasiveness, and rated histologically by number of cells and distance 

5 moved, or by prelabeling the cells with l25 I and counting the radioactivity on the distal side of 

the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

Effects of lung cancer-associated sequences on cell growth can be tested in transgenic 

10 or immune-suppressed mice. Knock-out transgenic mice can be made, in which the lung 
cancer gene is disrupted or in which a lung cancer gene is inserted. Knock-out transgenic 
mice can be made by insertion of a marker gene or other heterologous gene into the 
endogenous lung cancer gene site in the mouse genome via homologous recombination. 
Such mice can also be made by substituting the endogenous lung cancer gene with a mutated 

15 version of the lung cancer gene, or by mutating the endogenous lung cancer gene, e.g., by 
exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 

20 that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion (see, e.g., Capecchi, et al. (1989) Science 244:1288). Chimeric targeted mice can be 
derived according to Hogan, et al. (1988) Manipulating the Mouse Embryo: A Laboratory 
Manual, Cold Spring Harbor Laboratory and Robertson (ed. 1987) Teratocarcinomas and 

25 Embryonic Stem Cells: A Practical Approach. , IRL Press, Washington, D.C. 

Alternatively, various immune-suppressed or immune-deficient host animals can be 
used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) JL 
Natl. Cancer Inst. 52:921), a SCID mouse, a thymectomized mouse, or an irradiated mouse 
(see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263; Selby, et al. (1980) Br. J. Cancer 41 :52) 

30 can be used as a host. Transplantable tumor cells (typically about 10 6 cells) injected into 

isogenic hosts will produce invasive tumors in a high proportions of cases, while normal cells 
of similar origin will not. In hosts which developed invasive tumors, cells expressing a lung 
cancer-associated sequences are injected subcutaneously. After a suitable length of time, 
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preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two largest 

dimensions) and compared to the control. Tumors that have statistically significant reduction 

(using, e.g., Student's T test) are said to have inhibited growth. 

5 Polynucleotide modulators of lung cancer 

Antisense and RNAi Polynucleotides 

In certain embodiments, the activity of a lung cancer-associated protein is 
downregulated, or entirely inhibited, by the use of antisense or an inhibitory polynucleotide, 
i.e., a nucleic acid complementary to, and which can preferably hybridize specifically to, a 

10 coding mRNA nucleic acid sequence, e.g., a lung cancer protein mRNA, or a subsequence 
thereof. Binding of the antisense polynucleotide to the mRNA reduces the translation and/or 
stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise naturally- 
occurring nucleotides, or synthetic species formed from naturaUy-occurring subunits or their 

1 5 close homologs. Antisense polynucleotides may also have altered sugar moieties or inter- 
sugar linkages. Exemplary among these are the phosphorothioate and other sulfur containing 
species which are known for use in the art. Analogs are comprehended by this invention so 
long as they function effectively to hybridize with the lung cancer protein mRNA. See, e.g., 
Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

20 Such antisense polynucleotides can readily be synthesized using recombinant means, 

or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, 
including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 
Antisense molecules as used herein include antisense or sense oligonucleotides. 

25 Sense oligonucleotides can, e.g., be employed to block transcription by binding to the anti- 
sense strand. The antisense and sense oligonucleotide comprise a single-stranded nucleic 
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA 
(antisense) sequences for lung cancer molecules. A preferred antisense molecule is for a lung 
cancer sequence in the tables, or for a ligand or activator thereof. Antisense or sense 

30 oligonucleotides, according to the present invention, comprise a fragment generally at least 
about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an 
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein 
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is described in, e.g., Stein and Cohen (1988) Cancer Res. 48:2659 and van der Krol, et al. 

(1988) BioTechnicmes 6:958). 

RNA interference is a mechanism to suppress gene expression in a sequence specific 

manner. See, e.g., Brumelkamp, et al. f2002^ Sciencexpress (21March2002); Sharp (1999) 

Genes Dev. 13:139-141; and Cathew (2001) Curr. Op. Cell BioL 13:244-248. In mammalian 

cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to 

be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 41 1 :494- 

498. The mechanism may be used to downregulate expression levels of identified genes, e.g., 

treatment of or validation of relevance to disease. 



Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of lung cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 

15 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 
RNase P, and axhead ribozymes (see, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 
25: 289-317 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 

20 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g., WO 
94/26877; Ojwang, et al. (1993) Proc. Natl. Acad. Sci. USA 90:6340-6344; Yamada, et al. 
(1994) Human Gene Therapy 1:39-45; Leavitt, et al. (1995) Proc. Natl. A cad. Sci. USA 
92:699-703; Leavitt, et al. (19994) Human Gene Therapy 5:1 151-120; and Yamada, et al. 
(1994) Virology 205: 121-126). 

25 Polynucleotide modulators of lung cancer may be introduced into a cell containing the 

target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 
surface receptors. Preferably, conjugation of the ligand binding molecule does not 

30 substantially interfere with the ability of the ligand binding molecule to bind to its 

corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of lung 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
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formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 
understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating lung cancer in cells or organisms 
5 are provided. In one embodiment, the methods comprise administering to a cell an anti-lung 
cancer antibody that reduces or eliminates the biological activity of an endogenous lung 
cancer protein. Alternatively, the methods comprise administering to a cell or organism a 
recombinant nucleic acid encoding a lung cancer protein. This may be accomplished in any 
number of ways. In a preferred embodiment, e.g., when the lung cancer sequence is down- 

10 regulated in lung cancer, such state may be reversed by increasing the amount of lung cancer 
gene product in the cell. This can be accomplished, e.g., by overexpressing the endogenous 
lung cancer gene or administering a gene encoding the lung cancer sequence, using known 
gene-therapy techniques. In a preferred embodiment, the gene therapy techniques include the 
incorporation of the exogenous gene using enhanced homologous recombination (EHR), e.g., 

15 as described in PCT/US93/03868, hereby incorporated by reference in its entirety. 

Alternatively, e.g., when the lung cancer sequence is up-regulated in lung cancer, the activity 
of the endogenous lung cancer gene is decreased, e.g., by the administration of a lung cancer 
antisense or RNAi nucleic acid. 

In one embodiment, the lung cancer proteins of the present invention may be used to 

20 generate polyclonal and monoclonal antibodies to lung cancer proteins. Similarly, the lung 
cancer proteins can be coupled, using standard technology, to affinity chromatography 
columns. These columns may then be used to purify lung cancer antibodies useful for 
production, diagnostic, or therapeutic purposes. In a preferred embodiment, the antibodies 
are generated to epitopes unique to a lung cancer protein; that is, the antibodies show little or 

25 no cross-reactivity to other proteins. The lung cancer antibodies may be coupled to standard 
affinity chromatography columns and used to purify lung cancer proteins. The antibodies 
may also be used as blocking polypeptides, as outlined above, since they will specifically 
bind to the lung cancer protein. 

30 Methods of identifying variant lung cancer-associated sequences 

Without being bound by theory, expression of various lung cancer sequences is 
correlated with lung cancer. Accordingly, disorders based on mutant or variant lung cancer 
genes may be determined. In one embodiment, the invention provides methods for 
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identifying cells containing variant lung cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous lung cancer genes in a cell. In a preferred embodiment, 
the invention provides methods of identifying the lung cancer genotype of an individual, e.g., 
determining all or part of the sequence of at least one lung cancer gene of the individual. 
5 This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced lung cancer gene to a known lung cancer gene, i.e., 
a wild-type gene. 

The sequence of all or part of the lung cancer gene can then be compared to the 
10 sequence of a known lung cancer gene to determine if any differences exist. This can be 
done using known homology programs, such as Bestfit, etc. In a preferred embodiment, the 
presence of a difference in the sequence between the lung cancer gene of the patient and the 
known lung cancer gene correlates with a disease state or a propensity for a disease state, as 
outlined herein. 

15 In a preferred embodiment, the lung cancer genes are used as probes to determine the 

number of copies of the lung cancer gene in the genome. 

In another preferred embodiment, the lung cancer genes are used as probes to 

determine the chromosomal localization of the lung cancer genes. Information such as 

chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
20 chromosomal abnormalities such as translocations, and the like are identified in the lung 

cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a lung cancer protein or 
25 modulator thereof, is administered to a patient. By "therapeutically effective dose" herein is 
meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel, et al. (1992) Pharmaceutical Dosage Forms and Drug 
Delivery: Lieberman, Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846, 
30 08247691 8X, 0824712692, 0824716981; Lloyd (1999) The Art. Science and Technology of 
Pharmaceutical Compounding : and Pickar (1999> Dosage Calculations) . Adjustments for 
lung cancer degradation, systemic versus localized delivery, and rate of new protease 
synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
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drug interaction and the severity of the condition may be necessary, and will be ascertainable 
with routine experimentation by those skilled in the art. 

A "patient" for the purposes of the present invention includes both humans and other 
animals, particularly mammals. Thus the methods are applicable to both human therapy and 
5 veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 
primate, and in the most preferred embodiment the patient is human. 

The administration of the lung cancer proteins and modulators thereof of the present 
invention can be done in a variety of ways, including, but not limited to, orally, 
subcutaneously, intravenously, intranasally, transdermally, intraperitoneally, intramuscularly, 

10 intrapulmonary, vaginally, rectally, or intraocularly. In some instances, e.g., in the treatment 
of wounds and inflammation, the lung cancer proteins and modulators may be directly 
applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a lung cancer 
protein in a form suitable for administration to a patient. In the preferred embodiment, the 

15 pharmaceutical compositions are in a water soluble form, such as being present as 

pharmaceutical acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 

20 sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 

25 such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 

30 ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: 
carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, 
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lactose, corn and other starches; binding agents; sweeteners and other flavoring agents; 

coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage 

forms depending upon the method of administration. For example, unit dosage forms 

5 suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 

and lozenges. It is recognized that lung cancer protein modulators (e.g., antibodies, antisense 

constructs, ribozymes, small organic molecules, etc.) when administered orally, should be 

protected from digestion. This is typically accomplished either by complexing the 

molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 

10 packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a lung cancer protein 
modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous carrier. 
A variety of aqueous carriers can be used, e.g., buffered saline and the like. These solutions 

15 are sterile and generally free of undesirable matter. These compositions may be sterilized by 
conventional, well known sterilization techniques. The compositions may contain 
pharmaceutically acceptable auxiliary substances as required to approximate physiological 
conditions such as pH adjusting and buffering agents, toxicity adjusting agents and the like, 
e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate 

20 and the like. The concentration of active agent in these formulations can vary widely, and 
will be selected primarily based on fluid volumes, viscosities, body weight and the like in 
accordance with the particular mode of administration selected and the patient's needs (e.g., 
Remington's Pharmaceutical Science (15th ed., 1980) and Hardman, et al. (eds. 1996) 
Goodman and Gilman: The Pharmacologial Basis of Therapeutics^ 

25 Thus, a typical pharmaceutical composition for intravenous administration would be 

about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per patient per 
day may be used, particularly when the drug is administered to a secluded site and not into 
the blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher 
dosages are possible in topical administration. Actual methods for preparing parenterally 

30 administrable compositions will be known or apparent to those skilled in the art, e.g., 

Remington's Pharmaceutical Science and Goodman and Gilman, The Pharmacologial Basis 
of Therapeutics, supra. 
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The compositions containing modulators of lung cancer proteins can be administered 
for therapeutic or prophylactic treatments. In therapeutic applications, compositions are 
administered to a patient suffering from a disease (e.g., a cancer) in an amount sufficient to 
cure or at least partially arrest the disease and its complications. An amount adequate to 
5 accomplish this is defined as a 'therapeutically effective dose." Amounts effective for this 
use will depend upon the severity of the disease and the general state of the patient's health. 
Single or multiple administrations of the compositions may be administered depending on the 
dosage and frequency as required and tolerated by the patient. In any event, the composition 
should provide a sufficient quantity of the agents of this invention to effectively treat the 

10 patient. An amount of modulator that is capable of preventing or slowing the development of 
cancer in a mammal is referred to as a "prophylactically effective dose." The particular dose 
required for a prophylactic treatment will depend upon the medical condition and history of 
the mammal, the particular cancer being prevented, as well as other factors such as age, 
weight, gender, administration route, efficiency, etc. Such prophylactic treatments may be 

15 used, e.g., in a mammal who has previously had cancer to prevent a recurrence of the cancer, 
or in a mammal who is suspected of having a significant likelihood of developing cancer 
based, at least in part, upon gene expression profiles. Vaccine strategies may be used, in 
either a DNA vaccine form, or protein vaccine. 

It will be appreciated that the present lung cancer protein-modulating compounds can 

20 be administered alone or in combination with additional lung cancer modulating compounds 
or with other therapeutic agent, e.g., other anti-cancer agents or treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in the tables, such as antisense or RNAi 
polynucleotides or ribozymes, will be introduced into cells, in vitro or in vivo. The present 

25 invention provides methods, reagents, vectors, and cells useful for expression of lung cancer- 
associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo 9 or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 

30 introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and other well known methods for introducing cloned genomic 
DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., 
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Berger and Kimmel, Guide to Molecular Cloning Techniques. Methods in Enzvmology 

volume 152 (Berger), Ausubel, et al. (eds. 1999) Current Protocols (supplemented through 

1999), and Sambrook, et al. (1989) Molecular Cloning - A L aboratory Manual (2nd ed., Vol. 

1-3). 

5 In a preferred embodiment, lung cancer proteins and modulators are administered as 

therapeutic agents, and can be formulated as outlined above. Similarly, lung cancer genes 
(including both the full-length sequence, partial sequences, or regulatory sequences of the 
lung cancer coding regions) can be administered in a gene therapy application. These lung 
cancer genes can include antisense or inhibitory applications, e.g., as inhibitory RNA or gene 

10 therapy (e.g., for incorporation into the genome) or as antisense compositions. 

Lung cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses.. Such vaccine compositions 
can include, e.g., lipidated peptides (see, e.g.,Vitiello, et al. (1995) J. Clin. Invest. 95:341), 
peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres 

15 (see, e.g., Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al. (1994) Vaccine 
12:299-306; Jones, et al. (1995) Vaccine 13:675-681), peptide compositions contained in 
immune stimulating complexes (ISCOMS) (see, e.g., Takahashi, et al. (1990) Nature 
344:873-875; Hu, et al. (1998) Clin Exp Immunol. 113:235-243), multiple antigen peptide 
systems (MAPs) (see, e.g., Tam (1988) Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413; Tarn 

20 (1 996) J. Immunol. Methods 196:1 7-32), peptides formulated as multivalent peptides; 

peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 
vectors (Perkus, et al., p. 379 In: Kaufinann (ed. 1996) Concepts in vaccine development : 
Chakrabarti, et al. (1986) Nature 320:535; Hu, et al. (1986) Nature 320:537; Kieny, et al. 
(1986) AIDS Bio/Technology 4:790; Top, et al. (1971) J. Infect. Pis. 124:148; Chanda, et al. 

25 (1990) Virology 175:535), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) 
J. Immunol. Methods 192:25; Eldridge, et al. (1993) Sem. Hematol. 30:16; Falo, et al. (1995) 
Nature Med. 7:649), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 4:369; Gupta, et 
al. (1993) Vaccine 1 1:293), liposomes (Reddy, et al. (1992) J. Immunol. 148:1585; Rock 
(1996) Immunol. Today 17:131), or, naked or particle absorbed cDNA (Ulmer, et al. (1993) 

30 Science 259:1745; Robinson, et al. (1993) Vaccine 1 1:957; Shiver, et al., p. 423 In: 

Kaufinann (ed. 1996) Concepts in vaccine development: Cease and Berzofsky (1994) Annu. 
Rev. Immunol. 12:923 and Eldridge, et al. (1993) Sem. Hematol. 30:16). Toxin-targeted 
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delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or 
5 mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 
10 aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

1 5 Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 

encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient. 
This approach is described, for instance, in Wolff, et. al. (1990) Science 247:1465 as well as 
U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; WO 
98/04720; and in more detail below. Examples of DNA-based delivery technologies include 

20 "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, cationic lipid 
complexes, and particle-mediated ("gene gun") or pressure-mediated delivery (see, e.g., U.S. 
Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention 
can be expressed by viral or bacterial vectors. Examples of expression vectors include 

25 attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode lung cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 

30 Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover, et al. (1991) Nature 351:456-460. A wide variety of other vectors useful 
for therapeutic administration or immunization e.g., adeno and adeno-associated virus 
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the 
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like, will be apparent to those skilled in the art from the description herein (see, e.g., Shata, et 

al. (2000) MolMed Today 6:66-71; Shedlock, et al. (2000) J. Leukoc. Biol. 68:793-806; 

Hipp, et al. (2000) In Vivo 14:571-85). 

Methods for the use of genes as DNA vaccines are well known, and include placing a 

5 lung cancer gene or portion of a lung cancer gene under the control of a regulatable promoter 

or a tissue-specific promoter for expression in a lung cancer patient. The lung cancer gene 

used for DNA vaccines can encode full-length lung cancer proteins, but more preferably 

encodes portions of the lung cancer proteins including peptides derived from the lung cancer 

protein. In one embodiment, a patient is immunized with a DNA vaccine comprising a 

10 plurality of nucleotide sequences derived from a lung cancer gene. For example, lung cancer- 
associated genes or sequence encoding subfragments of a lung cancer protein are introduced 
into expression vectors and tested for their immunogenicity in the context of Class I MHC 
and an ability to generate cytotoxic T cell responses. This procedure provides for production 
of cytotoxic T cell responses against cells which present antigen, including intracellular 

15 epitopes. 

In a preferred embodiment, DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the lung cancer polypeptide encoded by the DNA vaccine. 
Additional or alternative adjuvants are available. 

20 In another preferred embodiment lung cancer genes find use in generating animal 

models of lung cancer. When the lung cancer gene identified is repressed or diminished in 
metastatic tissue, gene therapy technology, e.g., wherein antisense or inhibitory RNA directed 
to the lung cancer gene will also diminish or repress expression of the gene. Animal models 
of lung cancer find use in screening for modulators of a lung cancer-associated sequence or 

25 modulators of lung cancer. Similarly, transgenic animal technology including gene knockout 
technology, e.g., as a result of homologous recombination with an appropriate gene targeting 
vector, will result in the absence or increased expression of the lung cancer protein. When 
desired, tissue-specific expression or knockout of the lung cancer protein may be necessary. 
It is also possible that the lung cancer protein is overexpressed in lung cancer. As 

30 such, transgenic animals can be generated that overexpress the lung cancer protein. 

Depending on the desired expression level, promoters of various strengths can be employed 
to express the transgene. Also, the number of copies of the integrated transgene can be 
determined and compared for a determination of the expression level of the transgene. 
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Animals generated by such methods will find use as animal models of lung cancer and are 

additionally useful in screening for modulators to treat lung cancer. 



Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, kits are 
also provided by the invention. In diagnostic and research applications such kits may include 
at least one of the following: assay reagents, buffers, lung cancer-specific nucleic acids or 
antibodies, hybridization probes and/or primers, antisense polynucleotides, ribozymes, RNAi, 
dominant negative lung cancer polypeptides or polynucleotides, small molecule inhibitors of 
lung cancer-associated sequences, etc. A therapeutic product may include sterile saline or 
another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing instructions (e.g., 
protocols) for the practice of the methods of this invention. While the instructional materials 
typically comprise written or printed materials they are not limited to such. A medium 
capable of storing such instructions and communicating them to an end user is contemplated 
by this invention. Such media include, but are not limited to electronic storage media (e.g., 
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such 
media may include addresses to internet sites that provide such instructional materials. 

The present invention also provides for kits for screening for modulators of lung 
cancer-associated sequences. Such kits can be prepared from readily available materials and 
reagents. For example, such kits can comprise one or more of the following materials: a lung 
cancer-associated polypeptide or polynucleotide, reaction tubes, and instructions for testing 
lung cancer-associated activity. Optionally, the kit contains biologically active lung cancer 
protein. A wide variety of kits and components can be prepared according to the present 
invention, depending upon the intended user of the kit and the particular needs of the user. 
Diagnosis would typically involve evaluation of a plurality of genes or products. The genes 
typically will be selected based on correlations with important parameters in disease which 
may be identified in historical or outcome data. 
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Example 1: Gene Chip Analysis 

Molecular profiles of various normal and cancerous tissues were determined and 
5 analyzed using gene chips. RNA was isolated and gene chip analysis was performed as 

described (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev. 14:981- 
993). 
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Tables 1A and 1B were previously filed on April 18, 2001 in USSN €0/284,770 (1 8501-001 500US) and on November 29, 2001 in USSN 60/334,370 
(18501-001520US) 
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Ue 141880 


ESTs 


0.75 


0.27 


104857 


AA04091Q 


na. 1 SUOO 


ESTs 


2.6 


3.3 


104665 


A A(\AK4 0ft 


He 9957*; 


ESTs 


1.23 


0.49 


104989 


aai090Q8 


Ue 11RR15 

ns. i loo I*? 


ESTs 


0.63 


0132 


105729 


AA9Q96Q4 
rV\c94C09*r 


Ue OR 07 


ESTs; Weakly similar to PHOSPHOLEMMAN PR 


0.86 


0.34 


105847 


AA0QRRO8 


He 09941 


ESTs 


1.32 


0.4 


105894 


AMHUUy/9 


Uq OKfiOl 


calcitonin receptor-like receptor activi 


0.78 


0.28 


106490 


A A 4 DC < 

/vvtOlOOl 


Ue 1 ic;t;o7 
ns.i looj/ 


ESTs; Weakly similar to dipeptidase prec 


1.2 


0.47 


106536 




Ue 900fM 


ESTs 


0.82 


0.15 


106605 


AA/tT771R 


ns.ziiuo 


Homo sapiens mRNA; cDNA DKFZp564B076 (fr 


0.99 


0.07 


106667 


AAARIflQfi 
WVwJlUOO 


ns.iOOf o 


ESTs 


1.17 


0.4 


106773 




Uq 1 Q 88.00 

ns. loooco 


ESTs 


1.46 


0.43 


106797 


MM**/ 09DZ 


Me 1KQQ40 


ESTs 


1.18 


0.32 


106844 


AA/RKncc 
AMHOOUOO 


Ue 1RR910 

ns. i oot i J 


sperm associated antigen 6 


0.98 


0.51 


106870 


/VWJ/O/O 


Ue 98R00 
nS.^OOJU 


serum deprivation response (phosphatidyl 


1.05 


0.14 


106954 


AA/ORQRfi 


Ue 90AmR 


ESTs 


1.25 


0.33 


107054 


nftOUUIDv 


Ue 1A0fiS 

ns. i**ooo 


ESTs 


1.11 


0.4 


107292 




Ue A7RQ 


ESTs; Weakly similar to oxi dative-stress 


1.07 


2.58 


107S94 


An WOO J I 


Ue iRRn^n 
ns. iooUJU 


ESTs 


0.7 


0,21 


107997 


AA0070R8 
rvAUJ/ JOO 


Uq 89990 


Human DNA sequence from done 141 H5 on c 


1.02 


0.48 


108041 


AMU** 1 OOl 


Ue R1QT7 


ESTs 


1.44 


0.51 


108087 


A A 04 err AQ 


Uq Af\CAZ 

ns.*fvo*to 


ESTs 


1.98 


1 


108382 


f\t\\J! 4000 


We 87798 
nS.Of fcQ 


macrophage receptor with collagenous str 


1.52 


0.72 


108435 


A Af»7fl7R7 


ns. l 9h i u i 


ESTs 


2.53 


1.53 


108480 


AADRinQO 
MMUO 1 U90 


Uq 88058 

ns.oouoo 


ESTs 


1.56 


0.48 


109252 


aa^o/ao/i 


U 0 aCQAA 


ESTs 


2.69 


3.18 


109550 


rUloo4 


Ue 98081 


ESTs 


1.19 


0.65 


109613 


POO 001 


Uq 97R 1Q 


ESTs 


1.01 


0.29 


109837 


nUUOOO 


L| e 0Q7Q9 

ns.za/9i 


ESTs 


0.81 


0.15 


109893 


nu**f oo 


Uq 00484 


ESTs 


1.44 


0.32 


109984 




Ue mOQQ 

ns. luzsy 


ESTs 


0.62 


0.14 


110099 


U1 CSRA 
Ml 0000 


Ue 90748 

ns.<:o/**o 


ESTs 


1.01 


0.28 


110837 


N30796 


Ha. 17424 


ESTs; Weakly similar to semaphorin F [H. 


1.1 


0.22 


111247 




He 18769 


Homo sapiens mRNA; cDNA DKFZp564B2062 (f 


1.26 


0.26 


111341 


N80935 


Hs.22483 


ESTs 


1.57 


0.52 


111510 


R07856 


Hs.16355 


ESTs 


3.96 


1 


111737 


R25410 


Hs.9218 


ESTs 


0.97 


0.24 


113195 


T67112 




""yc20g1 1. 8 1 Stratagene lung (#937210) 


1.22 


0.35 


113238 


T62979 


Hs.189813 


ESTs 


Z27 


0.45 


113540 


T90496 


Hs.16757 


ESTs 


1.06 


0.22 


113552 


T90889 


Hs.16026 


ESTs 


1.16 


0.42 


113606 


T93093 


Hs.17125 


ESTs 


1.48 


0.7 


113695 


T96965 


Hs.17948 


ESTs 


1.54 


0.28 


113946 


W84753 


Hs.37896 


ESTs 


1.79 


0.72 


114251 


Z39898 


Hs.21948 


ESTs 


1.95 


0.25 


114359 


Z41589 


Hs.153483 


ESTs; Moderately similar to HI chloride 


1.42 


0.13 


115230 


AA278300 


Hs.182980 


ESTs 


2.62 


0.42 


115279 


AA279760 


Hs.63671 


ESTs 


1.79 


0.91 


115566 


AA398083 


Hs.43977 


ESTs 


0.86 


0.2 


115965 


AA446661 


Hs.173233 


ESTs 


0.79 


0.04 


116166 


AA461556 


H3.202949 


KIAA1 102 protein 


2.29 


0.68 


116279 


AA486073 


Hs.57362 


ESTs 


Z27 


0.78 * 


117023 


H88157 


Hs.41105 


ESTs 


1.36 


0.16 
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117209 


H99959 


Hs.42768 


ESTs 


1.46 


0.48 


118901 


N90719 


Hs.94445 


ESTs 


1.61 


1 


118981 


' N93839 


Hs.39288 


ESTs 


1.34 


0.48 


119073 


R32894 


Hs.45514 


v-ets avian erythroblastosis virus E26 o 


1.14 


0.27 


119221 


R98105 




""Vr30g1 t.s1 Soares fetal fiver spleen 


1.32 


0.53 


119824 


W74536 


Hs.184 


advanced glycosylate end product-speci 


1 


0.19 


119861 


W80715 




ESTs; Moderately similar to ALU SUB 


1.83 


0.45 


120041 


W92775 


Hs.59368 


ESTs 


1.23 


0.55 


120132 


Z38839 


Hs.125019 


ESTs; Highly similar to K1AA0886 protein 


0.91 


0.37 


120467 


AA251579 


Hs.187628 


ESTs 


1.87 


1.91 


121314 


M402799 


Hs.182538 


ESTs 


1.3 


0.31 


121643 


AA417078 


Hs,193767 


ESTs 


231 


0.68 


121690 


AA418074 


Hs.1 10286 


ESTs 


1.47 


0,51 


122633 


AA454080 


Ks.34853 


inhibitor of ONA binding 4; dominant rveg 


1.31 


0.63 


123978 


C20653 


Hs.170278 


ESTs 


1.52 


0.32 


124214 


H58608 


Hs.151323 


ESTs 


0.93 


0.35 


124357 


N22401 




""yw37g07.s1 Morton Fetal Cochlea Homo 


1.29 


1 


124438 


N40188 


Hs.102550 


ESTs 


1.36 


0.7 


125167 


W45560 


Hs.102541 


ESTs 


1.46 


0.69 


125174 


W51835 


Hs.231082 


EST 


3.07 


3.76 


125422 


AA903229 


Hs.153717 


ESTs 


1.34 


0.3 


125561 


AI417667 


Hs.22978 


ESTs 


1.89 


0.63 


125831 


D60988 




"•HUM145B09B dontech human fetal brain 


0.94 


0.36 


127002 


R35380 


Hs.24979 


ESTs 


3.02 


4.06 


127307 


AA369387 


Hs.126712 


ESTs; Weakly similar to plL2 hypofoetlca 


1.01 


0.69 


127609 


AA622559 


Hs.150318 


ESTs 


1.21 


0.32 


127959 


AI302471 


Hs,124292 


ESTs 


25 


1 


12B458 


D52193 


Hs.56340 


ESTs 


1.13 


0.33 


128624 


AA479209 


Hs-102647 


ESTs 


1.45 


0.58 


128789 


AA486567 


Hs.105695 


ESTs 


1.1 


0.34 


128798 


AF014958 


Hs.105938 


chemoWne (C-C motif) receptor-like 2 


1.16 


0.55 


128952 


R51076 


Hs.107361 


ESTs; Highly similar to Rap2 Interacting 


204 


24 


129057 


X62466 


Hs.214742 


CDW52 antigen (CAM PATH- 1 antigen) 


1.77 


0.73 


129210 


M401654 


Hs.202949 


KIAA1102 protein 


1.11 


0.36 


129240 


W24360 


Hs.237868 


Interleukin 7 receptor 


0.91 


0.41 


129402 


T63781 




""yc21g01.s1 Stratagene lung (#937210) 


1.36 


0.43 


129565 


X77777 


Hs.198726 


vasoactive Intestinal peptide receptor 1 


0.67 


0.08 


129593 


AA487015 


Hs.98314 


Homo sapiens mRNA; cDNA DKFZp586L012Q (f 


1.3 


0.42 


129626 


AA447410 


Hs.11712 


ESTs; Weakly similar to Hi! ALU SUBFAMI 


1.28 


0.46 


129699 


M458578 


Hs.12017 


KIAA0439 protein; homolog of yeast ubiqu 


1.58 


1 


129898 


N48595 


Hs.13256 


ESTs 


1.13 


0.53 


129958 


L20591 


Hs.1378 


annexinA3 


0.81 


0.31 


130273 


U59914 


Hs.1 53863 


MAD (mothers against decapentaplegic; Dr 


0.59 


0.22 


130655 


N92934 


Hs.17409 


cysteine-rich protein 1 (Intestinal) 


1.44 


0.76 


130857 


T94452 


Hs.201591 


ESTs 


0.96 


0.42 


131061 


N64328 


Hs.22567 


ESTs; Moderately slmflar to HYPOTHETICAL 


1.51 


0.45 


131066 


F09006 


Hs.22588 


ESTs 


0.97 


0.37 


131263 


R38334 


H3.249S0 


regulator of G-proteln signalling 5 


234 


282 


131589 


U52100 


Hs.29191 


epithelial membrane protein 2 


1.2 


0.62 


131686 


AA1 57428 


Hs.30687 


Grb2-assoclated binder 2 


0.95 


0.38 


131751 


H18335 


Hs.31562 


ESTs 


1.47 


0.52 


132430 


T23630 


Hs.258575 


EST 


1.86 


209 


132476 


N67192 


Hs.49476 


Homo sapiens clone TUA8 Crl-du-chat regl 


1.73 


0.58 


132836 


F09557 


Hs.57929 


slit (Drosophlla) homolog 3 


0.91 


0.29 


133120 


X64559 


Hs.65424 


tetranectin (plasminogen-blnding protein 


0.82 


0.2 


133486 


045370 


Hs.74120 


adipose specific 2 


1.29 


0.48 


133565 


H57058 


Hs.204831 


ESTs 


225 


0.57 


133651 


U97105 


Hs.173381 


dJbydropyrlmJdJnase-lfl© 2 


1.65 


0.62 


133835 


M059489 


Hs.76640 


ESTs; Highly similar to RGC-32 [R.norveg 


1.16 


0.34 


13397B 


W73859 


Hs.78061 


transcription factor 21 


0.79 


0.27 


133985 


L34657 


Hs.78146 


platelet/endotheJJa! cell adhesion molec 


0.99 


0.28 


134299 


AA487558 


Hs.8135 


ESTs 


1.02 


0.46 


134300 


U81984 


Hs.166082 


endothelial PAS domain protein t 


0.86 


0.42 


134323 


AA028976 


Hs.8175 


Homo sapiens mRNA; cDNA DKFZp564M0763 (f 


1.19 


0.27 


134343 


DS0683 


Hs.82028 


transforming growth factor, beta recepto 


1.21 


0.67 


134417 


D87969 


Hs.82921 


solute carrier family 35 (CMP-slalic ad 


1.28 


1 


134561 


U76421 


Hs.65302 


adenosine deaminase; RNA-speciffc; B1 (h 


212 


0.55 


134624 


W67147 


Hs.8700 


deleted in liver cancer 1 


235 


274 


134696 


H88354 


Hs.8861 


ESTs 


1.35 


0.33 


134749 


L10955 


Hs.89485 


carbonic anhydrase IV 


0.89 


0,2 


134786 


L06139 


Hs.69640 


TEK tyrosine kinase; endothelial (venous 


0.48 


0.21 


134869 


T35288 


Hs.90421 


ESTs; Moderately similar to 1111 ALU SUB 


214 


264 


135346 


M21056 


Hs.992 


phosphoitpase A2; group IB (pancreas) 


0.63 


0.13 


100113 


D00591 


Hs.84746 


Chromosome condensation 1 


1 


215 


100147 


D13666 


Hs.136348 


Homo sapiens mRNA for osteoblast specif] 


0.5 


2 


100280 


D42085 


Hs.155314 


IOAA0095 gene product 


1.02 


1.39 


100335 


D63391 


Hs.6793 


platelet-activating factor acetylhydrola 




5.58 


100360 


D78335 


Hs.75939 


Uridine monophosphate kinase 


0.91 


204 


100372 


D79997 


Hs.184339 


WAA0175 gene product 


0.75 


203 


100486 


HG1112-HT1112 


TIGR: ras-Dke protein TC4 


1.09 


1.93 


100559 


HG2197-HT2267 


•collagen, type VII, alpha 1' 


0.97 


3.6 


100576 


HG2290-HT2385 


'calcitonln/alpha-CGRP, alL transcript 


1 


1 


100668 


HG2981-HT3938 


"TIGR: CD44 (eplcan, alL transcript 12 


0.85 


1.9 


100906 


HG4716-HT5158 


Guanosine ff-Monophosphate Synthase 


1.18 


229 


100930 


HG721-HT4827 




"TIGR: placental protein 14, endometrial 


1 


1.45 
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100360 


J00124 


Hs.1 17729 


keratin 14 (epidermolysis bullosa simple 


0.84 


101031 


J05070 


Hs.151738 


"Matrix metalloprotelnase 9 (gelatinase 


0.77 


101111 


L08424 


Hs.1619 


Achaete-scute complex (Drosophila) homd 


1 


101124 


L10343 


Hs.1 12341 


'Protease Inhibitor 3, sMiKferfved (SKA 


0.62 


101175 


L18920 


Hs.36980 


■Melanoma antigen, family A, T 


1 


101204 


124203 


Hs.82237 


Ataxia-telangiectasia group D-associated 


0.74 


101431 


M1988B 


Hs.1076 


Small prollne-rfch protein 1B (comlfln) 


0.85 


101448 


M21389 


Hs.195850 


keratin 5 (epidermolysis bullosa simplex 


. 0.61 


101511 


M27826 


Hs.267319 


Endogenous retroviral protease 


1.03 


101526 


M29540 


Hs.220529 


Carcinoembryontc antigen-related cell ad 


1.07 


101548 


M3132B 


Hs.71642 


"Guanine nucleotide binding protein (G p 


0.97 


101625 


M57293 




"Human parathyroid hormone-related peptl 


1 


101649 


M60047 


Hs.1690 


Heparin-blnding growth factor binding pr 


1 


101724 


M69225 


Hs.620 


bullous pemphigoid antigen 1 (230/240kO) 


1 


101748 


M76482 


Hs.1925 


Desmoglein 3 (pemphigus vulgaris antigen 
"Solute carrier family 7 (cationlc amino 


1 


101759 


M80244 


Hs.184601 


1.07 


101804 


M86699 


Hs.169840 


TTK protein kinase 


1 


101806 


MS 6757 


Hs.1 12408 


S100 calcium-binding protein A7 (psorias 


0.74 


101809 


M86849 




"Homo sapiens connexin 26 (GJB2) mRNA, c 


1 


101845 


M93426 


Hs.78867 


'Protein tyrosine phosphatase, receptor- 


1 


101851 


M94250 


Hs.82045 


MIdkine (neurite growth-promoting factor 


1.13 


102083 


U10323 


Hs.75117 


"Interieukin enhancer binding factor 2, 


1.03 


102154 


U17760 


Hs.75517 


"Lamtoin, beta 3 [rilcsm (125kO). kaiini 


0.94 


102193 


U20758 


Hs.313 


secreted phosphoprotein 1 (csteoponttn; 


0.34 


102305 


U33286 


Hs.90073 


chromosome segregation 1 (yeast homolog) 


1.45 


102348 


U37519 


Hs.87539 


Aldehyde dehydrogenase B 


0.52 


102581 


U51145 


Hs.77256 


Enhancer of zeste (Drosophila) homolog 2 


0.91 


102610 


U65011 


Hs.30743 


Preferentially expressed ant/gen in meia 


1 


102623 


U66083 


Hs.37110 


•Melanoma antigen, family A, 9 (MAGE-9)" 


1 


102669 


U71207 


Hs.29279 


Eyes absent (Drosophila) homolog 2 


1 


102696 


U74612 


Hs.239 


Forkhead box M1 


1.06 


102829 


U91618 


Hs.80962 


Neurotensin 


1 


102888 


X04741 


HS.7611B 


Ubiquitin carboxyMerminal esterase L1 


1.13 


102913 


X07696 


Hs.80342 


keratin 15 


0.7 


102915 


X07820 


Hs.2258 


Matrix Metal loprotein ass 10 (Stromolystn 


1.15 


102963 


X15943 


Hs.37058 


"Calcitonin/calcitonin-related polypepti 


1 


103021 


X53587 


Hs.85266 


"Integrln, beta 4" 


1.38 


103036 


X54925 


Hs.83169 


Matrix metalloprotease 1 (interstitial c 


1 


103058 


X57348 


Hs.184510 


StraSfin 


1.25 


103060 


X57766 


Hs.1 55324 


matrix metalloprotelnase 11 (stromelysin 
"Cadherln 3, P-cadherin (placental)* 


1 


103119 


X63629 


Hs.2877 


1.16 


103206 


X72755 


Hs.77367 


monokine \nducsd by gamma interferon 


0.71 


103242 


X76342 


Hs.389 


"Alcohol dehydrogenase 7 (class IV). mu 


1 


103312 


X82693 


Hs.3185 


■Lymphocyte antigen 6 complex, locus D; 


0.92 


103478 


Y07755 


Hs.38991 


S100 calcium-binding protein A2 


1.05 


103558 


Z19574 


Hs.2785 


keratin 17 


0.65 


103576 


Z26317 


Hs.2631 


Desmoglein 2 


0.79 


103587 


Z29083 , 


Hs.82128 


5T4 Oncofetal antigen 


1 


103594 


Z31560 


Hs.816 


"SRY (sex determining region Y)-box 2, p 


0.71 


103768 


AA089997 




"ESTs, Highly similar to integral membra 


0.99 


104158 


AA454908 


Hs.8127 


KIAA0144 gene product 


0.96 


104558 


R56678 


Hs.88959 


Human DNA sequence from clone 967N21 on 


1.23 


104689 


AA010665 




ESTs 


0.96 


104733 


AA019498 


Hs.23071 


ESTs 


1.18 


104906 


AA055809 


Hs.26802 


Protein kinase domains containing protel 


1.11 


104978 


AA088458 


Hs.19322 


ESTs; Weakly simitar to III! ALU SUBFAMI 


1.64 


105012 


AA116036 


Hs.9329 


"Homo sapiens mRNA for fls353, complete 


1.19 


105175 


AA186604 


Hs.25740 


ESTs; Weakly similar to unknown [S.cerev 


0.9 


105263 


AA227926 


Hs.6662 


ESTs 


0.95 


105298 


AA233459 


Hs.26369 


ESTs 


1 


105312 


AA233854 


Hs.23348 


S-phase kinase-associated protein 2 (p45 


1.32 


105719 


AA291644 


Hs.36793 


Hypothetical protein FU23188 


1.28 


105743 


AA293300 


Hs.9598 


ESTs 


1 


106012 . 


AM1 1621 


Hs.8895 


ESTs; same as 8FH67 


0.94 


10S231 


AA429571 


Hs.38002 


WAA1355 protein 


1.04 


106540 


AA454607 


Hs.38114 


Hypothetical protein FU11100 


1.26 


106575 


AA456039 


Hs.1 05421 


ESTs 


1 


106632 


AA459897 


Hs.1 1950 


GPl-anehored metastasis-associated prote 


0.87 


106727 


AA465342 


Hs.34045 


HypotheUca) protein FU20764 


0.87 


106906 


AA490237 


Hs.222024 


Transcription factor BMAL2 (cycle-tike f 


0.61 


107059 


AA608545 


Hs.23044 


RAD51 (S. cerevistae) homolog (E cdl Re 


0.48 


107104 


AA609786 


Hs.15243 


Nucleolar protein 1 (120kD) 


1.01 


107151 


AA621169 


Hs.8687 


ESTs; procollagen l-N proteinase 


0.97 


107284 


S74G39 


Hs.291904 


Accessory proteins BAP31/BAP29 


1.15 


107901 


AA026418 


Hs.91539 


ESTs 


0.72 


107922 


AA028028 


Hs.61460 


Ig superfamily receptor LNIR precursor 


1 


107932 


AA029317 


Hs.18878 


Hypothetical protein FU21 620 


1 


108695 


M121315 


Hs.70823 


WAA1077 protein 


0.91 


108857 


AA133250 


Hs.62180 


ESTs 


1 


108860 


AA133334 


Hs.129911 


ESTs 


0.73 


108990 


AA152296 


Hs.72045 


ESTs 


1 


109166 


M179845 


Hs.73625 


"RAB6 Interacting, kinesirvlike (rabktne 


1 


109424 


AA227919 


Hs.85982 


Hyaluronan synthase 3 


1 


109665 


F05012 


Hs.27027 


Hypothetical protein DKFZp762H1311 


1.42 


109970 


H09281 


Hs.13234 


ESTs 


1.13 
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110015 


H10998 


Hs.7164 


A dlslntegrin and metalloproteinase doma 


0.84 


4 OC 

1.95 


110156 


H18957 


Hs.4213 


ESTs 




1.41 


110561 


H59617 


Hs.5199 


HSPC1 50 protein similar to ubiquilin-con 


0.91 


0.10 


111223 


N68921 


Hs.34806 


ESTs; Weakly similar to neogenin [H.sapi 


0.91 


J.lO 


111345 


N89820 


Hs.14559 


Hypothetical protein FU 10540 


4 

1 


1 95 


111876 


R36239 


Hs.293246 


"ESTs, Weakly similar to putative p150 [ 


0.83 


1.27 


111902 


R39191 


Hs.1 09445 


KIAA1 020 protein 


u.yi 


n cm 

U.9 1 


112244 


R51309 


Hs.70823 


KIAA1 077 protein 


0.77 


d.Ul 


112973 


T17271 




cDNA FU 13308 fis, clone OVARC1001436, 


4 
1 


4 
1 


112989 


T23482 


Hs.89981 


"DJacy (glycerol kinase, zeta (104kD)" 


ft CR 
U.30 


1 rw 


113047 


T25867 


Hs.7549 


ESTs 


ft 07 

U.07 


9 


113095 


T40920 


Hs.1 26733 


ESTs 


4 

1 


4 

1 


113531 


T90345 


Hs. 16740 


Hypothetical protein FU 11035 


ft AO 


1 AA 


113970 


W86748 


Hs.8109 


ESTs 


1.17 


1 71 
l./O 


114346 


Z41450 


Hs.130489 


"ATPase, aminophospholipld transporter-! 


ft CK 

U.OO 


n m 


114407 


AA010188 


Hs.1 03305 


ESTs 


ft Q 


1 Aft 
1.00 


114471 


AA028074 


Hs.1 04613 


RP42 homolog 


l.Ub 


1 ">A 


114509 


AA043551 


Hs.101799 


KIAA1 350 protein 


1 00 

l.W 


9 V> 


115060 


AA253214 


Hs.1 98249 


'Gap junction protein, beta 5 (connexln 


0.79 


A AtX 


115091 


AA255900 


Hs.1 84523 


K1AA0965 protein 


ft 71 


1 09 


115123 


AA256642 


Hs.236894 


•ESTs, High sim to LRPIJw low density 1 


0.59 


< 07 

i.yr 


115291 


AA279943 


Hs.1 22579 


ESTs 


1 


1 OC 
1.^0 


115506 


AA292537 


Hs.45207 


Hypothetical protein KIAA1335 


115 


1.48 


115522 


AA331393 


Hs.47378 


ESTs 


0.5 




115536 


AA347193 


Hs.62180 


ESTs 


1 


1 


115697 


AA411502 


Hs.63325 


Homo sapiens type II membrane serine pro 


1 


6.53 


115909 


AA436666 


Hs.59761 


ESTs 


1 


6.98 


115978 


AA447522 


Hs.69517 


Differentially expressed in Fanconi anem 


1 


2.31 


116028 


AA452112 


Hs.42644 


thioredoxln-like 


0.99 


1.68 


116107 


M45696B 


Hs.92030 


ESTs 


1.14 


1.8 


116134 


AA460246 


Hs.50441 


CGI-04 protein 


1.11 


1.86 


116157 


AA461063 


Hs.44298 


Hypothetical protein 


0.99 


1.9 


116158 


AA461187 


Hs.61762 


Hypoxia-Inducible protein 2 

"Homo sapiens cONA FU10238 fis, clone H 


0.44 


0.86 


116335 


AA495830 


Hs.87013 


0.62 


3.89 


116483 


C14092 


Hs.76118 


Ubiquitin carboxyl- terminal esterase L1 


1.04 


2.36 


117320 


N23239 


Hs.211092 


LUNX protein; PLUNC(pa!ate lung & nasal 


0.51 


0.64 


117557 


N33920 


Hs.44532 


Diubiquitin 


1.11 


2.63 


117693 


N40939 


Hs.112110 


PTD007 protein 


0.98 


1.79 


117881 


N50073 


Hs.260622 


Butyrate-induced transcript 1 


1 


1.43 


118368 


N64339 


Hs.48956 


ESTs 


0.67 


2.86 


118566 


N68558 


Hs.42824 


Hypothetical protein FU10718 


1.21 


0.83 


118695 


N71781 


Hs.50081 


KIAA1 199 see CVA7.doc 


0.88 


1.00 


119780 


W72967 


Hs.191361 


ESTs; Weakly similar to hypothetical pro 


1 


1 


119845 


W79920 


Hs.58561 


G protein-coupled receptor 87 

"ESTs, Moderately similar to p53 regulat 


1 


1 


120102 


W95428 


Hs.1 32927 


1 


1 


120104 


W95477 


Hs.1 80479 


ESTs 


0.69 


o.U/ 


120486 


AA253400 


Hs.1 37569 


Tumor protein 63 kDa with strong homolog 


1.08 


12.05 


120859 


AA350158 


H s.1619 


Achaete-scute complex (Drosophila) homo! 


1 


1 


120860 


AA360240 


Hs.97019 


EST 


1 


1 


120948 


AA397822 


Hs.104650 


Hypothetical protein FU 10292 


1.04 


2.15 


120983 


AA398209 


Hs.97687 


EST 


1 


1 


121362 


AA405500 


Hs.97932 


Chondromodulin 1 precursor 


1 


1 


121369 


M405657 


Hs.128791 


CGI-09 protein ' 


1 


1.8 


121791 


M423978 


Hs.293317 


"ESTs, Weakly similar to JM27 (H.sapiens 


1 


1 


123005 


M479726 


Hs.105577 


ESTs 


1 


1 


123044 


AA481549 


Hs.130881 


B-ceil CLL/lymphoma 11A (zinc finger pro 


0.95 


1.88 


123160 


M468687 


Hs.284235 


ESTs 


1.59 


4.98 


123479 


AA599469 


Hs.135056 


clone RP5-850E9 on chromosome 20 


1.19 


1.64 


123571 


M6Q8956 


Hs.1 12619 


"ESTs, Weakly similar to PQ0109 Purkinje 


1.03 


1.14 


123829 


AA620697 


Hs.112208 


XAGE-1 protein 


1.39 


22 


124006 


D60302 


Hs.108977 


ESTs 


1 


4.85 


124059 


F13673 


Hs.99769 


ESTs 


1.49 


8.62 


124960 


T15386 


Hs.194766 


Seizure related gene 6 (mouse)-Bke 


0.76 


0.77 


125218 


W73561 


Hs.1 10024 


NAOH:ubiquInone oxldoreductase MLRQ subu 


1.33 


1.77 


125453 


R06041 


Hs.18048 


■Melanoma antigen, family A, 10" 


0.8 


1.42 


125759 


AA425587 


Hs.82226 


Glycoprotein (transmembrane) nmb 


1.52 


2.26 


125972 


AA434562 


Hs.35406 


"ESTs, Highly similar to unnamed protein 


1.05 


2.48 


125994 


H55782 


Hs.270799 


EST 


1 


1.95 


126395 


N7Q192 


Hs.278956 


Hypothetical protein FU12929 . 


1 


1.35 


126645 


A11 67942 


Hs.61635 


STEAP1 (Homo sapiens BAG clone RG041D1 1 


1 


2.23 


127221 


AJ 354332 


Hs.72365 


ESTs 


0.73 


3.27 


127479 


AA513722 


Hs.1 79729 


collagen; type X; alpha 1 (Schrrrid metaph 


0.51 


1.94 


126192 


AI204246 




KJAA1085 protein 


1.8 


3.16 


128610 


L38608 


Hs.1 0247 


activated leucocyte cell adhesion moiecu 


0.89 


a 0*7 
0.97 


128777 


U46006 


Hs.10526 


Cysteine and glyclne-rlch protein 2 


1 


1 


128924 


AA234962 


Hs.26557 


Plakophilin 3 


1.3 


2.97 


129041 


H58873 


Hs.1 69902 


Solute carrier famiry 2 (racintateo gi 


U.04 


2.04 


129099 


H50398 


Hs.108660 


"ATP-binding cassette, sub-family C (CFT 


0.B7 


1.04 


129404 


AA172056 


Hs.111128 


ESTs 


1 


1 


129466 


L42583 




"Genbank Homo sapiens keratin 6 Isoform 


0.72 


12.67 


129605 


S72493 


Hs.1 15947 


Keratin 16 (focal non-epldermolytfc palm 


0.92 


1.5 


129628 


U26727 


Hs.1174 


"Cyclln-dependent kinase Inhibitor 2A (m 


0.85 


1.93 


130023 


X13461 


Hs.239600 


Calmodulin-like 3 


0.84 


1.22 


130080 


X14850 


Hs.147097 


"H2A hlstone family, member X" 


0.98 


1.96 


1303B5 


AA126474 


Hs.155223 


stanntocalcin 2 


1 


1 
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130410 


V01514 


Hs. 155421 


Alpha-fefoprotein 


0.63 


0.63 


130441 


U35835 


Hs.301387 


'Human ONA-PK mRNA, partial cds" 


1.15 


3.65 


130482 


132866 


Hs.1578 


Bacuiovira! IAP repeat-containing 5 (sur 


1 


1.88 


130553 


AA430032 


Hs.252587 


Pituitary tumor-transforming 1 


0.92 


1.96 


130577 


M 35410 


Hs.162 


Insulin-ifke growth factor binding prole 


1.17 


4.7 


130627 


L23B08 


Hs.1 695 


Matrix metalloproteinase 12 (macrophage 


0.69 


4.05 


130800 


AA2233B6 


Hs. 19574 


ESTs; Weakly similar to katanin p80 subu 


1.13 


2.41 


130939 


AA5986B9 


Hs.21400 


ESTs 


0.8 


ft fto 


131046 


X02530 


Hs.2248 


INTERFERON-GAMMA INDUCED PROTEIN PRECURS 0.8 


1.10 


131244 


D38076 


Hs.24763 


RAN binding protein 1 


1.13 


a or 
1.00 


131877 


J04088 


Hs. 156346 


Topoisomerase (DNA) II alpha (170kD) 
"Doublecortex; Ilssencephaly, X-linked ( 


1 


1 


131927 


M461549 


Hs.34780 


0.81 


0.62 


131965 


W90146 


Hs.35962 


ESTs 


0.74 


3.27 


131976 


080008 


Hs.36232 


KIAA0186gene product 


1 


1 


132354 


L05187 


Hs.211913 


Small prollne-nch protein 1A 


0.69 


1.43 


132543 


AA417152 


Hs.5101 


ESTs; Highly similar to protein regutatl 


0.79 


4.27 


132632 


N59764 


Hs.5398 


guanine-monophosphate synthetase 
■laminin gamma2 chain gene (LAMC2), exon 
"Low density lipoprotein receptor-relate 
"Serins (or cysteine) proteinase Inhibit 


1 


1.08 


132653 


U31201 


Hs.54451 


1 


1 


132659 


Z75190 


Hs.54481 


0.89 


0.89 


132710 


W93726 


Hs.55279 


0.64 


( 4.41 


13275B 


W52432 


Hs.56105 


"ESTs. Weakly similar to WDNM RAT WDNM1 


1.55 


2.08 


132767 


L0518B 


Hs.231622 


Small proline-rich protein 2B 


0.83 


1.66 


132816 


M74542 


Hs.575 


Aldehyde dehydrogenase 3 


0.55 


0.55 


132990 


AA458761 


Hs. 18387 


transcription factor AP-2 alpha (activat 
"A dislntegrin and metalloproteinase dom 
"SRB7 (suppressor of RNA polymerase B, y 


1 


3.53 


133070 


U69611 


Hs.64311 


1.16 


2 


133282 


U52960 


Hs.286145 


1 


2.7 


133317 


AA215299 


Hs.70830 


U6 snRNA-assoclated Sm-like protein LSm7 


0.95 


1.42 


133370 


AA156897 


Hs.72157 


Homo sapiens mRNA; cDNA OKF2p564l1922 


1.12 


2.55 


133391 


X57579 


Hs.727 


H.sapiens activin beta-A subunit (exon 2 


1.65 


1.76 


133832 


H03387 


Hs.241305 


estrogarwesponsive B box protein (EBBP) 
"Serine (or cysteine) proteinase inhibit 


1.02 


1.39 


134032 


Z81326 


Hs.78589 


1 


1 


134168 


AA398908 


Hs.181634 


"Homo sapiens cONA: FU23602 fis, clone 


0.95 


1.53 


134218 


AA227480 


Hs.80205 


Plm-2 oncogene 
""collagen, type XI, alpha 1""" 


1.36 


2.48 


134405 


R67275 


Hs.82772 


0.76 


2.86 


134453 


X70683 


Hs.83484 


SRY (sex determining region Y)-box 4 


1.89 


3.78 


134470 


X54942 


Hs.B3758 


CDC2S protein kinase 2 


1.82 


4.11 


134645 


U87459 


Hs.167379 


"Cancer/testls antigen (NY-ESO-1, CTAG1, 


0.82 


0.63 


134781 


M17183 


Ks.69626 


Parathyroid hormone-flke hormone 


1 


1 


135002 


U19147 


Hs.272484 


G antigen 6 


1 


1 


100040 


M97935 




AFFX control: STAT1 


0.92 


1.25 


101201 


L22524 


Hs.2256 


matrix metalloproteinase 7 (matrflysin; 


2.92 


8.5 


101664 


M60752 


Hs.121017 


H2A histone famffy; member A 


1 


1 


102025 


U03911 


Hs.78934 


mutS (E. coll) homotog 2 (colon cancer 


0.8 


1.61 


102031 


U04898 


Hs.2156 


RAR-reiated orphan receptor A 


1 


1 


102221 


U24576 




LIM domain only 4 


1 


1 


102270 


U30255 


Hs.75868 


phosphogluconate dehydrogenase 


1.08 


1,43 


102339 


U37022 


Hs.95577 


cydirvdependent kinase 4 


0.88 


1.32 


102391 


U41668 


Hs.77494 


deoxyguanosine kinase 


1.07 


1.58 


103000 


X51956 


Hs.146560 


enolase 2; (gamma; neuronal) 


0.91 


1.49 


103395 


X94754 


Hs.119503 


methionlne-tRNA synthetase 


0.89 


1.32 


105638 


AA2B1599 


Hs.20418 


. Homo sapiens mRNA for tor histone H2B; c 


0.91 


1.25 


105726 


AA292328 


Hs.9754 


activating transcription factor 5 


0.94 


1.48 


114841 


AA234722 


Hs.55408 


ESTs; Moderately similar to CALCIUM-DEPE 


0.78 


1.56 


115206 


AA262491 


Hs.1 86572 


ESTs 


1 


1 


115906 


AA436616 


Hs.82302 


ESTs 


0.74 


2.52 


119132 


R49046 


Hs.107911 


ATP-bindlng cassette; sub-family B (MOW 


1.1 


1.51 


124163 


H30539 


Hs.189838 


ESTs 


1 


1 


126487 


AA482505 


Hs. 184601 


solute carrier family 7 (cationic amino 


1.01 


1.46 


127141 


AA307960 


Hs.75478 


KIAA0956 protein 


0.85 


1.4 


128034 


AA905754 


Hs.75103 


tyrosine 3-monooxygenase/tryptophan 5-mo 


1 


1.18 


128609 


AA234365 


Hs. 102456 


survival of motor neuron protein interac 


1 


1.5 


128895 


R37753 


Hs. 106985 


ESTs 


1.7 


2 


130199 


Z48579 


Hs.172028 


a dislntegrin and metailoprotease domain 


1 


1 


130524 


U89995 


Hs.1 59234 


fbrkhead box E1 


1 


1 


133000 


U24152 


Hs.62402 


p21/Cdc42/Rac1 -activated kinase 1 (yeast 


1 


1 


133658 


M25766 


Hs.75426 


secretogranin II (chromogranln C) 


1 


1 


135047 


AA460466 


Hs.93597 


ESTs 


1 


1 


100053 


M27830 




AFFX control: 28S ribosomal RNA 


0.88 


1.53 


100114 


D00596 


Hs.62962 


thymidytate synthetase 


0.68 


1.86 


100128 


D11094 


Hs.61153 


proteasome (prosome; macropaln) 26S subu 


1.29 


2.03 


100154 


D14657 


Hs.81892 


KIAA0101 gene product 


0.71 


4.26 


100161 


D14694 


Hs.77329 


phosphatidylserine synthase 1 


1.02 


1.56 


100168 


014874 


Hs.394 


adrenomedullin 


0.46 


1.17 


100187 


D17793 


Hs.78183 


aldo-keto reductase family 1; member C3 


1 


1 


100188 


021053 


Hs.57101 


minichromosoms maintenance deficient (S. 


0.97 


1.4 


100217 


D26600 


Hs.89545 


proteasome (prosome; macropaln) subunit; 


1.13 


1.9 


100220 


D28364 




■"Human mRNA for annexln II, 5UTR (seq 


1.11 


1.00 


100287 


D43950 


Hs.1600 


chaperonin containing TCP1; subunit 5 (e 


1.13 


Z09 


100297 


D49489 


Hs.182429 


protein disulfide feomerase-reiated prot 


0.92 


1.78 


100330 


055716 


Hs.77152 


minichromosome maintenance deficient (S. 


1.07 


1.61 


100355 


D78129 




""Homo sapiens mRNA for squatene epoxid 


0.96 


1.87 


100364 


D78586 


Hs.154868 


carbamoyt-phosphate synthetase 2: aspart 


1.49 


246 


100368 


079987 


Hs.153479 


extra spindle poles; S. cerevls/ae; homo 


0.59 


1.32 


100398 


D84557 


Hs.155462 


minichromosome maintenance deficient (mi 


1.08 


1.9 


100438 


087448 


Hs.91417 


topoisomerase (DNA) II binding protein 


1 


Z15 



87 



WO 02/086443 



PCT/US02/12476 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



100465 


DB7953 


Hs.75789 


100491 


HG1153-HT1153 


100518 


HG174-HT174 




100528 


HG1828-HT1857 


100661 


HG2874-HT3018 




100667 


HG2981-HT3127 


100830 


HG4074-HT4344 


101061 


K03515 


Hs.944 


101131 


L10838 


Hs.167460 


101162 


L14595 


Hs.174203 


101181 


L19686 


Hs.73798 


101183 


LI 9779 


Hs.795 


101216 


L25876 


Hs.84113 


101228 


L27706 


Hs.82916 


101233 


L29008 


Hs.878 


101247 


L33801 


Hs.78802. 


101332 


L47276 




101342 


L76191 


Hs.1 82018 


101396 


M15796 


Ha.78998 


101423 


M18391 


Hs.89839 


101445 


M21259 


Hs.1066 


101505 


M27398 


Hs.75692 


101525 


M29536 


Hs.12163 


101535 


M30448 


Hs.251669 


101607 


M38690 


H3.1244 


101624 


M55998 




101758 


M77836 


Hs.79217 


101839 


M93036 


Hs.692 


101853 


M94362 


Hs.760B4 


101977 


S83364 




101992 


U01038 


Hs.77597 


102009 


U02680 


Hs.82643 


102012 


U03057 


Hs! 11 8400 


102039 


U05861 


Hs.201967 


102123 


U14518 


Hs.1 594 


102130 


U 15009 


Hs.1 575 


102148 


U16954 


Hs.75823 


102210 


U23028 


Hs.2437 


102220 


U24389 


Hs.65436 


102260 


U28386 


Hs.1 59557 


102330 


U35451 


Hs.77254 


102423 


U44754 


Hb. 17931 2 


102455 


U48705 


Hs.75562 


102499 


U51478 


Hs76941 


102522 


US 3347 


Hs 183556 


102S90 


U62136 

UUl 1 Ov 




102676 


1/72514 


Hs. 12045 


102687 


VI wJr O 


Hs.93002 


102704 


U7fifi.TR 
vi uoou 


Hs 54089 


102781 


U83843 




102784 




Hs.61796 


102827 


U91327 


Hs.6456 


102935 


X13482 


Hs 80506 


102972 


Xlfififi? 


Hs 87268 


102983 


X17620 


Ha 118638 


103023 




Hs 117950 


103038 


X54941 


H8.77550 


103075 




Hs.2934 


103168 


X68314 


Hs]2704 


103185 


XB9910 


Hs.74368 


103212 


X73874 


Hs.2393 


103223 


X74801 


Hs.1708 


103260 


X78416 


Hs.3155 


103262 


X78565 


Hs.204133 


103330 


X85373 


Hs.77496 


103364 


X90872 


Hs.75854 


103375 




Hs.54416 


103391 


X94453 


Hs, 114366 


103404 


X95586 


Hs78596 


103437 


X98260 


Hs.82254 


103448 


X99133 


Hs.204238 


103605 


Z35402 


Hs.194657 


103646 


Z68228 


Hs.2340 


103658 


Z74615 


Hs.172928 


103774 


AA092898 


Hs.92918 


104261 


AF008442 


Hs.5409 


104276 


C02193 


Hs.85222 


104289 


C16281 


Hs.75478 


104434 


L02870 


Hs.1640 


104453 


M19169 


Hs.123114 


104611 


R98280 


Hs.125845 


104758 


M024661 


Hs.7010 


105114 


AA156532 


Hs.11801 


105132 


AA159501 


Hs.247280 


105174 


AA1 86613 


Hs.34744 



N-myc downstream regulated 

Nucleoside Diphosphate Kinase Nm23-H2s 

Desmoplakin I 

'"Nexin, Glta-Derived*™ 

Ribosomal Protein L39 Homolog 

""Eplcan.AlL Splice 11"" 

Rad2 

glucose phosphate isomerase 
splicing factor; arginlne/serine-ftch 3 
solute carrier family 1 (glutamate/neutr 
macrophage migration Inhibitory factor ( 
H2A his tone family; member 0 
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pyrroline-5-carboxytate reductase 1 
membrane component; chromosomal 4; surfa 
lamtn B2 

""putative Rab5-lnteracting protein (cl 

polo (DrosophiaHike kinase 

protein tyrosine kinase 9 

singed (Drosophlla)-like (sea urchin fas 
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Table 1B shows the accession numbers for those pkeys in Table 1A lacking unigenelD's. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on'sequence 
similarity using Clustering and Alignment Tools (DoubJeTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
Accession column. 
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Pkey: Unique Eos probeset ide ntifier number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Accessions 

100661 23182.1 BE623001 L0509S AA383604AW966416 N53295 AA460213 AW571519 M603655 

100667 26401 3 L05424 X56794 S66400 X55150 W60071 AW351820 X55938 M83326 BE005289 BE070059 M83324 BE005248 BE069717 BE1 81648 BE069700 

AW606203 BE069721 AW382138 AW803776 BE463954 BE005334 BE005274 T27386 M932714 AA972695 AW377728 Al 632506 T29066 
A1783934 AW377727 BE163715 AL047291 AA279047 AA523003 BE008048 BE440141 W23614 BE090519 BE092193 N291B1 N2035B N44153 
BE546944 T69231 AW377441 AA907406 H50799 AW051 41 6 AI420712 BE620922 A12791 61 AA992549 W4719B BE005241 AI342698 H50700 
A1969974 A1863855 AA374490 AW1 30675 Al 950633 AA146687 H99482 X55150 BE005414 BE005339 N28294 A1673068 A1887890 AW804171 
A1675961 AW804172 AA778841 AL048050 A1127757 AI095568 AW204965 AW468978 W31898 AI052595 AI278771 BE464018 AI081503 AI824196 
AA51321 1 AA411062 AW084376 N48752 AA703209 N35580 AW059918 AA054563 AI280942 T27619 BE621435 N66010 AW589527 Alt 60414 
AA283090 AA962536 H82726 W52115 W45432 W60433 AA577548 AA146714 BE150994 AA054615 AW796025 AW382768 BE565671 C00444 
AA054555 

100668 26401 3 L05424 X56794 S66400 X55150 W60071 AW351820 X55938 M83326 BE005289 BE070059 M83324 BE005248 BE069717 BE1 81648 BE069700 

AW606203 BE069721 AW382138 AW803776 BE463954 BE005334 BE005274 T27386 AA932714 AA972695 AW377728 AI632506 T29066 
AI783934 AW377727 BE163715 AL047291 AA279047 AA523003 BE008048 BE440141 W23614 BE090519 BE092193 N29181 N20358 N44153 
BE546944T69231 AW377441 AA907406 H50799AW051416A1420712BE620922A1279161 AA992549W47198BE005241 AI342696 H50700 
AI969974 A1863855 AA374490 AW130675 A1950633 AA146687 H99482 X55150 BE005414 BE005339 N28294 A1673068 AI887B90 AW804171 
AI675961 AW8041 72 AA778841 AL048050 AI127757 AW95568 AW204965 AW468978 W31 898 AI052595 AI278771 BE46401 8 AI081503 A1824196 
AA51321 1 AA411062 AW084376 N48752 AA703209 N35580 AW059918 AA054563 AI280942 T27619 BE621435 N66010 AW589527 AI160414 
AA283090 AA962536 H82726 W521 15 W45432 W60433 AA577548 AA146714 BE1 50994 AA054615 AW796025 AW382768 BE565671 C00444 
M054555 

101332 25130 1 J04088 NM.001067 AF071 747 AJ01 1741 N85424AL042407AA218572BE296748BE083981 AL040877 AW499918AW675045H17813 
BE081 283 M670403 AW504327 BE094229 AA104024 AI471482 A1970337 AA73761 6 A1827444 AW003286 A1742333 AI344044 AI765634 



91 



WO 02/086443 



PCT/US02/12476 



AI948838 AW235336 AW172827 AA095289 BE046383 A1734240 W1 6699 AI660329 AI289433 AA933778 AW469242 AA468838 AA806983 
AA625873 W78031 BE206307 AA550803 AI743147 AI990075 AA948274 M129533 A1635399 AA605313 AI624669 AW594319 AI221834 AI337434 
AA307708 BE550282 AI760467 AI630636 AI221521 AW674314 AW078889 A1933732 AI686969 A1186928 AW074595 AI127486 AL079644 
AI910815 H17814 AA310903 AW137854 T19279 AA026682 AA306035 AW383390 AW383389 AW383422 AW383427 AW383395 H09977 
AA306247 AA352501 AW403639 F05421 AA224473 AA305321 H93904 AA089612 AW391543 AW402915 AW173382AW402701 AW403113 
R94438 N73126 H93466 AA090928 M095051 T29025 AW951071 U7277 L47276 AI375913 BE384156 W24652 AA746288 AA568223 BE090591 
H93033 N57027 AA504348 AA327653 AW9S9913 N53767 AA843715 AI463437 AW263710 AI076594 AA583483 AWB73194 AW575166 A1128799 
A1803319 AL042776 AW074313 A1887722 AI0322B4 AA447521 AI123B85 N29334 AI35491 1 AW090687 AA238763 AA435535 AA236910 
AAQ47124 AA236734 AW514610 H93467 AA962007 AI446783M127259 AI613495 A1686720 A1587374AA936731 AA702453 AI859757 
AA216786 A1251819 AI469227 AAB06022 AI092324 N71868 AA96B782 AA236919 AA809450 AA227220 AA765284 AI192007 AA768810 
AA8Q5794 AA729280 AA806238 AW768817 N71 879 AI050686 AA505822 AA668974 AI6881 60 BE045915 AW466315 AA731 314 AA649568 
AA834316 AW591901 AW063876 AW294770 AI300266 AI336094 AI560380 AA721755 H09978 D20305 D291 65 AW821790 BE150864 F01 675 
AI457474 AW466316 AA550969 AA630788 
100780 458 127 BE561958 BE561728 BE397612 BE514391 BE269037 BE514207 BE562381 BE514256 BE514403 BE514250 BE397832 BE269598 BE559865 

BE396881 BE560031 BE514199 BE550037 BE560454 
100830 4002 1 AC004770 W05005 AA356068 AA094281 H29358 T56781 AW875313 L37374 BE312466 BE31 1755 BE207106 BE293320 BE018115 AW239090 
BE548830 AW247547AA776062BE397382AA486713T10111 T09340 AW498981 BE547280 AA356003 AW581520 AW875331 AA58072O 
AW875336 BE276873 BE408229 AW188148 BE255166 BE253761 AW793727 AW373141 AW581548 AA471223 AA305950 BE263976 AA626820 
BE257409 AW360962 AA090655 C00312 BE312741 BE407213 AA209352 AW298199 AW248553 AW297794 AW731722 BE300586 AW731972 
AW615446 BE301599 AW615520 AA4B6714 AW440257 AA196516 AA564630 AA618079 AW192592 AW474985 AA6045B0 AI627461 AA765440 
AI680394 AL135548 AI683224 AI581126 AW245096 AW194154 H29274 N70363 AA629758 AA580602 AA862006 AI863841 AI097657 AI928583 
AI358774 BE2434B7 AA620553 AA653297 AA292690 T101 10 Z38906 AA908544 AA340930 AI185438 T03328 T28844 A1687010 AI864965 
AI872575 BE388740 T56780 AW3731 38 BE258717 AA699671 
100906 4312L1 AU07691 6 BE2981 10 AW239395 AW672700 NM.0Q3875 U10860 AW651755 BE297958 C03806 AI795876 AA644165 T36030 AW392852 

AA446421 AW881866 AI469428 BE548103 T96204 R94457 N78225 AI564549 AW004984 AW780423 AW675448 AW087B90 AA971454 AA305698 
AA879433 AA535069 AI394371 AA928053 AI378367 N59764 AI364000 AI431285 T81090 AW674657 AW674987 AA897396 AW673412 BE063175 
AW674408 AJ202011 R00723 AI753769 AI460161 AW079585 AW275744A1873729 D25791 BE537646 T81139 R00722 
100930 16865J J04129 NMJJ02571 AA293088AA477016AA404631 T28299 M476904 AA433965 AA430486 AA495907 AI151391 AA291495 AA402723 W25651 
AA70681 6 AI82671 2 AW296294 AA293479 A12765B1 AW0441 54 AI0801 80 Al 41 7985 Al 2741 KJ AI47421 2 AA495908 AA635664 AI0921 14 
AI804952 AA479874 A1597661 A142051 1 AA479738 AA421417 AA421 247 AA436220 AL047797 M34046 N42277 AA22B076 W02698 AI420297 
AA434011 A1369971 AA479731 AI865541 AI41 8020 AA421 246 AA452764 AL048051 
102221 3861J NMLC06769 U24576 AW161961 AW160473 AW160465 AW160472 AW161069 AI824831 AW162635 AI990356 AW162477 AW162571 AI520836 
AW162352 AW162351 AW162752 AI962216 AI537346 AA853902 H17667 BE045346 BE559802 BE255391 AA985217 AA235051 AI129757 
AW366451 T34489 D56106 D56351 AI936579 AW023219 AW889335 AW889120 AW889232 AW889175 BE093702 AW889349 AA147546 
AI952998 AA91 2579 A1143356 AW90221 1 R64717 AW157236 AI81 5242 D45274 AW263991 AA442920 AA129965 AL035713 A1923255 AI949082 
AI1 42826 AI684160 AI701 987 A1678954 AI827349 BE463635 AW628092 AW3022B1 AA493203 BE348856 BE536419 AW1 93969 AW673561 
AW592609 AI224044 H43943 AA091912 R49632 R48353A1568409 R48256 AI198046 H27986 H43899AI678759 AI680310 A1624220 H17052 
AA156410 N56062 AI699430 AA664629 T09406 T10459 AA627606 AI379584 N83831 N88633 AW022651 AA971281 AA248036 AI039197 
AI914689 AA973825 AL047305 AA129966 AI79B369 AW264348 A1445879 A1658759 N67924 AI933507 AI216121 AI333174 T10972 AI375028 
AI1 86756 AI273778 AA610487 AI797946 AA853903 AA903939 AI338587 AI278494 AW627595 AA904019 

M86849 AA31 5280 NMJJ04004 AA31 5269 BE 1 42653 AA461400 AW802042 BE 1 52893 AW3831 55 AA490688 AW1 1 7930 AW384563 AW384544 
AW384566 AW378307 AW378323 AW839085 AA257102 AW378317 AW276060 AW271245 AW378298 AW384497 AI5981 1 4 AW264544 AI018136 
AW021810 AA961504 AW086214 AW771489 AW192483 A1290266 AW192488 AW384490 AW007451 AW890895 AA554460 AA613715 
AW020066 A1783695 AI589498 AI917637 AW264471 AW384491 AI816732 AW368530 AW368521 AW36B463 AA461087 AI341438 AI970613 
AI040737AI418400AA947181 AA962716 A1280695AW769275 AW023591 AM 60977 AA055400 N71882AA490466AW243772AW316636 
AI076554 AW511702 N69323 H88912 AA257017 AI952506 H88913 AI912481 AA600714 BE465701 N64149 C00523 N64240 AA677120 
R61573 BE005029 X98091 AA297307 BE537267 BE566138 BE568139 F1 1561 BE564795 BE568776 AW064005 BE566479 BE380035 BE567012 
BE566634 BE566568 AA298060 BE566043 BE568813 BE568618 AA2B3070 BE565414 BE566738 BE568585 BE565667 BE5661 16 BE566433 
U62136 AF049140 BE567057 BE567297 BE567403 BE564316 BE567400 BE568854 BE566588 AA448772 AA071363 AW732642 BE564996 
AA297763 AA278550 M421083 AA298184 AA091007 AA984577 AA205916 N28759 AL031291 C15757 C15761 H02728 BE566410 AA1 29335 
AA41 9499 N87741 BE379689 BE004824 BE37961 1 D25874 AA148454 AA323654 AW95031 1 M448795 AW749423 AA773386 AA773843 
AW020327 BE348580 BE504258 BE549990 BE220200 AI673334 AI202679 AA975515 D61421 Al 1 68688 AA 102843 AW246821 AI276203 
AI074054 AI633824 AI962927 AI148926 N50969 AI308911 AA410994 AW373025M1 48455 H02620 AA688293 AI246318 N22220 AI917777 
A1050943 AI097286 AA663794 AW368682 AW627826 AW07B734 AI253060 AA7491 54 AA832236 AI19235B AW024676 AA448676 AA764891 
BE439467 AA661534 AA258061 AI090546 AA995157 AI05101 1 AA584421 A1026032 AW591 338 AW589563 AA776914 AW024884 AA421002 
F09219 BE464500AI383595AA954244AA601583 AA737304 AA1 95549 AA805778AI055876AA1 64942 AW01 3961 AI672608 AW514211 D69441 
AW582574 AA160935 BE566501 BE564612 BE565353 BE568195 BE565447 BE568302 BE566097 BE565470 BE564249 AL036217 AW749424 
BE567494 AA1 02842 AA314761 AV661237 C1421 1 AA651866 AW798997 AA47C605 
101977 29073 1 AF1 12213 AL050318 T24804 AW2481 36 BE386341 BE2631 77 W1 6677 BE250224 BE563669 BE267405 BE546577 AV65 1354 AV651 292 

AI346903 A1539128 AI189171 S83364 AW073849 AI816760 AW073309 AI422690 AA296692 A1860301 AI805446 N77735 AI340328 BE092530 
AW028742 BE088442 AA657742 AA742438 AW170086 AI038920 AI432379 N36073 AI9361 94 AA868655 AA983612 AI077505 BE080433 
AI375014 AI126547 AI348244 AI346077 AI748952 N26915 AI753574 AI093341 AI278762 BE092517 N74204 H08158 T58 1 49 Al 129303 N58366 
AA524456 BE 1 22661 AA542925 AI2461 20 A1735203 AA706829 AA877544 AI082289 AA926687 N92840 AW249798 AA934763 AW998363 
AI128632 N25202 A1240209 AW1 1 8B92 N80744 R35655 AI342321 AI340141 AW878792 A1857321 H0961 0 W04601 AW006650 AA1 26006 
AA553675 A1052791 AW059835 A1041906 AA814658 AW002059 AA729483 AI609301 AA994633 AA903651 AI459183 T95072 AW088530 
AA1261 12 AI800091 AI561215 H17502 AW475072 AI819003 AI683272 AI262701 AW793140 T81 787 R99585 AI275160 AI31O420 AI698929 
AA1591 74 AI827968 F30305 F30309 AA806662 AI091923 AW878722 AA583430 AW571913 AI674584 AA292533 AI079471 AA642325 AA71 9050 
AW793172 AA305476 AW103745 T23459 N79525 AI7B4438 AA534551 AW193751 AI074360 BE281214 T32229 W25066 W01205 T63086 
AW795348 AI361287 AW795353 AW795349 AA594759 AI400295 D1 1489 AI370689 AA482356 AA485295 W401 51 AA564661 AW300745 
AI345938 AJ374975 AI423782 AW1 93899 AA612604 Al 183409 AA99S156 AW366963 AW366977 AI284860 AA846503 AI985064 AA844576 
AA737921 M873274 BE241546 BE241540 AA4B4058 AW468970 AA1 27876 AA159120 AW001568 AW795213 AW795258 AW795330 BE250589 
BE387572 AA910895 AA161217 BE250380 W31500 T95167 AI719306 AI359224 
102781 20812J BE258778 BE281230 BE410044 T33723 AW672694 AW410439 NMJJ06429 AFQ26292 T35505 BE542333 T08940 AU076737 AW247471 

BE393215 AW328640 BE542408 T32170 BE302544 T319S5 BE206898 BE275738 T32570 BE386426 BE298746 BE389937 BE293991 BE31 5289 
BE389578 R34739 R15312 BE279365 BE277756 AL03601 9 T33725 BE277779 BE302962 AL047294 BE276505 T09070 T33673 BE312580 
AW387774BE257175 AW674367 BE253331 BE270344 BE299831 BE273576 T32062AI751831 BE618381 AA304899 BE252268 U46364 
BE256790 BE207199 8E256209 BE251941 BE250791 BE313955 BE269806 BE543623 BE279212 BE252289 T31699 BE262220 T31669 
AA315781 AA192212N84547 BE292737 BE259631 AA232179 Al 1331 44 T31 292 AA31 5945 BE407301 BE251184BE409006 AI880158 AI904003 
AI9041 14 AW651768 AW651 763 R58247 BE271897 U83843 C05298 BE261 609 BE255973 AA351650 N84631 BE263637 AW452910 AA328465 
AA324549 AW579525 BE252296 BE257551 AL048332 BE208630 AA359336 AW327897 AA15J742 AA305816 BE076862 BE076796 BE263161 
AA323785 M676588 AA626565 AA078917 W87657 R09002 R94021 AA31 2032 BE276665 AA295608 AW407162 AA329374 AW87791 2 N27885 
AA369256 AA360968 BE250476 NB5427 BE265569 AI278639 A1816576 AI691037 AW328583 AI567949 AI983455 AI927732 AI81 1297 AI571508 
AW073674 BE296039 BE467326 A1828796 AI816578 AW511604 AI921213 AW152427 AI795787 AI801618 AW168866 AI628144 AI890339 
AW173690 AW51 1 540 BE535620 AA383014 BE301 164 AI866596 AW51 4909 AA658050 AW575243 M074631 AI093488 AW575408 AW675443 
AW615636 AW732207 AW377638 AA321784 AA641629 M633105 AA527640 AW129146 AW615672 BE394607 AA483902 AW475032 BE378532 



101809 32963 1 



102590 15932J 
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125831 1522905J 
126192 45743_3 
113195 178688.1 



119861 
112973 



129402 
105936 
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47367.1 
260931.1 



129466 2094.50 



WO 02/086443 PCT/US02/12476 
AA872808 AI469388 AW105268 BE047301 AW591843 AW410066 AW517153 A1950495 AA746641 A1914878 AA673185 AI696911 AA548625 
AA91 1505 AA148762 AW674535 AI587329 BE328328 AW270348 AA158225 AW1 1 7705 AW474997 AW519193 AA614757 AW664383 AJ082647 
AW590973 AI47671 1 AA1 92213 NB8741 BE464552 AW072679 AI453708 AA152166 AA805924 A1581 078 AI125768 AW173484 AI96198Q 
BE300766 AI199698 AI636792 AW247333 AW272861 M078818 AA150012 AA551232 AA678821 AW873869 AW768266 AI660315 AA319210 
AA814551 AA157994AA31B888 AI582962AW089224 AI356098 AI343694 AW072598 N21054AI301249 M742924H17917 AW328584 . 
AW248898 AI751830 AA907816 R08898 AW087989 A1828300 AA148596 AI269577 T33426 AA213571 A1973201 AA686279 R49612 AI573183 
AW799762 AW410068 AW769668 AI962097 AI475204 D57490 AW51 7531 BE245270 AW470008 T33427 AW005731 AI795795 T23753 AW272981 
T15747 AA552875 T23644 AW3612B9 A1758558 BE207435 AA876958 T03361 AAB83569 F37533 AA582321 AW082524 R4221 2 AA973847 
T18900 AA086202 AI559867AI302418 M948667 AA745670 T08939 T33724T33722 BE621568 D57489 D25906 BE621151 F16510 C05966 
T35127 AA630427 AI933481 AA309426 AI918440 BE561854 BE616866 BE394675 BE296173 AW9516B7 BE383739 BE616141 BE312730 
BE535351 AW080575 BE313330 BE616664 AI354390 AA847315 BE544509 BE515212 BE297833 BE278808 BE544844 AW090178 AI890664 
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Tables 2A-8C were previously filed on Novembers, 2001 In USSN 60/339,245 (18501 -0041 OOUS) 

Table 2A shows 504 genes down-regulated In lung tumors relative to normal lung and chronically diseased lung. Chronically diseased lung samples represent chronic non- 
mallgnant lung diseases such as fibrosis, emphysema, and bronchitis. These genes were selected from 59680 probesets on Ihe Eos/Affymetrix Hu03 GenecWp array. Gene 
expression data for each probeset obtained from this analysis was expressed as average intensity (Al), a normalized value reflecting the relaflve level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unlgene number 

Unigene Title: Unlgene gene flue 
R1: 



R2: 
R3: 



R4: 
R5: 
R& 

R7: 
R8: 



90th percentile of Al tor normal lung samples divided by the 80th percentile of Al for adenocarcinoma and squamous cell carcinoma lung tumor 
samples. 

median of Al for normal lung samples divided by 90lh percentile of Al for adenocarcinoma and squamous cell carcinoma lung tumor samples, 
median of Al for normal lung samples minus the 1 5th percentile of Al for alt normal lung, chronically diseased lung and tumor samples divided by 
the 90th percentile of Al for adenocarcinoma and squamous cell carcinoma lung tumor samples minus the 15th percentile of Al for all normal 
lung, chronically diseased lung and tumor samples. 

average of Al for normal lung samples divided by average Al for squamous cell carcinoma and adenocarcinoma lung tumors, 
median of Al for normal lung samples divided by the 90th percenUle of Al for adenrearcmomas. . 

median of Al for normal lung samples minus the 15th percentile of Al for ell normal lung, chronically diseased lung and tumor samples divided by the 90th 
percentile of Al for adenocarcinomas minus the 15th percentile of Al for all normal lung, chronically diseased lung and tumor samples, 
average of Al for normal lung samples divided by Ihe 90th percentile of Al for squamous cell carcinomas. 

median of Al for normal lung samples minus the 15th percentile of Al for all normal lung, chronically diseased lung and tumor samples divided by the 90th 
percentile of Al for squamous cell carcinomas minus the 1 5th percenUle of Al for all normal lung, chronically diseased lung and tumor samples. 
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myocilin; trabecular meshwork Inducible 
glutathione peroxidase 3 (plasma), 
angiopoietln 1 

growth differentiation factor 10 
transcription elongation factor A (Sll); 
KIAA0275 gene product 
Vpre-B 
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Interteukin 8 receptor; beta 
"Homo sapiens core 2 beta-1,6-N-acetylgl 
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Accession not listed In Genbank 

"Human sodium cotransporter RKST1 mRNA, 

"Homo sapiens skeletal muscle UM-protei 

Human clone W2-6 mRNA from chromosome X 

"Human ataxia-telangiectasia locus prote 
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"Human B-cell receptor associated pro lei 

progastricsin (pepsinogen Q 

oplold-binding protein/cell adhesion mol 
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phospholemman 

Homo sapiens cDNA FU14103 As, clone MA 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 35.80 

ESTs 

ESTs 

ESTs 

ESTs 

EST 35.40 
Homo sapiens done 24760 mRNA sequence 58.00 
ESTs 

olfactory receptor; family 7; subfamily 

ESTs 29.00 

ESTs; Weakly similar to GLUCOSE TRANSPOR 

ESTs; Weakly similar to PTB-ASSOCIATED S 28.40 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

"yw35g11.s1 Morton Fetal Cochlea Homo sa 
ESTs 

TATA box binding protein (TBP)-flssodate 
ESTs 

calcium channel; voltage-dependent; alph 
ESTs; Weakly similar to SPLICEOSOME ASSO 
serum deprivation response (phosphatidyl 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

DKFZP586D0824 protein 
aryl hydrocarbon receptor-interacting pr 
K1AAC022 gene product 
HUM525A05B Human placenta potyA* (TFuJl 
ESTs; Weakly similar to metalloprofease/ 
ESTs; Weakly similar to HC1 ORF IM.muscu 
ESTs 

ESTs; Weakly similar to plL2 hypothetica 
aa59b04.s1 NCLCGAPJ3CB1 Homo sapiens c 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

mannosyl (alpha-1;3-)-glycoprotein beta- 
glutaihlone peroxidase 3 (plasma) 
ESTs; Weakly similar to LR8 [H.sapiensJ 
ESTs; Weakly similar to ZINC FINGER PROT 
ESTs 

potassium Inwardly-rectifying channel; s 31 .20 

ESTs; Moderately similar to Hi! ALU SUB 

ESTs 

ESTs 

granzyme B (granzyme 2; cytotoxic T-lymp 
ESTs 

eukaryoHc translation elongaflon factor 
ESTs 

cofilin 2 (muscle) 

ESTs; Weakly similar to PUTATIVE RHO/RAC 
ESTs; Moderately similar to CaM-KII inhl 
ribosomal protein L41 

ESTs 34.60 



2.34 
2.61 



9.00 



15.00 
6.67 



Z65 



17.00 



59.20 



29.00 
32.20 



35.80 

39.60 
28.80 



2.63 



Z35 

3.07 
2.48 

Z50 



8.00 



12.00 



9.00 



7.33 



10.00 



2.36 



3.09 



2.66 



46.80 
31.40 



39.80 



32.20 
41.80 



82.80 

32.40 
70.60 

29.40 

30.80 

37.60 

34.20 

33.60 
29.80 

34.40 



29.40 
37.20 

38.40 



34.40 



41.60 
87.00 



19.00 
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2.25 
2.92 



3.58 



3.60 
3.14 



3.37 



2.58 



3.95 



13.00 



4.74 



3.14 



6.03 



10.00 



3.12 



2.79 



4.42 
2.58 

4.31 



9.00 



11.00 



4.02 
3.76 



3.10 



3.17 



3.21 
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129210 
129240 
129262 
129301 
129331 
129381 
129565 
129595 
129813 
129782 
129950 
129958 
129959 
130160 
130259 
130273 
130312 
130436 
130523 
130799 
130885 
131002 
131012 
131031 
131061 
131066 
131082 
131087 
131161 
131179 
131182 
131205 
131277 
131281 
131282 
131285 
131355 
131391 
131461 
131487 
131517 
131545 
131583 
131647 
131675 
131676 
131708 
131717 
131756 
131762 
131821 
131839 
131861 
132015 
132070 
132242 
132334 
132476 
132490 
132533 
132598 
132619 
132652 
132726 
133028 
133071 
133120 
133129 
133147 
133151 
133213 
133276 
133377 
133407 
133535 
133537 
133656 
133689 
133779 
133978 
133985 
134000 
134111 
134185 
134204 



AL039940 

AA361258 

BE222198 

AF182277 

AW167668 

AW245805 

X77777 

U09550 

AW978517 

AW016932 

F07783 

R27496 

AL036554 

AA305688 

NMJJQ0328 

AW972422 

AF056195 

NMJJ01928 

AA999702 

AB028945 

NMJJ05883 

AL050295 

AL039940 

NNL001650 

N64328 

AW1 69287 

A1091121 

AF147709 

AF033382 

AA171388 

AI824144 

NMJTC3102 

AA131466 

AA251716 

X03350 

AI567943 

R52804 

AW085781 

AA992841 

F13036 

AB037789 

AL1 37432 

AK000383 

AA359615 

H15205 

AI126821 

S60415 

X94630 

AA443966 

AA744902 

AA017247 

AB014533 

AL09B858 

A1418006 

BE622641 

AA332697 

AW080704 

AL1 19844 

NM.001290 

AI922988 

XB0031 

H28855 

N41739 

N52298 

R51604 

BE384932 

NM_003278 

AA428580 

M026533 

NM_014051 

AA903424 

AW978439 

AJ131245 

AF017987 

AL134030 

U41518 

BE149455 

NMJJ01872 

T58486 

AF035718 

L34657 

AW175787 

A137258B 

AA285136 

AI873257 



Hs.202949 

Hs.237868 

Hs.109843 

Hs.330780 

Hs.279772 

Hs.110903 

Hs.198726 

Hs.1154 

Hs.172847 

Hs.104105 

Hs.1369 

Hs.1378 

Hs.274463 

Hs.267695 

Hs.153614 

Hs.153863 

Hs.15430 

Hs.155597 

Hs.214507 

Hs.12696 

Hs.20912 

Hs.22039 

Hs.202949 

Hs.288650 

Hs.268744 

Hs.22588 

Hs.246218 

Hs.22824 

Hs.23735 

Hs.184482 

Ks.23912 

Hs.2420 

Hs.23767 

Hs.25227 

Hs.4 

Hs.25274 

Hs.25956 

Hs.26270 

Hs.27263 

Hs.27373 

Hs.263395 

Hs.28564 

Ks.323092 

Hs.30089 

Hs.30509 

Hs.30514 

Hs.30941 

Hs.3107 

Hs.31595 

Hs.107767 

Hs.164577 

Hs.33010 

Hs.184245 

Hs.3731 

Hs.38489 

Hs.42721 

Hs.45033 

Hs.49476 

Hs.4980 

Hs.1 72510 

Hs.530 

Hs.53447 

Hs.61260 

Hs.55608 

Hs.300842 

Ks.64313 

Hs.65424 

Hs.65551 

Ks.66 

Hs.94896 

Hs.6786 

Hs.69504 

Hs.7239 

Hs.7306 

Hs.284160 

Hs.74602 

Hs.75415 

Hs.75572 

Hs.222566 

Hs.78051 

Hs.78146 

Hs.334841 

Hs.8022 

Hs.301914 

Hs.7994 



KIAA1 102 protein 

interteukln 7 receptor 2.29 
ESTs 

Human cytochrome P450-IIB (hllB3) mRNA; 
ESTs; Highly similar to CGl-38 protein [ 

Clauoln 5 (transmembrane protein deleted 2.93 

vasoactive intestinal peptide receptor 1 1 60.80 

oviductal glycoprotein 1; 120kD 10.00 

ESTs; Weakly similar to collagen alpha 1 

EST 9.00 

decay accelerating factor for complement 87.80 

annexinA3 44.60 

defensin; alpha 1 ; myelotd-related sequa 2.72 

UDP-Gal:bet aGlcN Ac beta 1 ;£galactosyttr 42.20 

retinitis pigmentosa GTPase regulator 2.54 

MAD (mothers against decapentaplegic; Dr 51 .60 

DKFZP586G1219 protein 

D component of complement (adipsto} 

ESTs 

ESTs 6.00 
adenomatous polyposis colt like 
KIAA0758 protein 

KIAA1 102 protein 20.00 
aquaporin4 41.20 

ESTs; Moderately similar to KIAA0273[H. 31.40 

ESTs 29.60 

ESTs; Weakly similar to zinc finger prot 9.00 

ESTs; Weakly similar to p160 myb-blnding 

potassium voltage-gated channel; subfami 

OKFZP586D0824 protein 

ESTs 

superoxide ctismutase 3; extracellular 2.98 

ESTs 3.15 

ESTs 32.20 

alcohol dehydrogenase 3 (class I); gamma 

ESTs; Moderately simitar to putative sev 

DKFZP564D206 protein 8.00 

ESTs 10.00 

butyrate response factor 2 (EGF-response 28.80 

Homo sapiens mRNA; cONA DKFZp56401763 (f 

ESTs; Highly similar to semaphorin Via [ 39.00 

ESTs 11.00 
ESTs; Weakly similar to dual specificity . 10.00 

ESTs 2.47 
ESTs 

ESTs 45.80 
calcium channel; voltage-dependent; beta 2.28 
CD97 antigen 
ESTs 

ESTs; Moderately similar to CaM-KH Inhl 

ESTs 2.87 
KIAA0633 protein 

KIAA0929 protein Msx2 interacting nudea 54.00 
ESTs 
ESTs 

ESTs 2.68 
lacrimal proline rich protein 4.66 
Homo sapiens clone TUA8 Cri-du-chat regl 34.20 
UM binding domain 2 2.66 
ESTs 13.00 
collagen; type IV; alpha 3 (Goodpasture 
ESTs; Moderately similar to kinesln ligh 
ESTs 

ESTs; Weakly similar to cDNA EST yk484g1 11.43 

ESTs 2.37 

ESTs 2.27 

tetranectin (ptasminogervbinding protein 2.63 

ESTs 

Interteukln 1 receptor-like 1 6.20 
ESTs 

ESTs 31.40 
ESTs 9.00 
SEC24 (S. cerevistae) related gene famil 41 .20 

secreted frizzled-related protein 1 50.20 
protocadherin 2 (cadherin-like 2) 
aquapodn 1 (channel-forming integral pr 

Accession not listed tn Genbank 2.65 
carboxypeptkJase B2 (plasma) 90.80 
ESTs 

transcription factor 21 2.92 
plateletfendothelial cell adhesion molec 
selenium binding protein 1 

TU3A protein 4.49 
Homo sapiens mRNA; cONA DKFZp586K1 220 (f 

ESTs; Weakly similar to CGI-69 protein ( 40.80 
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4.09 



3.30 



3.40 



4.05 
4.09 



3.16 
4.77 
3.54 



3.14 



6.40 



411 
3.50 

3.86 
3.67 

a44 



4.03 



3.06 



40.60 



49.20 
34.80 



30.60 



3.78 
3.67 



3.48 



4.02 
3.18 



3.69 



3.72 



3.05 



5.49 



3.35 



3.45 
4.05 



3.27 
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134641 AI092634 Hs.156114 

134677 AA2S1363 Hs.177711 

134745 NM_000685 Hs.89472 

134749 T28499 Hs.89485 

134786 T29618 

134825 U33749 

134978 AI829008 

135010 N50465 

135053 AW796190 Hs.93678 

1350B1 AF069517 Hs.173993 

135091 AA493650 

135135 AA775910 

135203 C15737 

135236 AI636208 

135266 R41179 

135346 NM.000928 Hs.992 

135378 AW961818 Hs.24379 

135387 NM_001972 Hs.99863 

135388 W27965 Hs.99865 
135402 L12398 Hs.99922 



Hs.89640 
Hs.197764 
Hs.333383 
Hs.92927 



Hs.94367 
Hs.95011 
Hs. 269386 
Hs.96901 
Hs.97393 



protein tyrosine phosphatase; norwecept 
ESTs 

angiotensin receptor 1B 

carbonic anhydrase (V 

angiopoietln 1 receptor; TEK tyrosine ki 

thyroid transcription factor 1 

ficolin (coDagenffibrinogen domain-cont 

ESTs 

ESTs 

RNA binding motif protein 6 
ESTs 

syntrophln; beta 1 (dystrophin-associate 

ESTs 

ESTs 

Human mRNA for KIAA0328 gene; partial cd 
phosphoflpase A2; group IS (pancreas) 
potassium voltage-gated channel; shaker- 
elastase 2; neutrophil 
EST 

dopamine receptor D4 



15.00 



3.05 



2.52 



32.20 



57.80 



31.60 
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3.76 



3.73 



28.80 



43.00 



37.20 
38.80 



8.00 



3.82 
4.15 



3.21 



431 



4.21 



4.24 



6.42 



TABLE 2B shows the accession numbers for those primekeys lacking untgeneiD's for Table 2A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset Identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 

108447 
108550 
106655 
102397 
126303 
125810 
103627 
121366 
114609 
115272 
108338 
108434 
123802 
102310 
102636 
104776 
120504 
113502 
108499 
101308 
108629 
103098 
103241 
103508 
103575 
119514 
121082 
128634 
105817 
121518 
114449 
114648 
121950 
107723 



CAT number Accessions 

4345*_-7 AA079126 

120073J AA0848S7 AA084996 

127522.1 AA099960 AA113013 

44371_-1 U41898 

1525933.1 D78841 D78880 

1554054.1 H00083R81062 

2615.2 Z48513Z48512 

280401 1 AI743515AA405617AW276706 

1 1 6777.1 AA079505 AA079537 

1 721 13.1 AW015947 AA21 1890 AA279425 

112186.1 AA070773AA070774 

1 14012.1 M078B99 AA078782 AA075788 

genbankj*A620448 AA620448 

NOT_FOUND_entrezJJ33B39 U33839 

entrezJJ67092 U67092 

genbank u AA026349 AA026349 

genban»LAA256837 AA256837 

genbankJT89130T89130 

genbankJ\A083103 AA083103 

entrezj_41390 L41390 

genbankJVU02425 AA102425 

221 215 M86361 Z26593 X02850 D13070 AE000659 M17649 M87869 M87871 X61077 M16286 AF018169 X61079 S59351 X60142 AF043169 

entrezj<76223 X76223 

entreO'10141 Y10141 

entre^Z26256 Z26256 

NOT_FOUND.entrez_W37937 W37937 

genbank J AA398722 AA398722 

AA464918.at AA464918 

genbankJ\A397825 AA397825 

genbanleM412155 AA412155 

genbankJVA020736 AA020736 

genbanK,AA101056 AA101056 

genbankJVA429515 AA429515 

genbank^AA015967 AA015967 
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Table 3A shows 452 genes up-regulated In chronically diseased tung relative to normal lung. Chronically diseased tung samples represent chronic non-maBgnant lung diseases 
such as fibrosis, emphysema, and bronchitis. These genes were selected from 59680 probesets on the Eos7Affymetrtx Hu03 Genechip array. Gene expression data for each 
probeset obtained from this analysis was expressed as average intensity (Al), a normalized value reflecting the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UntgenelD: Unlgene number 

Unlgene TiUe: Unlgene gene title 

Rt: 80th percentile of Al for chronically diseased lung samples divided by the 90th percentile of Al for normal lung samples. 

R2 BOlh percentile of Al for chronically diseased (ung samples divided by the 90th percentile of normal lung samples, squamous ceD carcinomas and 

adenocarcinomas 

R3: 70th percentile of Al for chronically diseased lung samples minus the 15th percentile of A! for all normal lung, chronically diseased tung and tumor samples 

divided by the 90th percentile of normal lung samples, squamous cell carcinomas and adenocarcinomas minus the 15th percentile of Al for all normal lung, 
chronically diseased lung and tumor samples 



Pkey 


ExAccn 


UnlgenelO 


Unlgene Title 


135423 


U50531 


Hs.138751 


Human BRCA2 region, mRNA sequence CG030 


135378 


AW961818 


Hs.24379 


MUM2 protein 


135346 


NM.000928 


Hs.992 


phospholipase A2, group IB (pancreas] 


135235 


AW298244 


Hs.293507 


ESTs 


135057 


U90268 


Hs.93810 


cerebral cavernous maifbrmations 1 


134951 


BE305081 


Hs.169358 


hypothetical protein 


134799 


M36821 


Hs.89690 


GR03 oncogene 


134786 


T29618 


Ks.89640 


TEK tyrosine kinase, endothelial (venous 


134772 


NM_000829 


Hs. 163697 


glutamate receptor, lonotrophlc, AMPA 4 


134752 


BE246762 


Hs.89499 


arachfdonate 5-lipoxygenase 


134749 


T28499 


Hs.69485 


carbonic anhydrase IV 


134696 


BE326276 


Hs.8861 


ESTs 


134636 


NML005582 


Hs.87205 


lymphocyte antigen 64 (mouse) homolog, r 


134627 


Al 01 8768 


Hs.12482 


glyceronephosphate O-acyltransferase 


134622 


AW975159 


Hs.293097 


ESTs, Weakly similar to A55380 faciogeni 


134570 


U66615 


Hs.1 72280 


SW1/SNF related, matrix associated, act! 


134561 


U76421 


Hs.85302 


adenosine deaminase, RNA-specific, B1 (h 


134468 


NNL001772 


Hs. 83731 


CD33 antigen (gp67) 


134417 


NMJJ06416 


Hs.82921 


solute carrier family 35 (CMP-staHc ad 


134343 


O50683 


Hs.82028 


transforming growth factor, beta recepto 


134323 


BE170651 


Hs.8700 


deleted In liver cancer 1 


134300 


NMJJ01430 


Hs.8136 


endothelial PAS domain protein 1 


134299 


AW58Q939 


Hs.97199 


complement component C1q receptor 


134253 


X52075 


Hs.80738 


siafophorln (gpL115, leukosiaffn, CD43J 


134182 


D52059 


Hs.7972 


KIAA0871 protein 


133985 


134657 


Hs.78146 


platelet/endothelial cell adhesion motec 


133978 


AF035718 


Hs.78061 


transcription factor 21 


133835 


A1677B97 


Hs.76640 


RGC32 protein 


133651 


AI301740 


Hs.1 73381 


dihydropyrimidinase-like 2 


133633 


D21262 


Hs.75337 


nucleolar and coBed-body phosphprotein 


133565 


AW955776 


Hs. 3 13500 


ESTs. Moderately similar to ALU7_HUMAN A 


133548 


AW946384 


Hs.1 781 12 


ONA segment, single copy probe LNS-CAI/L 


133488 


AA335295 


Hs.74120 


adipose specific 2 


133478 


X83703 


Hs.31432 


cardiac ankyrin repeat protein 


133337 


AF085983 


Hs.293676 


ESTs 


133200 


AB037715 


Hs.1 83639 


hypothetical protein FU 10210 


133153 


AF070592 


Hs.66170 


HSKM-B protein 


133130 


AI128606 


Hs.6557 


zinc finger protein 161 


133120 


NM_003278 


Hs.65424 


tetranectin (ptasmlnogerv-bindtng protein 


132928 


AW1 68082 


Hs.1 69449 


protein kinase C, alpha 


132836 


AB023177 


Hs.29900 


K1AA0960 protein 


132799 


W73311 


Hs.1 69407 


SAC2 (suppressor of actin mutations 2, 


132742 


AA025480 


Hs.292812 


ESTs, Weakly similar to T33468 hypotheti 


132548 


X12830 


Hs.1 93400 


intarleukin 6 receptor 


132476 


AL1 19844 


Hs.49476 


Homo sapiens done TUA8 Crf-du-chat regl 


132439 


AK001942 


Hs.4863 


hypothetical protein DKFZp566A1524 


132240 


AB018324 


Hs.42676 


K1AA0781 protein 


132210 


NM.007203 


Hs.42322 


A kinase (PRKA) anchor protein 2 


132199 


AL041299 


Hs.165084 


ESTs 


131751 


T96555 


Hs.31562 


ESTs 


131745 


A1828559 


Hs.31447 


ESTs, Moderately similar to A46010 X-li 


131694 


NNL000246 


Hs.3076 


MHC class II transactivator 


131686 


NM_012296 


H3.306B7 


GRB2-associated binding protein 2 


131676 


Al 126821 


Hs.30514 


ESTs 


131629 


245794 


Hs.238809 


ESTS 


131589 


C18825 


Hs.29191 


epithelial membrane protein 2 


131536 


AA019201 


Hs.269210 


ESTs 


131517 


AB037789 


Hs.263395 


sema domain, transmembrane domain (TM), 


131355 


R52804 


Hs.25956 


DKFZP564D206 protein 


131253 


R71802 


Hs.24853 


ESTs 


131207 


AF104266 


Hs.24212 


latrophuln 


131156 


A1472209 


Hs.323117 


ESTs 


131066 


AW169287 


Hs.22588 


ESTs 


131061 


N64328 


Hs.268744 


KIAA1796 protein 


131053 


AA348541 


Hs.296261 


guanine nucleotide binding protein (G pr 


130895 


AA641767 


Hs.21015 


hypothetical protein DKFZp564L0864 slmil 


130762 


084371 


Hs.1898 


paraoxonase 1 



R1 

1140 



12.40 
11.67 



29.80 

13.60 
13.20 



R2 



aoo 

8.20 



R3 
2.13 



1.93 
2.07 



1.92 
1.92 

1.78 



6.20 



20.60 
12.20 



15.20 



30.60 
22.60 

13.80 

41.60 
40.40 



21.20 
15.20 
27.80 

21.40 
15.00 



16.60 
12.00 



9.60 



1.77 
2.08 
1.77 



7.20 
4.76 



4.00 
6.20 



9.40 
3.59 
4.48 



3.54 



1.88 
1.99 
1.76 



1.75 
1.84 



1.93 
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130657 


AW337575 


Hs.201591 


ESTs 




130655 


A1831962 


Hs.17409 


cysteine-rich protein 1 (intestinal) 




130589 


AL1 10226 


Hs.16441 


DKFZP434H204 protein 




130562 


D50402 


Hs.182611 


solute carrier family 11 (proton-coupled 




130555 


R69743 


Hs.1 16774 


Integrin, alpha 1 




130365 


W56119 


Hs.1 55103 


eukaryotic translation Initiation factor 


11.60 


130273 


AW972422 


Hs.1 53883 


MAD (mothers against decapentapteglc, Dr 




130259 


NMJJ00328 


Hs.153614 


retinitis pigmentosa GTPase regulator 




130090 


H97878 


Hs.132390 


zinc finger protein 36 (KOX 18) 


21.20 


129958 


R27496 


Hs.1 378 


annexlnA3 




129898 


A1672731 


Hs.13256 


ESTs 




129875 


AA161018 


Hs.13056 


hypothetical protein FU13920 


18.60 


129699 


AB007899 


Hs.1 2017 


homolog of yeast ubiquitin-protein ligas 




129826 


F13272 


Hs.1 11334 


ferritin, light polypeptide 




129598 


N30436 


Hs.1 1556 


Homo sapiens cDNA FU12566 fis, clone NT 


22.63 


129593 


AI338247 
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Hs.60418 


107332 


T87750 


Hs.183297 


107292 


BE166479 


HS.4769 


107230 


AI034467 


Hs.34650 


107168 


W57578 


Hs.237955 


107160 


AA314490 


Hs.27669 


107054 


AI076459 


Hs.15978 


107029 


AF264750 


HS.28B971 


106999 


H932B1 


Hs.10710 


106954 


AF1 28847 


Hs.204038 


106870 


A1983730 


Hs.26530 


106865 


AW192535 


Hs.19479 


106844 


AA485055 


Hs.158213 


106820 


NM_016831 


Hs.12592 


106618 


AK002135 


Hs.3542 


106797 


A1768801 


Hs.169943 


106773 


AA478109 


Hs.188833 


106747 


NM.007118 


Hs.171957 


106743 


6E613328 


Ks.21936 


106667 


AW360847 


Hs.16578 


106605 


AW772298 


Hs.21103 


106567 


AW450408 


Hs.86412 


106562 


AL031846 


Hs.152151 


106536 


AA329648 


Ks.23804 


106533 


AL134708 


Hs.145998 


105507 


AA259068 


Hs.267819 


106490 


AA404265 


Hs.1 15537 


106474 


BE383668 


Hs.42484 


106211 


AA428240 


Hs.126083 


105986 


AB037722 


Hs.8707 


105894 


AI904740 


Hs.25691 


105847 


AW964490 


Hs.32241 


105803 


AW747996 


H3.160999 


105731 


AA834664 


Hs.29131 


105729 


H46612 


Hs.293815 


105688 


AI299139 


Hs.17517 


105510 


Z42047 


Hs.283978 


105101 


H63202 


Hs.38163 


104989 


R65998 


Hs.285243 


104986 


AW088826 


Hs.1 17176 


104969 


AI670947 


Hs.78406 


104903 


AJ436323 


Hs.31141 


104896 


AW015318 


Hs.23165 


104865 


T79340 


Hs.22575 


104825 


AA035613 


Hs.141883 


104781 


AA099904 


Hs.21610 


104776 


AA026349 




104691 


U29690 


Hs.37744 


104667 


AI239923 


Hs.30098 


104404 


H58762 




104392 


AA076049 


Hs.274415 


104212 


AB002298 


Hs.173035 


104074 


AL162039 


Hs.31422 


103749 


AL135301 


Hs.8768 


103645 


AW246253 


Hs.7043 


103554 


AI878826 


Hs.323469 


103541 


AI815601 


Hs.79197 


103496 


Y09267 


Hs.132821 


103428 


BE383507 


Hs.78921 


103353 


X89399 


Hs.119274 



solute carrier family 16 (monocarboxyllc 

ESTs, Weakly similar to I38022 hypothetl 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

hypothetical protein 
DKFZP586C1620 protein 
KIAA0863 protein 

gbzn98g07.s1 Stratagene fetal retina 93 
ESTs, Weakly similar to T26845 hypothec 
flb3l84c04.s1 Stratagene colon (937204) 
hypothetical protein OKFZp434l0428 
macrophage receptor with collagenous str 
ESTs 

Homo sapiens mRNA; cDNA DKFZp554G112 (fr 
ESTs 

Homo sapiens cDNA FU 14237 fis, clone NT 
ESTs 

chordin-llke 

UM domains containing 1 

Ig superfamily receptor LNIR 

ESTs, Moderately similar to ALU7JHUMAN A 

EST 

DKFZP566F2124 protein 

Homo sapiens serologically defined breas 

ESTs 

RAB7, member RAS oncogene family 
WAA1563 protein 
KIAA1272 protein 

myeloid/lymphoid or mixed-lineage teukem 
hypothetical protein FU20417 
indolethyiamlne N-methyl transferase 
serum deprivation response (phosphatidyl 
ESTs 

sperm associated antigen 6 

period (Drosophila) homolog 3 

hypothetical protein FU11273 

Homo sapiens cDNA FLJ13569 fis, clone PL 

ESTs 

triple functional domain (PTPRF interact 
hypothetical protein FU12492 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564B076 (fr 
chromosome 9 open reading frame 5 
plakophlUn 4 

ESTs, Weakly similar to PN0099 son3 prot 
ESTs 

protein phosphatase 1. regulatory (inhib 



hypothetical protein FU10618 
ESTs 

KIAA1301 protein 

receptor (calcitonin) activity modifying 

ESTs, WeaWy similar to S65657 slpha-1 C- 

ESTs, Moderately similar to A56194 throm 

nuclear receptor coactivator 2 

Homo sapiens HSPC285 mRNA. partial cds 

ESTs 

Homo sapiens PR02751 mRNA, complete cds 
ESTs 

hypothetical protein FU22029 
poly(A)-bIndlng protein, nuclear 1 
phosphatidyBnositol-4- phosphate 5-kInas 
Homo sapiens mRNA for KIAA1568 protein, 
ESTs 

Homo sapiens cDNA: FU21042 fis, clone C 
ESTs 

DKFZP434B203 protein 

gb:z)99f01.s1 SoaresjjregnanUiterusJlbH 

Homo sapiens beta-1 adrenergic receptor 

ESTs 

gb:EST000S7 HE6W Homo sapiens cDNA clone 
Homo sapiens cONA FU10229 Rs, clone HE 
KIAA0300 protein 

Homo sapiens mRNA; cDNA DKFZp434M229 (fr 
hypothetical protein FU10849 
succInate-CoA ligase, GDP-formlng, alpha 
cavedln 1, caveolae protein, 22kO 
CD83 antigen (activated B lymphocytes, I 
flavin containing monooxygenase 2 
A kinase (PRKA) anchor protein 1 
RAS p21 protein activator (GTPase acrJva 



23.83 



22.80 



15.00 
25.60 
14.20 
11.00 
26.00 



15.20 
15.44 



14.20 
51.80 
29.20 
10.73 
32.00 
17.40 
10.43 
11.40 

21.40 
35.80 



13.40 
13.00 



12.60 
10.60 



23.20 
15.20 

10.44 



10.71 



23.40 
37.20 



13.80 



3.91 



27.20 

11.20 
10.86 
12.00 



11.20 
19.80 



17.20 
9.60 



6.00 



3.60 

11.40 

4.76 



7.13 
7.00 



2930 
3.70 



8.30 

ao9 

5.40 
7.60 



10.20 
5.69 
3.82 
4.20 



1.76 



2.05 



140 
1.78 
1.76 
219 



1.94 
1.75 
2.47 



1.92 



1.87 
1.93 



1.91 



1.80 
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103295 


X81479 


Hs.2375 


103280 


U84722 


Hs.76206 


103100 


NH.005574 


Hs.184585 


103025 


NMJJ02837 


Hs.123641 


102898 


M18667 


Hs.1867 


102659 


BE245169 


Hs.211610 


102580 


U60808 


Hs.1 52981 


102417 


AA034127 


Hs.153487 


102363 


NMJJ03734 


Hs.198241 


102302 


AA305342 


Hs.69171 


102283 


AW161552 


Hs.83381 


102188 


U20350 


H3.78913 


102151 


T27013 


Hs.3132 


101957 


128824 


Hs.74101 


101842 


M93221 


Hs.75182 


101771 


NM.002432 


Hs.153837 


101764 


AJ198550 


Hs.81256 


101716 


AF050658 


Hs.2563 


101678 


M62505 


Hs.2161 


101447 


M21305 




101383 


NM.000132 


Hs.79345 


101346 


AI738616 


Hs.77348 


101345 


NML005795 


Hs.152175 


101336 


NM.O0S732 


H3.75678 


101330 


L43821 


Hs.80261 


101277 


BE297626 


Hs.296049 


101262 


L35854 




101168 


NM_005308 


Hs.211569 


101102 


NMJJ03243 


Hs.79059 


101088 


X70697 


Hs.553 


101066 


AW970254 


Hs.889 


100971 


BE379727 


Hs.83213 


100893 


BE245294 


Hs.180789 


100770 


W25797.comp 


Hs.177486 


100716 


X89887 


Hs.172350 


100555 


M69181 




100425 


NMJJ14747 


Hs.78748 


100408 


D86640 


Hs.56045 


100382 


D83407 


Hs.156007 


100351 


064158 




100299 


D49493 


Hs.2171 


100134 


M305746 


Hs.49 


100108 


U09577 


Hs.76873 


100095 


Z97171 


Hs.78454 


100066 







egf-Iike module containing, mucln-like, 
cadherin 5, type 2, VE-cadherln (vascula 
LIM domain only 2 (rhombotln-like 1) 
protein tyrosine phosphatase, receptor t 
progastricstn (pepsinogen C) 
CUG triplet repeat, RNA-binding protein 
CDP-dlecyiglycerol synthase (phosphatlda 
signal transducing adaptor molecule (SH3 
amine oxidase, copper containing 3 (vase 
protein kinase C-like 2 
guanine nucleotide binding protein 11 
chemoWne (C-X3-C) receptor 1 
steroidogenic acute regulatory protein 



3.60 



mannose receptor, C type 1 
myeloid cell nuclear differentiation ant 
SI 00 calcium-binding protein A4 (calcium 
tachykinin, precursor 1 (substance K, su 
complement component 5 receptor 1 (C5a I 
gb:Human alpha satellite and satellite 3 
coagulation factor VIII, procoagutantco 
hydroxyprostaglandin dehydrogenase 15-(N 
calcitonin receptor-like 
FBJ murine osteosarcoma viral oncogene h 
enhancer of filamentafion 1 (cas-likedo 
mlcrofibrfJIar-assoclated protein 4 
gb:Human dystrophin (dp140) mRNA, 5* end 
G protein-coupled receptor kinase 5 
transforming growth factor, beta recepto 
solute carrier family 6 (neurotransmitte 
CharoMeyden crystal protein 
fatty acid binding protein 4, adipocyte 
S164 protein 

amyloid beta (A4) precursor protein (pro 
H1R (histone cell cycle regulation defec 
gb:Human nonmuscle myosin heavy chaln-B 
K1AAQ237 gene product 
sre homology three (SH3) and cysteine ri 
Down syndrome critical region gene 1-llk 

growth differentiation factor 10 
macrophage scavenger receptor 1 
hyaluronoglucosamlnldase 2 
myocilin, trabecular meshwork Inducible 



1.76 
2.15 



11.00 
25.40 
14.00 

10.86 



16.40 
15.40 



18.80 
504.80 



19.00 



19.38 

15.40 
11.20 
14.80 
33.00 
16.20 



7.40 



31.00 



7.52 



1.78 
222 

1.75 
2.24 

201 

1.91 



11.29 



4.00 
4.24 
6.20 
21.20 



5.40 



1.79 



TABLE 3B shows the accession numbers for those prtmekeys tacking unigenelD's for Table 3A. For each probeset we have Dsted the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed In the 
"Accession" column. 



Pkey: Unique Eos probeset identifier number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



123619 371681J 

126433 127143J 

125831 1522905J 

126816 122973J 

126852 136135.1 

121059 273450J 

120637 200885.1 

122011 7617_-2 

120934 177521J 

123802 genbank_M620448 

116814 genbank_H50834 

118329 genbank.N63520 

104404 H58762_at H58762 

104776 genbankJW)26349 

113502 genbank_T89130T89130 

101262 entrez_L35854 L35854 

108573 genbankJ\A086005 

101447 entrez_M21305 M21305 

124357 genbanleN22401 

108781 genbanK-AA128654 

112794 genbank_R97018 

100351 entrez^D64158 D64158 

100555 BgrJfT2245 M691B1 M81 105 U5 1039 



AA602964 AA609200 
AA325606AA099517 N89423 
H04043 D60988 D60337 
AA248234AA090985 
AA399961 AA1 28347 
AA393283 AA398528 

AA81 1804 AA809404 AA286907 AW977624 
AA431082 

AA226198 AA226513 AA383773 
AA620448 
H50834 
N63520 



AA026349 



AA086005 

N22401 

AA128654 

R97018 
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WO 02/086443 PCT/US02/12476 

Table 4A shows 202 genes up-regulated In samples from patients treated with chemotherapy or radiotherapy. These genes were selected from 59680 probesets on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average Intensity (Al), a normalized value reflecting 
the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 
ExAccn: . Exemplar Accession number, Genbank accession number 
UnigenelD: Unigene number 
Unlgene Title: Unigene gene title 

average of Al for samples from patients treated with chemotherapy or radiotherapy divided by the average of Al for normal lung samples. 

Unigene Title R1 

chromosome condensation 1 27.20 

aldo-keto reductase family 1 , member C3 20.60 
WAAQ042 gene product - 20.40 

glutamate receptor, metabotropic 5 20.60 

E2F transcription factor 3 29.40 

topolsomerase (DNA) II binding protein 23.50 

WAA0874 protein 35.56 

S164 protein 43.40 

POU domain, class 3, transcription facto 21 .80 

gb:Human alpha satellite and satellite 3 193.60 

heparin-blnding growth factor binding pr 38.40 

bullous pemphigoid antigen 1 (230/240kO) 198.80 

desmogtein 3 (pemphigus vulgaris antigen 78.60 

gap junction protein, beta 2, 26kD (conn 1 62.20 

nuclear autoantigenic sperm protein (his 50.00 

cytosoUc ovarian carcinoma antigen 1 26.00 

UDP.N^ceryl-alpha-I>gaiactosamine:polyp 37.20 
mutS (E. coli) homoiog 2 (colon cancer, 

RAR-related orphan receptor A 32.00 

ISL1 transcription factor, UM/homeodoma 51 .20 

deoxyguanosine kinase 13.90 

Homo sapiens cDNA: FU21800 fis, clone H 28.80 

preferentially expressed antigen In mela 1 1 0.60 

neurotensin 116.80 

enolase 2, (gamma, neuronal) 2.30 

matrix metailoprotelnase 1 (Interstitial 1 81 .40 

senirn/glucocoru'coid regulated kinase 49.20 

5T4 oncofetal trophoblast glycoprotein 86.60 

Homo sapiens mRNA; cDNA DKFZp564D016 (fir 42.60 

ESTs 29.40 

K1M1488 protein 21.50 

hypothetical protein FU20287 32.80 

Homo sapiens PR02751 mRNA, complete cds 20.20 

paired box gene 5 (B-cefi lineage specif 28.40 

downstream neighbor of SON 25.40 

ESTs, Weakly similar to 138022 hypotheti 32.00 
Homo sapiens mRNA; cDNA DKFZp761G02121 ( 40.60 

ESTs 59.80 

ESTs 43.40 

phosphoserine aminotransferase 50.80 

gb:tk04g09j(1 NCLCGAPJ.U24 Homo sapiens 53.40 

KIAA0922 protein 20.88 

DKFZP566F2124 protein 23.60 

Homo sapiens mRNA; cDNA DKFZp762G207 (fr 57.20 

Ig superfamBy receptor LNIR 49.00 

hypothetical protein 19.67 

collagen, type XVII, alpha 1 48.17 

RAB6 interacting, WnesMIke (rabkJnes 59.20 

WAA0863 protein 28.60 

hypotheflcal protein FU10493 22.80 
KIAA1702 protein 

ESTs 21.00 

trinucleotide repeat containing 9 31 .60 

ESTs 24.20 

hypothetical protein MGC5487 21.40 

ESTs 20.40 

klnesin family member 13A 19.60 

ESTs 24.00 

HMT1 (hnRNP methyltransferase, S. cerevi 28.40 

zinc-fingers and homeoboxes 1 38.00 

PDZ domain containing 1 61 .20 

hypothetical protein 24.60 

ESTs 27.20 

U Si-Interacting protein NUDE1, rat homo 48.00 

K1AA0942 protein 37.80 

CDC1 4 (cell division cycle 14, S. cerevi 26.80 

solute carrier family 6 (neurotransmitte 63.80 

ESTs, Weakly similar to ALU1 JiUMAN ALU 26.40 

ESTs, Weakly similar to 155214 salivary 47.64 

ESTs 22.00 

hypothetical protein FU10201 65.00 

Homo sapiens mRNA; cDNA DKFZp761 J1324 (f 42.00 

microtubule-associated protein 1 B 55.40 



R1: 


average of Al for samples 


Pkey 


ExAccn 


UnigenelD 


100113 


NMJ101269 


Hs.84746 


100187 


D17793 


Hs.78183 


100210 


D26361 


Hs.3104 


100225 


D28539 


Hs.167185 


100269 


NrVU)01949 


Hs.1189 


100438 


AA013051 


Hs.91417 


100877 


X80821 


Hs.27973 


100893 


BE245294 


Hs.180789 


101273 


Z11933 


Hs.182505 


101447 


M21305 




101649 


AW959908 


Hs.1690 


101724 


L11690 


Hs.620 


101748 


NMJJ01944 


Hs.1925 


101809 


M86849 


Hs.323733 


101879 


AA176374 


Hs.243886 


101915 


AF207881 


Hs.155185 


101973 


U41514 


Hs.80120 


102025 


V0AQ45 


Hs.78934 


102031 


U04898 


Hs.2156 


102052 


NM.002202 


Hs.505 


102391 


AA296874 


Hs.77494 


102420 


U44060 


Hs.14427 


102610 


U65011 


Hs.30743 


102829 


NM_006183 


Hs.80962 


103000 


NMJJ01975 


Hs.146580 


103036 


M13509 


Hs.83169 


103507 


AJ000512 


Hs.296323 


103587 


BE270266 


Hs.82128 


104660 


BE298665 


Hs.14846 


104896 


AW015318 


Hs.23165 


105038 


AW503733 


Hs.9414 


105298 


BE387790 


Hs.26369 


105510 


Z42047 


Hs.283978 


105667 


AA767526 


Hs.22030 


106073 


AL157441 


Hs.17834 


106205 


AW965058 


Hs.111583 


106516 


AL137311 


Hs.234074 


106533 


AL134708 


Hs.145998 


106575 


AW970602 


Hs.105421 


106654 


AW075485 


Hs.286049 


106851 


A1458623 




106995 


AB023139 


Hs.37892 


107332 


T87750 


Hs.183297 


107532 


AA443473 


Hs.173684 


107922 


BE153855 


Hs.61460 


108609 


BE409857 


Hs.69499 


108780 


AU076442 


Hs.117938 


109166 


AA219691 


Hs.73625 


109260 


AW978515 


Hs.131915 


109280 


AKC01355 


Hs.279610 


109292 


AW975746 


Hs.1 88662 


109384 


AA219172 


Hs.86849 


109415 


U80736 


Hs.1 10826 


109445 


AA232103 


Hs.189915 


109502 


AW967069 


Hs.211556 


109633 


AW003785 


Hs.170267 


109766 


A1989482 


Hs.146286 


109958 


AA001266 


Hs.133521 


110920 


N47224 


Hs.20521 


110924 


AW058463 


Hs.12940 


111084 


H44186 


Hs.15456 


111132 


AB037807 


Hs.83293 


111229 


AW389845 


Hs.1 10855 


111337 


AA837396 


Hs.263925 


111987 


NM_016310 


Hs.6763 


112046 


AA383343 


Hs.22116 


112268 


W39609 


Hs.22003 


112685 


R87650 


Hs.33439 


112871 


AL1 10216 


Hs.12265 


112897 


AW206453 


Hs.3782 


112973 


AB033023 


Hs.318127 


112992 


AL157425 


Hs.133315 


113073 


N39342 


Hs.1 03042 



108 



WO 02/086443 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



113494 


T91451 


Hs.86538 


ESTs 


2Z80 


113560 


T91015 


Hs.268626 


ESTs 


22.80 


113849 


AA457211 


Hs.8858 


bromodomain adjacent to zinc finger doma 


51.80 


113950 


AI267652 


Hs.30504 


Homo sapiens mRNA; cDNA DKF2p434E082 (fr 


28.20 


114339 


AA782345 


Hs.22790 


ESTs 


20.20 


114385 


H42169 


Hs.18653 


hypothetical protein FU 14627 


21.00 


114455 


H3790B 


Hs.271616 


ESTs, WeaWy slmflarto ALU8.HUMAN ALU S 


25.80 


114518 


AW1 63267 


Hs.106469 


suppressor of varl (S.cerevislae) 3-like 


23.60 


114824 


AA960961 


Hs.305953 


zinc finger protein 83 (HPF1) 


27.20 


114837 


BE244930 


Hs.166895 


ESTs 


30.20 


114974 


AW966931 


Hs.179562 


nudeosome assembly protein 1-Bke 1 


20.80 


115075 


AA814043 


Hs.88045 


ESTs 


30.60 


115084 


BE383668 


Hs.42484 


hypothetical protein FU10618 


28.86 


115291 


BE545072 


Hs.1 22579 


hypothetical protein FU 10461 


38.00 


115313 


AA808001 


Hs.184411 


albumin 


22.60 


115697 


D31382 


Hs.63325 


transmembrane protease, serine 4 


173.60 


115909 


AW872527 


Hs.59761 


ESTs, Weakly similar to DAP1_HUMAN DEATH 


27.77 


116090 


AI591147 


Hs.61232 


ESTs 


20.80 


116107 


AL133916 


Hs. 172572 


hypothetical protem FU20093 


164.20 


116399 


AA889120 


Hs.1 10637 


homeo boxA10 


38.00 


117099 


H93699 




gb:yv16a11.s1 Soares fetal liver spleen 


21.60 


117881 


AF161470 


Hs.260622 


butyrate-induced transcript 1 


49.40 


118091 


AW005054 


Hs.47883 


ESTs, Weakly similar to KCC1.HUMAN CALCI 


22.40 


118138 


AA374756 


Hs.93560 


Homo sapiens mRNA for KIAA1 771 protein, 


22.00 


118720 


N73515 




gb:za49d07,s1 Soares fetal liver spleen 


20.00 


118873 


AI824009 


Hs.44577 


ESTs 


19.40 


119126 


R45175 


Hs.1 171 83 


ESTs 


111.20 


119717 


M918317 


Hs.57987 


B-cefl CLL/iymphoma f IB (zinc finger pro 


33.00 


119940 


AL050097 


Hs.272531 


DKFZP586B031 9 protein 


31.00 


120266 


AI807264 


Hs.205442 


ESTs, Weakly similar to T34036 hypotheti 


20.20 


120515 


AA258356 




gb:zr59c10.s1 Soares_NhHMPu_S1 Homosapi 


25.00 


120859 


AA828434 


Hs.1619 


achaete-scute complex (Drosophila) homo) 


95.40 


120983 


AA398209 


Hs.97587 


EST 


105.20 


121054 


AW976570 


Hs.97387 


ESTs 


38.80 


121369 


AW450737 


Hs.128791 


CGi-09 protein 


41.60 


122335 


M443258 


Hs.241551 


chloride channel, calcium activated, fam 


30.80 


122612 


M974832 


Hs.128708 


ESTs 


19.60 


123130 


AA487200 




gb:ab19f02.s1 Stratagene lung (937210) H 


33.20 


123440 


AI733692 


Hs.112488 


ESTs 


23.17 


123596 


AA421130 


Hs.1 12640 


EST 


23.00 


123619 


AA602964 




gb:no97c02.s1 NCLCGAP_Pr2 Homo sapiens 


28.80 


124006 


A1147155 


Hs.270016 


ESTs. 


77.60 


124169 


BE079334 


Hs.271630 


ESTs 


22.20 


124281 


AI333756 


Hs.1 11801 


arsenate resistance protein ARS2 


42.20 


124472 


N52517 


Hs.102670 


EST 


32.60 


124617 


AW628168 


Hs.152684 


ESTs 


21.80 


124631 


NM_014053 


Hs.270594 


FLVCR protein 


30.40 


124839 


R55784 


Hs.140942 


ESTs 


21.20 


125186 


AA610620 


Hs.161244 


major histocompatibility complex, class 


42.80 


125321 


TB6652 


Hs.178294 


ESTs 


27.00 


125535 


NVL013243 


Hs.22215 


secretogranin III 


23.80 


125646 


M628962 


Hs.75209 


protein kinase (cAMP-dependent, caialytf 


23.20 


125684 


AW589427 


Hs.1 58849 


Homo sapiens cDNA: FU21663 fis, done C 


21.20 


125724 


AL360190 


Hs.295978 
Hs.249034' 


Homo sapiens mRNA full length Insert cDN 


48.80 


125847 


AW161885 


ESTs 


31.00 


125934 


Ml 93325 


Hs.32646 


hypothetical protein FU21901 


21.20 


126077 


M78772 


Hs.210836 


ESTs 


49.80 


126299 


AW979155 


Hs.298275 


amino acid transporter 2 


21.80 


126395 


AI468004 


Hs.276956 


hypothetical protein FU12929 


71.00 


126433 


AA325606 




gb:EST28707 Cerebellum II Homo sapiens c 


23.20 


126509 


R47400 


H*23850 


ESTs 


23.80 


126538 


AB030656 


Hs.17377 


coronin, actin-bindtng protein, 1C 


23.10 


126666 


AA646886 


Hs.1 51999 


ESTs 


36.00 


126812 


AB037860 


Hs.1 73933 


nuclear factor 1/A 


20.80 


126872 


AW450979 




gb:UI-H-BI3-a1a-a-12-0-UI.s1 NCLCGAP_Su 


46.29 


127046 


AA321948 


Hs.293968 


ESTs 


22.60 


127431 


AW771958 


Hs.175437 


ESTs, Moderately similar to PC4259 fern 


30.00 


127489 


AA650250 


Hs.272076 


ESTs 


20.80 


127521 


AW297206 


Hs.164018 


ESTs 


25.20 


127742 


AW293496 


Hs.180138 


ESTs 


28.00 


127925 


AA605151 


Ks.3628 


mitogen-activated protein kinase kinase 


21.20 


127930 


AA809672 


Hs.123304 


ESTs 


20.64 


12796B 


AA830201 


Hs.124347 


ESTs 


28.20 


127987 


AI022103 


Hs.124511 


ESTs 


19.60 


128116 


H07103 


Hs.286014 


Homo sapiens, done IMAG&3867243. mRNA 


20.40 


128609 


NNL003616 


Hs.102456 


survival of motor neuron protein interac 


34.40 


128777 


A1878918 


Hs.10526 


cysteine and glycine-rich protein 2 


53.80 


128949 


AA009647 


Hs.8850 


a disintegrin and metafloproteinase doma 


23.00 


129168 


A1132988 


Hs.109052 


chromosome 14 open reading frame 2 


37.60 


129404 


AI267700 


Hs.317584 


ESTs 


28.60 


129527 


AA769221 


Hs.270847 


delta-tubuSn 


40.80 


129574 


AA026815 


Hs.11463 


UMP-CMP kinase 


31.20 


12S598 


N30436 


Hs.11556 


Homo sapiens cDNA FU12566 fis, done NT 


29.60 


129785 


H19006 


Hs.184780 


ESTs 


72.20 


129970 


AV655B06 


Hs.296198 


chromosome 12 open reading frame 4 


22.20 
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130149 


AW067805 


Hs.172665 


130199 


Z48579 


Hs.172028 


130441 


U63630 


Ha.155637 


130466 


W19744 


Hs. 180059 


130462 


AW409701 


Hs.1578 


130617 


M90516 


Hs.1674 


130703 


R77776 


Hs.18103 


130732 


AW890487 


Hs.63984 


130867 


NMJJ01072 


Hs.284239 


131028 


AJ879165 


Hs.2227 


131066 


AL035461 


Hs.2281 


131284 


NM.001429 


Hs.25272 


131775 


AB014548 


Hs.31921 


131860 


BE383676 


Hs.334 


131945 


NM.002916 


Hs.35120 


132040 


NM_001196 


Hs.315689 


132084 


NMJJ02267 


Hs.3886 


132389 


AA310393 


Hs.190044 


132437 


AA152106 


Hs.4659 


132550 


AW969253 


Hs.170195 


132617 


AF037335 


Hs.5338 


132632 


AU076916 


Hs.5398 


132672 


W27721 


Hs.54697 


132742 


AA025480 


Hs.292812 


132771 


Y10275 


Hs.56407 


133070 


U92649 


Hs.64311 


133153 


AF070592 


Hs.66170 


133181 


X91662 


Hs.66744 


133282 


AA449015 


Hs.286145 


133350 


AI499220 


Hs.71573 


133592 


AV652066 


Hs.75113 


13365B 

Ivwwvv 


AA319146 


HsJ5426 


133865 


AR011155 

nDv III *J*J 


Hs. 170290 


134032 


NM (105025 


Hs.78589 


134125 


NM-014781 


Hs!50421 


134158 


U15174 


Hs.79428 


134321 


BE538082 


Hs.8172 


134367 


M339449 


Hs.82285 


134570 


U66615 


Hs.172280 


134753 


NfVLQ06482 


Hs.173135 


135002 


AA448542 


Hs.251677 


135029 


H58818 


Hs.187579 


135047 


AL134197 


Hs.93597 


135345 


X53655 


Hs.99171 



methylenetetrahydrofolate dehydrogenase 
a disintegrin and metal loproteinase doma 
protein kinase, DMA-activated, catalytic 
Homo sapiens cDNA RJ20653 fis, ctone KA 
baculoviral IAP repeat-containing 5 (sur 
glutamlne-fnjctose-6-phosphatB transamln 
ESTs • 

cadherin 13, H-cadherin (heart) 

DDP glycosyltransferase 1 family, pctype 

CCAAT/enhancer binding protein (C/EBP), 

chromogranln B (secretogranin 1) 

E1A binding protein p300 

KIAA0648 protein 

Rho guanine nucleotide exchange factor ( 
replication factor C (activator 1) 4 (37 
Homo sapiens cDNA: FU22373 fis, clone H 
karyopherin alpha 3 (importin alpha 4) 
ESTs 

cyclfn L anfa-6a 

bone morphogenetlc protein 7 (osteogenic 

carbonic anhydrase XII 

guanine monphosphate synthetase 

Cdc42 guanine exchange factor (GEF) 9 

ESTs, Weakly simitar to T33468 hypotheU 

phosphoserine phosphatase 

a disintegrin and metalloproteinase doma 

HSKM-B protein 

twist (Dmsophila) homoiog (acrocephatos 
SRB7 (suppressor of RNA polymerase B t ye 
hypothetical protein FU10074 
general transcription factor liiA 
secretogranin II (chromogranln C) 
(fiscs, targe (Drosophiia) homoiog 6 
serine (or cysteine) proteinase inhibito 
KIAA0203 gene product 
BCL2/adenovirus E1B 19kD-lnteracfing pro 
ESTs, Moderately similar to A46010 X-fin 
phosphoribosylglycinamlde formyltransfer 
SWI/SNF related, matrix associated, acB 
dual-speclflclty tyrosine-(Y)-phosphoryl 
G antigen 7B 

hydroxysteroid (17-beta) dehydrogenase 
cyclin-dependent kinase 5, regulatory su 
neurotrophin 3 



29.60 

27.60 

28.36 

20.20 

22.40 

19.60 

19.40 

21.40 

110.00 

25.20 

40.60 

24.60 

21.00 

33.40 

60.80 

20.40 

29.40 

32.40 

27.40 

75.60 

31.36 

32.40 

23.40 

61.20 

22.33 

23.50 

30.00 

23.80 

51.60 

33.00 

82.00 

69.33 
33.20 
31.60 
30.60 
23.40 
49.20 
20.20 
20.80 
37.60 
53.40 
31.60 



TABLE 4B shows fte accession numbers for those primekeys lacking unigeneltys for Table 4A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwlst, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession* column. 

Pkey: Unique Eos probeset Identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 

123619 
126433 
126872 

106851 
118720 
120515 
117099 
101447 
123130 



CAT number 



Accessions 



371681 1 AA602964 AA609200 
127143 1 AA325606AA099517N89423 

142696J AW450979M136653AA13S656AW419381M984358 M492073BE168945AA809054AW238038BE011212BE011359 

BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 
322947 1 AI458623 AA639708 AA485409 R22065 AA485570 
genbanLN73515 N73515 
genbankJW258356 AA258356 
321871.1 H93699 H97976 H80036 
entrezJU21305 M21305 
genbank_AA4872Q0 AA487200 
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Table 5A shows 680 genes up-regulated in squamous cell carcinoma or adenocarcinoma lung tumors relative to normal lung and chronically diseased lung. These genes were 
selected from 5S680 probesets on the Eos/Afrymetrix Hu03 GenecWp array. Gene expression data for each probeset obtained from this analysis was expressed as average 
intensity (Al), a normalized value reflecting the relative level ofmRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unigene number 

U nig ene Title: Unigene gene title 

R1 : 7Q|h percentile of AI for squamous cell carcinoma and adenocarcinoma lung tumor samples divided by the 90th percentile of Al for normal and chronically 

diseased lung samples. , , 

R2: 80th percentile of Ai adenocarcinoma lung tumor samples divided by the 90th percenliie of Al for normal and chronically diseased lung samples. 

R3: 80lh percentile of Al squamous ceil carcinoma lung tumor samples divided by the 90th percentile of Al for normal and chronically diseased lung samples. 

R4: 80th percentile of Al adenocarcinoma lung tumor samptes divided by the 80th percentile of A! for squamous cell carcinoma lung tumor samples. 

R5: ■ 70th percentile of AJ for squamous cell carcinoma and adenocarcinoma lung tumor samples minus the 15lh percentile of Al for all normal lung, chronically 

diseased lung and tumor samples divided by 90th percentile of Al for normal and chronically diseased lung samples minus the 1 5lh percentile of Al for ail 
normal lung, chronically diseased lung and tumor samples 



Pkey 


ExAccn 


UnigenelD 


10003S 






100036 






tOX37 






100071 


A28102 




100114 


X02308 


Hs.82962 


100154 


H60720 


Hs.81892 


100187 


D17793 


Hs.78183 


100188 


AW247090 


Hs.57101 


100202 


BE294407 


Hs.99910 


100216 


AA489908 


Hs.1390 


100269 


mm 001949 


He. 1189 


100287 


AU076657 


Hs.1600 


100297 


AU077258 


Hs. 182429 


100330 


AW4 10976 


Hs.77152 


100335 


AW247529 


Hs.6793 


100360 


W70171 


Hs.75939 


100372 


NM 014791 


Hs.184339 


lUUHr *t 




Hs 300280 


mndRR 

IUUHOD 


T19006 


Hs 10842 


100491 


U9DIOJ 


Hs 275163 


100516 


D90276 


Hsu 


100522 


X51501 


Hs 99949 




mm nnnn94 


Hs 1640 

1 lO» 1 U*1U 


100576 


X00356 


Hs.37058 




AA01S693 


Hs 21291 


100661 


BE623001 


Hs.1 32748 


100677 


AA353S86 


Hs.57813 


1DQ69R 


D1 4887 


Hs.121686 


100709 


N26539 


Hs. 100469 


100761 


BE208491 


Hs.295112 


100830 




Hs 4756 

1 Wit I WW 


100867 

IvUUvf 


U14622 




100902 


M 16029 


Hs.287270 


100906 


AU076916 


Hs.5398 


100960 


J00124 


Hs, 117729 


101045 


J05614 




101061 


NM_000175 


Hs. 180532 


101071 


L02840 


Hs.84244 


101124' 


L10343 


Hs.1 12341 


101175 


U82671 


Hs.36980 


101181 


BE262621 


Hs.73798 


101204 


L24203 


Hs.82237 


101210 


L29301 


Hs.2353 


101216 


AA284166 


Hs.84113 


101228 


AA333387 


Hs.82916 


101233 


AL135173 


Hs.878 


101273 


Z11933 


Hs.182505 


101342 


U52112 


Hs.182018 


101346 


AI738616 


Hs.77348 


101369 


NlvL000892 


Hs.1901 


101396 


BE267931 


Hs.78996 


101431 


BE185289 


Hs.1076 


101448 


NMJ300424 


Hs.1 95850 


101462 


AL035668 


Hs.73853 


101466 


BE262660 


Hs.170197 


101484 


AA053486 


Hs.20315 


101502 


M26958 




101505 


AA307680 


Hs.75692 


101526 


NM.002197 


Hs.154721 


101535 


X57152 


Hs,99853 


101577 


M34353 


Hs.1 041 


101649 


AW959908 


Hs.1690 


101663 


NMJ)03528 


HJL2178 


101664 


AA436969 


Hs.121017 


101669 


L24498 


Hs.80409 



Unigene Title R1 R2 R3 R4 R5 

AFFX control: GAPDH 6.76 
AFFX control: GAPDH 5.77 
AFFX control: GAPDH 5.75 
Human GABAa receptor alpha-3 subunit 8.00 

thymidylate synthetase 5.71 
KIAA0101 gene product 184 
atdo-keto reductase family 1 , member C3 3.33 

mlnlchromosome maintenance deficient (S. 4.52 
phosphofructoklnase, platelet 5.49 
proteasome (prosome. macropain) subunit, 6.67 
E2F transcription factor 3 2.55 

chaperonln containing TCP1, subunit 5 (a 5.66 
protein disulfide isomerase-related prat 3.81 
mlnlchromosome maintenance deficient (S. 4.50 
platelet-activating factor acetylhydrola 5.07 

uridine monophosphate kinase 4.82 
KIAA0175 gene product 3.79 
amylase, alpha 2A; pancreatic 1 5.65 

RAN, member RAS oncogene family 5.49 
ncn-malastafc cells % protein (NM23BJ 4. 17 

carcinoembryonic antigen-related cell ad 7.20 
prdac tin-induced protein 14.20 
collagen, type VII, alpha 1 (epidermolys 3.10 

calcitonin/calcitonin-related polypeptid 9.30 

mltogen-activated protein kinase kinase 20.60 

Homo sapiens ribosoma) protein L39 mRNA, 3.85 

zinc ribbon domain containing, 1 8.60 

general franscrfpUon factor HA, 1 (37k 10.00 

myeloid/lymphoid or mixed-lineage leukem 24.60 

KIAA061 8 gene product 7.60 

flap structure-specific endonudease 1 7.99 
gb:Human transketolase-like protein gene 10.20 
ret proto-oncogene (multiple endocrine n 8.00 

guanine monphosphate synthetase 5.16 
keratin 14 (epidermolysis bullosa simple 2.57 

gb:Human proliferating ceil nuclear antl 4.69 
glucose phosphate Isomerase 4. 1 9 

potassium voltage-gated channel, Shab-re 1 2.91 

protease inhibitor 3, skin-derived (SKAL 3.12 
melanoma antigen, family A, 2 3.50 

macrophage migration inhibitory factor ( 5.69 

ataxia-telangiectasia group D-associated 4.08 

opioid receptor, mu 1 6.40 

cyclin-dependent kinase inhibitor 3 (CDK 253 

chaperonln containing TCP1, subunit 6A ( 7.90 
sorbitol dehydrogenase 4.45 
POU domain, class 3, transcription facto 8.50 

fnterieukin-1 receptor-associated kinase 4. 17 

hydroxyprostaglandin dehydrogenase 15-{N 21 .89 

kaQikrein B, plasma (Fletcher factor) 1 12.80 
proliferating ceD nuclear antigen 3.24 

small proline-rich protein 1 B (comlfin) 7.90 
keratin 5 (epidermolysis bullosa simplex 8.31 

bone morphogenetic protein 2 38.80 
glutamic-oxaloacetic transaminase 2, mil 4.01 
interferon-induced protein with tetratri 12.00 
gb:Human parathyroid hormone-related pro • 10.50 

asparagine synthetase 4.46 
aconitase 1, soluble 4.02 

fibrillarln 4.65 

v-ros avian UR2 sarcoma virus oncogene h 9-09 

heparfn-bindtng growth factor binding pr 54.00 

H2B histone family, member Q 5.59 

H2A histone family, member A 7.00 

growth arrest and DNA-damage-induclble, 7.60 
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101695 


M69136 


Hs.135626 


101724 


L11690 


Hs.620 


101748 


NM.001944 


Hs.1925 


101759 


M80244 


Hs.184601 


101771 


NM-002432 


Hs.1 53837 


101804 


M86699 


Hs.169840 


101809 


M86B49 


Hs.323733 


101833 


AU076442 


Hs.1 17938 


101842 


M93221 


Hs.75182 


101851 


BE260964 


Hs.82045 


102002 


NMJJ02484 


Hs.81469 


102039 


AL134223 


Hs.306098 


102072 


U09410 


Hs.78743 


102083 


T35901 


Hs.75117 


102111 


L36196 


Hs.81884 


102123 


NM.001809 


Hs.1 594 


102154 


U17760 


Hs.75517 


102193 


ALD36335 


Hs.313 


102217 


AA829978 


Hs.301613 


102224 


NMJ)02810 


Hs.1 48495 


102234 


AW163390 


Hs.278554 


102251 


NMJW4398 


Hs.41706 


102305 


AL043202 


Hs.90073 


102330 


BE298063 


Hs.77254 


102340 


U37055 


Hs.278657 


102348 


U37519 


Hs.87539 


102368 


U39817 


HS.36B20 


102394 


NM.00381& 


Hs.2442 


102404 


NJVL005429 


Hs.79141 


102537 


U57094 


Hs.50477 


102581 


AU077228 


Hs.77256 


102605 


AI435128 


Hs.181369 


102610 


1)65011 


Hs.30743 


102623 


AW249285 


Hs.37110 


102642 


AA205847 


Hs.23016 


102654 


AV649989 


Hs.24386 


102659 


BE245169 


Hs-211610 


102669 


U71207 


Hs.29279 


102672 


U72066 


Hs.29287 


102687 


NI\4_0O7019 


Hs.93002 


102696 


BE540274 


Hs.239 


102768 


U82321 




102781 


BE258778 


Hs.1 08809 


102784 


U85658 


Hs.61796 


102824 


U90916 


Hs.82845 


102829 


NM.006183 


Hs.80962 


102888 


AI346201 


Hs.76118 


102892 


BE440042 


Hs.83326 


102913 


NM.002275 


Hs.80342 


102935 


BE561850 


Hs.80506 


102951 


X15218 


Hs.2969 


102983 


BE387202 


Hs.1 16639 


103023 


AW500470 


Hs.1 17950 


103036 


M13509 


Hs.63169 


103038 


AA926960 


Hs.334883 


103060 


NIVL005940 


Hs.155324 


103099 


AI693251 


Hs.8248 


103119 


X63629 


Hs.2877 


103166 


X53463 


Hs.2704 


103185 


NMJJ06825 


Hs.743$8 


103192 


M22440 


Hs.1 70009 


103223 


8E275607 


Hs.1708 


103242 


X76342 


Hs.389 


103316 


X83301 


Hs.324728 


103375 


NIVL005982 


Hs.54416 


103376 


AL036166 


Hs.323378 


103385 


NNLGQ7069 


Ks.37189 


103391 


X94453 


Hs.1 14366 


103404 


BE394784 


Hs.78596 


103430 


BE564090 


Hs.20716 


103446 


X98834 


Hs.79971 


103476 


Y07701 


Hs.293007 


103477 


AJ011812 


Hs.119018 


103478 


BE514982 


Hs.38991 


103515 


Y10275 


Hs.56407 


103558 


BE616547 


Hs.2785 


103580 


AA328046 


Hs.46405 


103587 


BE270266 


Hs.82128 


103594 


A1368680 


Hs.816 


103636 


NM 006235 


Hs.2407 


103768 


AF086009 




103841 


AA314821 


Hs.38178 


103847 


AF219946 


Hs.102237 


103913 


AW967500 


Hs.133543 


104Q94 


AA418187 


Hs.330515 



chymase 1, mast ceil 
bullous pemphigoid antigen 1 (230/240kD) 
desmogteln 3 (pemphigus vulgaris antigen 
solute carrier family 7 (cationic amino 
myeloid cell nuclear differentiation ent 
TTK protein kinase 

gap function protein, beta 2, 26kD (conn 
collagen, type XVII, alpha 1 
mannose receptor, C type 1 
midkine (neurits growth-promoting factor 
nucleotide binding protein 1 (Ecoll Min 
aldo-keto reductase family 1, member C1 
zinc finger protein 131 (clone pHZ-10) 
interleukin enhancer binding 'factor 2, 4 
surtotransferase family, cytosolfc, 2A, 
centromere protein A (17kD) 
laminin, beta 3 (nlcein (1 25kD), kalinin 
secreted phosphoproteln 1 (osteopontin, 
JTV1 gene 

proteasome (prosome, macropain) 26S subu 
heterochromatin-like protein 1 
DEAD/H (Asp-Glu-AIa-Asp/Hte) box polypep 
chromosome segregation 1 (yeast homolog) 
chromobox homolog 1 (Drosophila HP1 beta 
macrophage stimulating 1 (hepatocyta gro 
aldehyde dehydrogenase 3 family, member 
Bloom syndrome 

adisintegrin and metalloproteinase doma 
vascular endothelial growth factor C 
RAB27A, member RAS oncogene family 
enhancer of zests (Drosophila) homolog 2 
ublquitin fusion degradation 1-like 
preferentially expressed antigen in mela 
melanoma antigen, family A, 9 
G protein-coupled receptor 
Human hbc647 mRNA sequence 
CUG triplet repeat, RNA-binding protein 
eyes absent (Drosophila} homolog 2 
retinoblastoma-binding protein 8 
ublquitin carrier protein E2-C 
forkhead box M1 

gb;Homo sapiens clone 14.9B mRNA sequenc 
chaperonin containing TCP1, subunit7 (e 
transcription factor AP-2 gamma (aclivat 
Homo sapiens cDNA: FU21930 fis, clone H 
neurotensin 

ubiquilin carboxyMermlnal esterase LI 
matrix metaibprotemase 3 (stromslysin 
keratin 15 

small nuclear ribonucleoprotein polypept 
v-sH avian sarcoma viral oncogene homol 
non-metastattc cells 1, protein (NM23A) 
multifunctional polypeptide similar to S 
matrix metalloproteinase 1 (interstitial 
CDC28 protein kinase 1 
matrix metalloproteinase 1 1 (stromelysin 
NADH dehydrogenase (ubiquinone) Fe-S pro 
cadherin 3, type 1, P-cadherln (placenta 
glutathione peroxidase 2 (gastrointestin 
transmembrane protein (63kD), endoplasmi 
transforming growth factor, alpha 
chaperonin containing TCP1, subunit3 (g 
alcohol dehydrogenase 7 (class IV), mu o 
SMA5 

sine oculis homeobox (Drosophila) homoto 
coated vesicle membrane protein 
similar to rat HREV1 07 
pyrroiine-5-carboxylate synthetase (glut 
proteasome (prosome, macropain) subunlt, 
translocase of Inner mitochondria) membr 
sal (Drosophila)-fike 2 
aminopeptidase puromycin sensitive 
transcription factor NRF 
S100 calcium-binding protein A2 
phosphoserlne phosphatase 
keratin 17 
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serine (or cysteine} proteinase inhlbito 


A Cft 

4.60 


132771 


Y10275 


Hs.56407 


phosphoserine phosphatase 


3./i 


132799 


W73311 


Hs.169407 


SAC2 (suppressor of actin mutations 2, 




132833 


U78525 


HS.577B3 


eukaryobc translation initiation factor 




132892 


AW834050 


Hs.9973 


tensln 




132906 


BE613337 


Hs.234896 


gemtntn 


3.09 


132959 


AW014195 


Hs.61472 


ESTs, Weakly similar to YAE6_YEAST HYPOT 




132962 


AA576635 


Hs.6153 


CGM8 protein 


3.50 


132990 


X77343 


Hs.334334 


transcription factor AP-2 alpha (actlvat 


6.18 


132994 


AA112748 


Hs.279905 


clone HQ0310 PRO03l0p1 


O. i5? 


133000 


AL042444 


Hs.62402 


p21/Cdc42/Rac1-activatea kinase 1 (yeast 


Z96 


133050 


X73424 


Hs.63788 


propiony) Coenzyme A carboxylase, beta p 




133083 


BE244588 


Hs.6456 


chaperonln containing TCP1, subunit 2 (b 




133086 


L17131 


Hs.139800 


high-mobility group (nonhistone chromoso 




133134 


AF198620 


Hs.65648 


RNA binding motif protein 8A 




133155 


M58583 


Hs.662 


cerebeKIn 1 precursor 




133181 


X91662 


Hs.66744 


twist (Drosophlla) homolog (acrocephalos 


3.00 


133204 


BE267696 


Hs.254105 


enolasel, (alpha) 




133412 


U41493 


Hs.73112 


guanine nucleotide binding protein (G pr 




133421 


AF134160 


Hs.7327 


dautfinl 


2.85 


133451 


AW970026 


H3.73B18 


ublqulnoksytochrome c reductase hinge p 




133453 


AI659306 


HS.73B26 


protein tyrosine phosphatase, non-recept 


6.14 


133504 


NM_004415 


Hs.74316 


desmoplakln (DPI, DPK) 


133506 


BE562958 


Hs.74346 


hypothefjca) protein MGC14353 




133615 


M62843 


Hs.75236 


ELAV (embryonic letha), abnormal vision, 




133627 


NM_002047 


Hs.75260 


glycyl-tRNA synthetase 




133849 


U25849 


Hs.75393 


acid phosphatase 1, soluble 




133669 


MM.006925 


Hs.166975 


splicing factor, arajnine/serfne-rtch 5 




133749 


L20852 


Ks.10018 


solute carrier family 20 (phosphate tran 




133776 


BE268649 


Hs.177766 


ADP-nbosyltransferase (NAD+; poly (ADP- 




133865 


AB011155 


Hs.1 70290 


dlscs, large (Drosophila) homolog 5 


O fit 

3.07 


133946 


AJ001258 


Hs.173878 


N/PoNAP, C elegans, homolog 1 




133973 


N55540 


Hs.78026 


ESTs, Weakly similar to similar to ankyr 




134047 


BE262529 


Hs.78771 


phosphoglycerate kinase 1 




134098 


BE513171 


H8.79086 


mitochondrial liposomal protein L3 


2.56 


134107 


NNL005629 


Hs.187958 


solute carrier family 6 (neurotransmitte 




134112 


AW449809 


Hs.79150 


chaperonin containing TCP1, subunit 4 (d 




134158 


U15174 


Ha.79428 


BCL2/adenovirus E1B 19kD-Interactlng pro 


31.00 


134160 


T98152 


Hs.79432 


fibrillin 2 (congenital contractural ara 




134168 


AA398908 


Hs.181634 


Homo sapiens cDNA: FU23602 fis, clone L 




134185 


AA285136 


Hs.301914 


neuronal specific transcription factor D 




134201 


L35035 


Hs.79886 


ribose 5-phosphate isomerase A (ribose 5 


4.50 


134272 


X76040 


Hs.278614 


protease, serine, 15 


134276 


BE083936 


Hs.80976 


antigen identified by monoclonal antibod 




134353 


AL13B201 


Hs.82120 


nuclear receptor subfamily 4, group A, m 




134387 


AA339449 


Hs.82285 


phosphoribosytglycinamide forrnyltransfer 


2.80 


134380 


AU077143 


Hs.179565 


minichromosome maintenance deficient (S. 


4.68 


134423 


H53497 


Hs.83006 






134469 


AA279661 


Hs.83753 


small nuclear ribonucleoproteln pdypept 




134470 


X54942 


Hs.83758 


CDC28 protein kinase 2 




134498 


AW246273 


Hs.84131 


IhreonykRNA synthetase 




134502 


BE148534 


Hs.84168 


UV-B repressed sequence, HUR 7 




134510 


NMJJ02757 


Hs-250870 


mitogen-actlvated protein kinase kinase 




134548 


N95406 


Hs.333495 


Deleted In spfit-hartdTspIU-foot 1 regio 


6.00 


134654 


AK001741 


Hs.8739 


hypothetical protein RJ10879 



PCTYUS02/12476 



8.40 



9.80 
8.40 



3.82 



12.25 



7.00 



8.20 



8.60 
27.40 



6.60 



13.20 



9.20 
19.80 



15.83 



9.48 
12.00 



10.80 



12.50 
6.80 



6.11 

8.20 
24.60 



17.80 
14.00 

13.00 



8.40 
9.00 



14.74 



16.40 



5.83 
3.87 



4.00 
8.96 
4.28 



4.63 
4.66 

4.55 

4.85 
6.34 

4.91 
4.60 
3.85 

4.08 

6.71 



13.60 



9.70 



3.64 
5.81 
4.21 
7.30 



4.63 
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134724 


AF045239 


Hs.321576 


ring finger protein 22 




134743 


AA044163 


Hs.89463 


potassium large conductance calcium-acti 


4.00 


134781 


AA374372 


Hs.89626 


parathyroid hormone-Ee hormone 




134806 


AD001528 


Hs.89718 


spermine synthase 




134853 


BE268326 


Hs.90260 


5-aminoimid azole-4-carboxamide ribonucle 




134859 


D26488 


Hs.90315 


K1AAC007 protein 




134891 


R51083 


Hs.90787 


ESTs 




134960 


BE246400 


HSJ285176 


acetyl-Coenzyme A transporter 


4.00 


134993 


BE409809 


Hs.301005 


purine-rich element binding protein B 




135047 


AL134197 


Hs.93597 


cyci In-dependent kinase 5, regulatory su 


9.50 


135080 


AI761180 


Hs.94211 


rcdl (required for cell differentiation, 


5.00 


135103 


NM.003428 


Hs.9450 


zinc finger protein 84 (HPF2) 




135145 


AW014729 


Hs.95262 


nuclear factor related to kappa 8 bindin 




135184 


U13222 


Hs.96028 


forkhead box D1 




'135242 


AI583187 


Hs.9700 


cyclin E1 


13.50 


135286 


AW023482 


Hs.97849 


ESTs 


6.46 


135289 


AW372559 


Hs.9788 


hypothetical protein MGC10924 similar to 




135355 


AK001652 


Hs.99423 


ATP-dependent RNA hellcase 


10.00 


135371 


NM.006025 


Hs.997 


protease, serine, 22 


8.00 


135393 


L11244 


Hs.99886 


complement component 4-binding protein, 





12.00 



11.00 



8,80 



25.20 



6.20 
7.40 



7.00 



PCT/US02/12476 



4.58 
4.79 



4.48 



4.01 



14,60 



TABLE 5B shows the accession numbers for those primekeys lacking unigeneiD's for Table 5A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each duster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 

117079 
124305 
101502 
109792 
126034 
102768 
126345 
127066 
127099 
119243 
125B75 
112054 
126979 
126992 
122318 
114699 
114793 
108305 
.108393 
100867 
123731 
109700 
120715 
113702 
115113 
101045 
108554 
108573 
119052 
126522 
126605 
103768 



CAT number 



Accessions 



1621717J 
242183J 
18202.-6 
754958.1 
1598157J 
44641 1 
1653833J 
1703458.1 
244301 1 
1774795J 
1566433J 
1538292J 
17141 1J 
880855.1 
292419.1 
135322.1 
150742.1 
111550J 
113411.1 
tlgr.HT4586 
genbank^AA609839 
genbank.F09609 F09609 
genbanlCjAA292700 
genbank_T97307 T97307 
genbankJtA256460 
entrez_J05614 J05614 
genbank_AA084948 
genbank^AA086005 
149538J 
416020.1 



H92325T97125 

AW963221 AA344870 AA344871 H93331 
M26958 
R49625F10674 
H60340 N91637 
U82321 H66077 
N49713 N49819W03810 
R25066 R201 44 R20145 Z43845 
AA347668 AW956810 Z44271 F07065 F07064 R13506 
T12603T12604 
H14480N98295 
R43590F10439 
AA210954AA211007 
AI809521 H12174Z42556 
AA429743AA442754 
AA127386R15644AA127404 
AA158245AA158235 
AA071391 AA069892AA069891 
AA07521 1 AA075245 AA075126 AA074946 
U14622 

AA609839 



AA292700 



AA256460 



AA084948 
AA086005 
R10889 R10888 
W31912AI167491 
439280 1 AA676910AA778853AA778865W86800 
46922.1 W42667 AI580740 AI690440 AI561350 AW467906 AW151450 AI825927 AL041716 AI885600 AI742213 AW248624 AI955498 AA033947 

AA845593 AI62371 1 N68583 C00064 AA193567 AW083868 AW163216 AA191595 AA522776 AI628008 AI915518 AA843508 AI92619S 
M176265AW167963AA992115W93647AW103572AI862994AI342059AA911719AA176155AA024712AA069988AA205591 AI591107 
AI199673 AIB1 1766 AJ275832 AI422233 AI191852 AI096682 AI580124 AJ683612 AA582453 AA927559 AA486415 T32414 AI08497B H44849 
H44848 H20477 T91 695 W47039 M070055 AA024795 AA328855 AA379248 AA379330 AA385580 W25920 W03688 AA448359 AA093881 
AW362477 AA089997 A1350265 W93479 N99688 AA932257 AW351469 H68590 AA663402 AA069771 AW087986 AI858420 M60021 4 
AI970774AI857712AI6B3081 A1885584 AW131150 A1567981 AW002714 AW1 89973 AW075495 AW1 68303 AA95371 4 AW51 6881 A1357375 
A1566663 AW512676 AI570580 AI023690AA448216 AI079853 AI422707 AA779516 AW026972 AW1 30082 AW1 62307 AW438646 AA709332 
AW192394AI167350AI217B79AI129152AA719509Ai350480AA663418AI003634AW118546AA180261 AA442833 Al 268625 AA888881 
AI038759 AA846723 AI248770 AA993694 AI280335 AI885107 AW518649 AA641563 AA995835 AA582521 AI276744 M436478 A1017360 
AI620763 A1859887 N73926 A1076327 AI741615 AI160617 AW172819 AI492005 AA677429 AA996334 AI693771 A1950039 A1245629 AI288515 
A18661 86 T93293 M1 73262 AA599779 A16B0092 AW43931 6 AI084555 AI272672 AI583507 AW47321 9 AA7381 32 AW473283 AI367492 
AA995410 AJ689624 AA206353 AI033095 AI040382 AA873630 AI221074 AJ 934840 AI4 18680 AA844306 R94503 AA773520 AA843169 
AA219425 AA629658 AI81 1719 AW41 1 275 AI590981 W37907 AI591 178 AI684051 AA983238 AA669347 AA976239 AA704570 A1628339 
AI884391 AI241580 A1003539 AW176687 AA009650 N34566 AI333493 AI186070 AA070827 AA41 1683 AI280884 M872023 AA207255 
AA021576 N71953 AI885888 AW076039 T15777 AI537673 AW248048 H09554 W93480 W47001 AW0791 14 AA0631 60 AA757453 R60788 
AI859431 H20478 AA218882 AA757465 AA100995 AI864135 AI934209 AA070503 H47008 AA219646 W61039 W93907 AW385050 W37967 
W78028 AA189007 AA479136 R93650 AA442312 T30287 AA847628 AA180262 AA009649 C03892 AW149464 AA310963 AA219693 
AA069747 R29207 AA094784 AA293615 AA447848 AI9841 67 N90393 C05097 N56499 AW292351 AW149681 AW473258 AA629322 AI004409 
AW105577 A1954937 AI811070 AA902422 AW514437 AA535460 AA916877 AW517122 AA974657 AA975649 AW517130 AW517129 F31737 
W07688AA193645AA378994 AA489273 F32267 W39303 AA0211B1 N86B10M406524AA062553 AA436B01 H08985H15979 N40310 
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WO 02/086443 PCTYUS02/12476 

AA436789 AA2321 72 AW360778 W25862 R60282 AA436530 AA378894 AA187461 AI940535 AA604210 MQ8BSU AA360421 N88243 N84281 
AA209340 N561 74 N88374 AA191088 AW247691 AA24901 3 M0931 1 1 AA972536 AW298594 AA375893 T1 21 39 W281 86 AW243849 
AI288629 AA843996 W15260 AI188286 AW248079 R15836 

119599 genbank_W45S52 W45552 

112382 genbank_R59904 R59904 

105264 genbanleAA227934 AA227934 

100071 enlrez^A2B102 A28102 

123315 714071 1 AA495369 AA496646 



Table 6A shows 99 genes up-regulated nonsmokers with lung cancer rela0ve to smokers with lung cancer. These genes were selected from 59680 prabesets on the 
Eos/Affymetrix Hu03 Genechlp array. Gene expression data for each probeset obtained from this analysis was expressed as average Intensity (Al), a normalized value reflecting 
the relative lave) of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unlgene Title: Unigene gene title . 

R1 : average of Al for samples from non-smokers with adenocarcinoma divided by the 90th percentile of Al for samples from smokers with adenocarcinoma 

R2: average of Al for samples from non-smokers with squamous cell carcinoma divided by the 90th percentile of Al for samples from smokers with squamous cell 
carcinoma 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


100971 


BE379727 


Hs.83213 


tatty acid binding protein 4, adipocyte 


101174 


L 17330 


Hs.280 


pre-T/NK cell associated protein 


101296 


Y 12490 


Hs.85092 


thyroid hormone receptor interactor 11 


101304 


AA001021 


Hs.6685 


thyroid hormone receptor interactor 8 


101806 


AA586894 


Hs. 112408 


S100 calcium-binding protein A7 (psorias 


101972 


S82472 




gb:beta -pobDNA polymerase beta {exon a 


102274 


U30930 


Ha 158540 


UDP glycosyl transferase 8 (UDP-galactose 


102394 


mm Q03816 


Hs.2442 


a disintegrin and metalloprotelnase doma 


102832 


U92015 




gb:Human clone 143789 defective mariner 


tniMfl 

IUwU IV 


X52509 


Hs. 161640 


tyrosine aminotransferase 








gb:H.sapiens mRNA for iigase like prate! 


i uojuo 


L02911 


Hs. 150402 


aclrvin A receptor, type t 


103857 


AI076795 


Hs.45033 


lacrimal proline rich protein 


104239 


AB002367 


Hs.21355 


doublecortin and CaM kfnase-like 1 


104590 


AW373062 


Hs.63623 


nuclear receptor subfamily 1, group I, m 


104907 


AA0S5829 


Hs. 196701 


ESTs, Weakly similar to ALU1_HUMAN ALU 


106131 


BE514788 


Hs.296244 


SNARE protein 


106672 


H47233 


Hs.30643 


ESTs 




T56887 


Hs 18282 


KIAA1 134 protein 


incqcn 




Hs 32501 


ESTs 




243845 


Hs 194478 


Homo sapiens mRNA; cDNA DKFZp43401572 (f 


107982 




Hq S78S7 


ESTs, Weakly similar to KJM0758 protel 








gb:zm26c06.s1 Stratagene pancreas (93720 






Hs R9328 


MD-2 protein 


108663 


BE219231 


Hs.292653 


ESTs, Weakly similar to T26845 hypotheii 






Hs 85950 


ESTs 


109630 


R44607 


Hs.22672 


ESTs 


110193 


AI004874 


Hs.310764 


Homo sapiens mRNA; cDNA DKFZp434M082 (fr 


1 iuzg*) 


H24458 


Hs 32085 


EST 


110644 


R94207 


Hs.268989 


ESTs, Highly similar to type ll CALM/AF1 


110886 


AW274992 


Hs.72249 


three-PDZ containing protein similar to 


111057 


T79639 


Hs.14629 


ESTs 


111950 


AF071594 


Hs.1 10457 


Wolf-Hirschhorn syndrome candidate 1 


112291 


R53972 


Hs.26026 


ESTs 


112956 


Z43784 


Hs.75893 


ankyrin 3, node of Ranvier (ankyrin G) 


113009 


T23699 


Hs.7246 


ESTs 


113060 


BE564162 


Hs^50820 


hypothetical protein FU14827 


113073 


N39342 


Hs.103042 


microtubule-associated protein 1B 


113074 


AK001335 


Hs.31137 


protein tyrosine phosphatase, receptor t 


113121 


T48011 


Hs.8764 


EST 


113125 


AA968672 


Hs.8929 


hypothetical protein FIJ1 1362 


113757 


AA703095 


Hs.18631 


ESTs 


113848 


W52854 


Hs.27099 


hypothetical protein FU23293 similar to 


113B84 


AI333076 


Hs.28529 


chromosome 12 open reading frame 2 


113936 


W17056 


Hs.83623 


nuclear receptor subfamily 1, group I, m 


114875 


AA235609 


Hs.236443 


Homo sapiens mRNA; cONA DKFZp564N1063 ( 


114987 


AA251016 


Hs.87808 


EST 


115460 


AW958439 


Hs.38613 


ESTs 


115722 


W91892 


Hs.59609 


ESTs 


116261 


AA481788 


Hs.190150 


ESTs 


116830 


H61037 


Hs.70404 


ESTs, Weakly similar to ALU2.HUMAN ALU 


116970 


AB023179 


Hs.9059 


K1AA0962 protein 


117178 


K98675 


HSJ269034 


ESTs 


117757 


AF088019 


Hs.46732 


EST 


118283 


AA2B7747 


Hs.173012 


ESTs, Weakly similar to A46010 X-IInked 


118384 


AF217525 


Hs.49002 


Down syndrome ceil adhesion molecule 


118657 


AJ822106 


Hs.49902 


ESTs 


120328 


AA923278 


HSJ290905 


ESTs, Weakly similar to protease [H.sapi 


120404 


AB023230 


Hs.96427 


KIAA1013 protein 


120524 


AA261852 


Hs.192905 


ESTs 


120688 


AW207555 


Hs.97093 


Homo sapiens cONA: RJ23004 fis. clone L 



R1 

15.00 



7.50 
7.50 
13.50 
9.50 

9.00 

13.50 

16.50 

7.00 
11.50 

9.50 

16.50 
13.00 

7.00 

1250 

16.50 

8.00 

17.00 

16.50 

11.00 



9.79 
32.50 



19.50 
6.00 



9.50 
850 
7.50 

7.50 
16.50 



7.00 
6.00 
17.92 



R2 

3.64 

2.46 
12.00 
268 
211 



250 
3.94 
12.66 
217 

238 
295 

240 
5.00 



3.00 
279 
4.50 



3.82 
221 

2.65 

6.00 
4.63 
7.00 
6.00 
2.27 
9.00 



268 



2.50 
239 
3.50 



120 



WO 02/086443 




121558 


AA412497 




gb:zl95g12.s1 Soaresjestis jyht Homo sap 


121676 


H56037 


Hs.108146 


ESTs 


121936 


AI024600 


Hs.98612 


ESTs 


121938 


AA428659 


Hs.98610 


ESTs 


122177 


AA435789 


Hs.98833 


EST 


123442 


AA299652 


Hs.1 11496 


Homo sapiens cDNA FU 1 1 643 ns, crane HE 


123551 


AA608837 




gb:af03h12.s1 Scares Jestis_NHT Homo sap 


123756 


AA609971 


Hs.1 12795 


EST 


123861 


AA620840 




gb:af89g01.si Soares_testis__NHT Homo sap 


124371 


N24924 


Hs.188601 


ESTs 


127477 


BE328720 


Hs.280651 


ESTs > 


127591 


AI190540 


Hs.131092 


ESTs" 


128252 


AA455924 


Hs.1 92228 


ESTs 


128426 


A1265764 


HS.14S197 


ESTs 


128925 


R67419 


Hs.21851 


Homo sapiens cuna rui/suu ns, cione in i 


128945 


AI990506 


Hs.8077 


Homo sapiens mRNA; cuna Uf\rxpo4/c im ut 


129105-;' 


• AI769160 


Hs.108681 


Homo sapiens brain tumor associated prct 


129235 


AW977238 


Hs.126084 


KIAA1055 protein 


129506 


AB020684 


Hs.11217 


1/1 A A AQ77 KMl.ln 

KIAA0877 protein 


129595 


U09550 


Hs.1154 


ovtouctai glycoprotein i , i/uxu jmucin s 


130160 


AA305688 


Hs.267695 


UUP-viai;D8iab!CNAC ceia i , j-gaiaciosyiu 


130340 


082326 


Hs.239106 


solute carrier family 3 (cystine, omasi 


131220 


AB023194 


Hs.300855 


KIAA0977 protein 


131430 


AI879148 


H3.26770 


fatty acid binding protein 7, brain 


132114 


NM_006152 


Hs.40202 


rymphoid-festricled membrane protein 


132458 


AA935315 


Hs.48965 


Homo sapiens cDNA: FU21693 fts, cione C 


132647 


NMJ306927 


Hs.54432 


siaryltransferase 4B (bela-galactosidase 


132655 


049372 


Hs.54460 


small Inducible cytokine subfamily A (Cy 


132682 


AI077500 


Hs.54900 


serologically defined colon cancer antig 


132747 


AA345241 


Hs.55950 


ESTs, Weakly similar to KIAA1330 protein 


132812 


R50333 


Hs.92186 


Leman coDed-coi) protein 


133337 


AF085983 


Hs.293676 


ESTs 


133876 


AL134908 


Hs.771 


phosphorylase, glycogen; liver (Hers dis 


134119 


AW157837 


Hs.79226 


fascicuiation and elongation protein zet 


134464 


AA302983 


Hs.239720 


CCR4-NOT transcription complex, subunit 


134542 


M14156 


Hs.85112 


insulin-like growth factor 1 (somatome^ 


135002 


AA448542 


Hs.251677 


G antigen 7B 


135305 


AA203555 


Hs.98288 


Homo sapiens cONA FU14903 fis, clone PL 



PCTVUS02/12476 



10.00 

15.00 

14.00 

8.93 

13.04 

11.50 

11.00 

6.50 



7.00 



10.00 
15.50 

6.50 

20.00 
11.50 
17.50 
6.10 



7.50 



Z95 



87.00 



2.50 

4.33 
3.02 

2.08 
2.11 



4.25 
10.00 



6.15 
5.58 

2.53 
Z50 
2.83 
3.82 
5.00 
3.00 
2.06 
2.27 
11.50 

6.50 



TABLE 6B show the accession numbers for those primekeys lacking unigenelD's for Table 6A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were complied using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Toots {DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed In the 
"Accession' column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number . 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

108562 36375 1 AA 100796 AF020589 AA074629 AA075946 AA1 00849 AA085347 AA1 26309 AA07931 1 AA079323 AA0B5274 

103439 35330J X98266 N41124 

123551 gBnban)oAA608837 AA608837 

123861 genbank_AA620840 AA620840 

102832 entrezJJ92015 U92015 

101972 entrez_S82472 S82472 

121558 genbanRJ*A412497 AA412497 
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Table 7A shows 98 genes down-regulated fn non-smokers with lung cancer relative to smokers with lung cancer. These genes were selected from 59680 probesels on the 
Eos/Affymetrix Hu03 Genechtp array. Gene expression data for each probeset obtained from this analysis was expressed as average Intensity (AO, a normalized value reflecting 
the relative level ofmRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnJgenelD: Unlgene number 

Unigena Tille: Unigene gene title 

R1 : 90th percentile of Al for samples from smokers with adenocarcinoma divided by the average of Al tor samples from non-smokers with adenocarcinoma. 

R2: 90th percentile of Al for samples from smokers with squamous ceil carcinoma divided by the average of Al for samples from non-smokers with squamous cell 
carcinoma. 



Pkey 


ExAccn 


UnigenelD 


Unlgene Title 


R1 


R2 


100187 


D1 7793 


Hs.78183 


aldo-keto reductase family 1, member C3 




164.10 


100380 


082343 


Hs.18551 


neuroblastoma (nerve tissue) protein 




77.40 


100576 


X00356 


Hs.37058 


calcitonin/calcitonin-reiated polypeptid 


102.40 




100971 


BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte 


463.80 




101046 


KOI 160 




(NONE) 


672.00 




101066 


AW970254 


Hs.889 


Charot-leyden crystal protein 


66.00 




101175 


U82671 


Hs.36980 


melanoma antigen, family A, 2 




77.20 


101497 


W05150 


Hs.37034 


homeo box A5 


62.80 




101663 


NIVL003528 


Hs.2178 


H2B histone family, member Q 


78.00 




101677 


NMJJ00715 


Hs.1012 


complement component 4-bindlng protein, 


186.20 




101745 


M88700 


Hs. 150403 


dopa decarboxylase (aromatic L-amlrto aci 


80.09 




101941 


S77583 




gb:HERVK1 Q/HUMMTV reverse transcriptase 


99.20 




102125 


NMJJ06456 


Hs.288215 


siatyltransferase 




103.10 


102242 


U27185 


Hs.82547 


retlnoic acid receptor responder (tazaro 


67.00 




102340 


U37055 


Hs.278657 


macrophage stimulating 1 (hepatocyte gro 


71.60 




102369 


U39840 


Hs.299867 


hepatocyte nuclear factor 3, alpha 




69.70 


102457 


NM.001394 


Hs.2359 


dual specificity phosphatase 4 


153.00 




102669 


U71207 


Hs.29279 


eyes absent (Drosophlla) homolog 2 




65.70 


102796 


AL079648 


Hs.107019 


symptakln; Hunting tin interacting protei 




58.80 


102829 


NMJJ06183 


Hs.80962 


neurotensin 




268.80 


103207 


X72790 




gb:Human endogenous retrovirus mRNAtbr 


70.00 




103242 


X76342 


Hs.389 


alcohol dehydrogenase 7 (class IV), mu o 




212.10 


103260 


X78416 


Hs.3155 


casein, alpha 




130.70 


103351 


X89211 




gb:H.sapiens DNA for endogenous retrovir 


64.60 




104212 


AB002298 


Hs.173035 


K1AA0300 protein 


66.80 




104252 


AF002246 


Hs.210863 


cell adhesion molecule with homology to 


83.60 




104258 


AF007216 


Hs.5462 


solute carrier family 4, sodium blcarbon 


94.40 




105024 


AA1 26311 


Hs.9879 


ESTs 


68.20 




106260 


AI097144 


Hs.5250 


ESTs, Weakly similar to ALU1_HUMAN ALU S 




74.60 


106440 


AA449563 


Hs.151393 


glutamate-cystelne Bgase, catalytic sub 




71.10 


106566 


BE298210 




gb:601 1 1801 6F1 NIH.MGC J7 Homo sapiens c 


73.20 




106605 


AW772298 


Hs.21103 


Homo sapiens mRAlA; cDNA DKFZp564B076 (fr 


83.80 




106614 


AA648459 


Hs.335951 


hypothetical protein AF301222 




62.30 


106654 


AW075485 


Hs.286049 


phosphoserine aminotransferase 




202.40 


106999 


H93281 


Hs.10710 


hypothetical protein FU20417 




89.60 


108700 


M121518 


Hs.193540 


ESTs, Moderately similar to 2109260A B c 




66.40 


108810 


AW295647 


Hs.71331 


hypothetical protein MGC5350 




95.50 


108857 


AK001468 


Hs.62180 


anillin (Orosophila Scraps homolog), act 




63.40 


109597 


AA989362 


Hs.293780 


ESTs 


85.00 




109691 


T6556B 


Hs.12860 


ESTs 




58.70 


109704 


AI743680 


Hs.12876 


ESTs 




60.60 


110942 


R63503 


Hs.28419 


ESTs 


76.40 




111722 


R23924 


Hs.23596 


EST 


74.60 




112891 


T03927 


Hs.293147 


ESTs, Moderately similar to A46010 X-ll 


64.80 




112992 


AL157425 


Ks.133315 


Homo sapiens mRNA; cONA DKFZp761 J1324 (f 




76.70 


113073 


N39342 


Hs.103042 


mterotubule-assoclaied protein 1B 




120.20 


114251 


H15261 


Hs.21943 


ESTs 


127.20 




115230 


AA278300 


Hs.124292 


Homo sapiens cDNA: FU23123 fis, clone L 


174.00 




115291 


BE545072 


Hs.122579 


hypothetical protein FU 10461 




91.00 


115815 


AW905328 


Hs.180842 


ribosomaJ protein L13 


66.40 




115909 


AW872527 


Hs.59761 


ESTs, Weakly similar to DAP1 JiUMAN DEATH 




226.60 


115965 


AA001732 


Hs.173233 


hypothetical protein FU 10970 


82.80 




116107 


AL133916 


Hs.172572 


hypothetical protein FU20093 




361.60 


116552 


D20508 


Hs.164649 


hypothetical protein DKFZp434H247 


69.00 




116571 


D45652 




gb:HUMGS02848 Human adult lung 7 direct 


64.20 




118466 


N66741 




gb:yz33g08.s1 Morton Fetal Cochlea Homo 




63.50 


120484 


AA253170 


Hs.96473 


EST 


81.60 




120983 


AA398209 


Hs.97587 


EST 




81.10 


121034 


AL389951 


Hs.271623 


nucleoporin 50kD 




66.20 


121423 


AW973352 


Hs.290585 


ESTs 


64.40 




122553 


AA451884 


Hs.190121 


ESTs 




60.40 


122946 


AI718702 


Hs.308026 


major histocompatibility complex, class 


188.60 




123130 


AA487200 




gb:ab19f02.s1 Stratagene lung (937210) H 




80.20 


124472 


N52517 


Hs.102670 


EST 


71.00 




124526 


N62096 


Hs.293185 


ESTs, WeaHy similar to JC7328 amino ad 




104.90 


125489 


H49193 


Hs.124984 


ESTs, Moderately similar to ALU7.HUMAN A 




72.00 


125731 


R61771 


Hs.26912 


ESTs 




69.90 


125747 


NM_002884 


Hs.865 


RAP1A, member of RAS oncogene family 


69.00 




126020 


H79863 


Hs.114243 


ESTs 




6Z40 


126547 


U47732 


Hs.84072 


transmembrane 4 superfamily member 3 




62.80 


126966 


R38438 


Hs.182575 


solute canter family 15 (H+/peptide tra 




60.10 
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127472 


AA761378 


Hs.192013 


127610 


AA960867 


Hs.1 50271 


127742 


AW293498 


Hs.180138 


127987 


AI022103 


Hs.124511 


128233 


AW889132 


Hs.11916 


128420 


AA650274 


Hs.41296 


128766 


AW160432 


Hs.296460 


129014 


AW935187 


Hs.170162 


129215 


AB040930 


Hs.126085 


13O0S0 


H97878 


Hs.132390 


130385 


AW067800 


Hs.155223 


130732 


AW890487 


HS.639B4 


131025 


AB040900 


Hs.6189 


131241 


BE501914 


Hs.24654 


131775 


AB014548 


Hs.31921 


132240 


AB018324 


Hs.42676 


132856 


NM_001448 


Hs.58367 


132977 


AA093322 


Hs.301404 


133749 


L20852 


Hs.10018 


133818 


AI110684 


Hs.7645 


134264 


AF149297 


Hs.8087 


134265 


M83772 


Hs.80876 


134346 


X84002 


Hs.82037 


134395 


AA456539 


Hs.8262 


135047 


AL134197 


Hs.93597 


135056 


N75765 


Hs.93765 


135309 


A1564123 


Hs.42500 



ESTs 

ESTs, Highly similar to unnamed protein 

ESTs 

ESTs 

riboklnase 

fibronectin leucine rich transmembrane p 

craniofacial development protein 1 

WAA1357 protein 

WAA1497 protein 

zinc finger protein 36 (KOX 18) 

slanniocalcin 2 

cadherin 13, H-cadherin (heart) 
WAA1467 protein 

Homo sapiens cDNA FU11640 fis, clone HE 
KIAA0648 protein 
KJAA0781 protein 
glypican 4 

RNA binding motif protein 3 
solute carrier family 20 (phosphate tran 
fibrinogen, B beta polypeptide 
NAG-5 protein 

flavin containing monooxygenase 3 
TATA box binding protein (TBP)-associate 
lysosomal-associated membrane protein 2 
cyctin-dependent kinase 5, regulatory su 
rjpoma HMGIC fusion partner 
ADP-ribosytetion factor-like 5 



70.20 
64.00 
85.20 
96.60 



66.80 
64.20 



64.40 
76.20 
97.80 



133.20 
341.00 

66.00 



71.40 
70.40 



78.90 
106.90 

58.53 



139.60 
64.60 



71.00 



59.30 

64.30 
232.53 

75.80 
108.30 



TABLE 7B shows the accession numbers for those primekeys lacking unigenelD's for Table 7A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubieTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed In the 
'Accession' column. 

Pkey: Unique Eos probeset identifier number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
103207 



116571 
118466 
101046 
101941 
103351 
123130 



CAT number Accessions 
30835.-4 X72790 

120358 1 BE298210 AI672315 AW086489 BE298417 M455921 M902537 BE3271 24 R14963 AA085210 AW274273 A1333584 A1369742 AI039658 
AI8B5095 AI476470 AI287650 AI885299 AI985381 AW592624 AW3401 36 A1266556 AA456390 AI31 081 5 AA484951 



genbank_D45652 
genbank_N66741 
entrezJ<01160K01160 
entrez_S77583 S775B3 
entrezJC8921 1X89211 
genbank_M487200 



D45652 
N66741 



AA487200 
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Table 8A shows 1720 genes either up or down-regulated in lung tumors or chronically diseased lung relative to a broad collection of over 40 distinct normal body tissues. 
Chronically diseased lung samples represent chronic non-malignant lung diseases such as fibrosis, emphysema, and bronchitis. These genes were selected from 39494 
probesets on the Eos/Afrymetrix Hu02 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (Al), a 
normalized value reflecting the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiD: Unigene number 

Unlgene Title: Unigene gene title 

R1 : 70th percentile of Al for lung tumors divided by 90th percentile of Al for norma) lung 

R2: 70th percentile of Al for chronically dseased tang cMfed by 90th percenfi/e of Al for normal lung 



Pkey 


ExAccn 


UnigeneiD 


Unigene Title 


300097 


AI916973 


Hs.213603 


ESTs 


300117 


AW189787 


Hs.1 47474 


ESTs 


300197 


AI686661 


Hs.218286 


ESTs 


300201 


AI308300 




gb:la90c06jt1 NCt_CGAP..Bm20 Homo sapien 


300225 


AI989963 


Hs.1 97505 


ESTs 


300247 


AW274682 


Hs.161394 


ESTs 


300256 


AI469095 


Hs.298241 


Transmembrane protease, serine 3 


300337 


AI707881 


Hs.202090 


ESTs 


300362 


Z42308 




gb:HSC0FB121 normalized infant brain cDN 


300374 


AI859947 


Hs.314158 


ESTs 


300387 


AW270150 


Hs.254516 


ESTs 


300440 


AI421541 


Ks.146164 


ESTs 


300441 


R10387 


Hs.307921 


EST, Weakly similar to 2232_HUMAN ZINC F 


300449 


AI362967 


Hs.1 32221 


hypothetical protein FU12401 


300469 


AW1 35830 


Hs 233955 


hypothetical protein FU20401 


3005S? 


X85711 


Hs.21838 


hypothetical protein FU1 1 191 


300627 


W27363 




gb:ab37d01 x\ Stratagene HeLa cell s3 93 


qnnBio 


AW11R822 


Hs 128757 

no. i A.uf vi 


ESTs 


OUUf 10 


AI91 Al 11 
All IOI lO 


TO. KOaOU 


hunnlhnflrnl nmtpln FLI 23393 


300738 


A1623332 


Hs.1 30541 


KIAA1542 protein 


300777 


AA235361 


Hs 96840 


KIAA1527 protein 


300790 


AI492471 


Hs 188270 


ESTs 


300832 


A1R88147 


Hs 220615 


ESTs, Weakly similar b T03829 transcrlp 


30OR3R 


244942 


Hs 22958 


calcium channel a)pha2-delta3 subunit ■ 


300838 


AI582897 


Hs 192570 


hypothetical protein FU22028 


300878 


AW449802 


Hs. 285901 


Homo sapiens cONA FU20428 Ms, clone KA 




AI890356 


Hs.1 27804 


ESTs, Weakly similar to T1 7233 hypotheti 


300Q9R 






gb:abQ3a10.s1 Stratagene fetal retina 93 


uUUOUU 


AI041019 


Hs 152454 


ESTs 


3fMQfi1 


AW9fWlfiQ 

MVVlU*HJOS7 


Hs 317716 


FSTs WeaWv similar to unnamed orotein 




AASQ3373 


Hs 293744 * 


ESTs 


300QR7 


AARR*i20<) 


Hs 260439 


ESTs 


300987 


AW450840 


Hs.1 48590 


ESTs, Weakly similar to AF208846 1 BM-00 


300988 


AI927208 


Hs.208952 


ESTs 


301050 


AW 136973 


Hs.288516 


ESTs, Weakly similar to S69890 mitogen I 


301098 


AA677570 


Hs.1 8591 8 


ESTs 


301157 


AA729905 


Hs.231916 


ESTs 


301162 


AH 421 18 


Hs.1 29004 


ESTs 


301170 


AA737594 


Hs.247606 


ESTs 


301192 


AIB08751 


Hs.121188 


ESTs 


301193 


AA758115 


Hs.1 28350 


ESTs, Weakly similar to JC5423 2-hydroxy 


301267 


AW297762 


Hs.255690 


ESTs 


JVJ t CO 1 


AAR43QR6 


Hs 100586 


ESTs 


-an 134.1 


AIA1Q1QR 

AlOlO ISO 


Hs 208229 


ESTs 


301382 


AAS12839 


Hs 163369 


ESTs 


301407 


aw/450466 


Hs. 126830 


ESTs 


301452 


AAQ75fiflfl 


Hs 159955 


ESTs 




AUU9794R7 


He 2K4R l ft 


Unfitted 


301494 


AI678Q34 


Hs.131099 


ESTs 


301521 


AJ733621 


Hs.1 33011 


zinc finger protein 1 17 (HPF9) 


301531 


AI077462 


Hs.1 34084 


ESTs 


301580 


AI878959 


Hs.73737 


splicing factor, arglnlne/serine-rich 1 


301676 


Z43570 


Hs.27463 


ESTs, Moderately similar to G01251 Rar p 


301690 


F05865 


Hs.108323 


ubiquitJn-conjugating enzyme E2E 2 (homo 


301718 


F07744 


Hs.7987 


DKFZP434F162 protein 


301799 


AA3B4252 


Hs.286132 


D15F37 (pseudogene) 


301804 


AA5B1004 


Hs.62180 


anfllin (Drosophlla Scraps homolog), act 


301822 


X17033 


Hs.271986 


Integrin, alpha 2 (CD49B, alpha 2 subunl 


301846 


R200O2 


Ks.6823 


hypothetical protein FU10430 


301868 


T71508 


Hs.1 3861 


ESTs. Weakly similar to pH sensitive max 


301882 


T78054 




gb:yc97g09j1 Soares Infant brain 1NIB H 


301905 


AI991127 


Hs.1 17202 


ESTs 


301948 


AA344647 


Hs.1 16724 


aldo-keto reductase family 1, member B1 1 


301960 


AW070252 


Hs.27973 


K1AA0874 protein 


302011 


T91418 


Hs.125156 


transcriptional adaptor 2 (A0A2, yeast, 


302016 


N40834 


Hs.23495 


hypothetical protein FU11252 


302041 


NM_001501 


Hs.1 29715 


gonadotropln-releaslng hormone 2 


302072 


AJ238381 


Hs.132576 


paired box gene 9 


302094 


AI286176 


Hs.6786 


ESTs 


302095 


AW044300 


Hs.137506 


Homo sapiens BAC clone RP1 U120J2 from 7 


302148 


AW269618 


Hs.23244 


ESTs 



R1 


R2 


5.46 


4.69 


0.58 


0.56 


4.26 


5.44 


0.62 


0.83 


1.68 


1.75 


1.08 


2.28 


0.86 


1.00 


5.80 


9.09 


4.18 


12.78 


2.99 


4.38 


1.50 


2.53 


3.98 


5.25 


3.18 


6.80 


0.43 


0.62 


0.16 


0.83 


4.10 


9.75 


4.60 


12.60 


2.91 


5.86 


1.00 


0.92 


1.82 


1.71 


4.48 


8.22 


1.29 


1.18 


5.51 


8.56 


4.90 


6.34 


1.70 


2.81 


4.56 


7.91 


2.23 


1.58 


2.13 


3.50 


274 


4.46 


1.00 


1.00 


1.46 


1.51 


0.39 


1.30 


1.49 


1.08 


0.16 


0.37 


3.23 


1.94 


6.76 


14.28 


3.16 


8.85 


1.68 


7.18 


4.40 


6.42 


6.38 


11.59 


4.35 


7.78 


1.66 


1.61 


2.19 


1.78 


0.76 


0.76 


1.00 


1.81 


1.48 


1.51 


0.51 


1.46 


2.40 


5.02 


2.79 


3.41 


0.67 


0.67 


2.52 


3.76 


7.41 


11.92 


8.31 


10.70 


170 


4.22 


4.20 


8.78 


5.93 


7.04 


1.70 


0.76 


1.58 


1.36 


1.00 


1.00 


2.88 


5.49 


Z28 


3.60 


1.00 


1.00 


5.28 


2.28 


5.3B 


6.48 


3.03 


3.42 


1.00 


1.25 


0.71 


0.99 


1.60 


1.71 


0.52 


1.20 


Z75 


4.93 


3.04 


3.87 
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302155 


A1088485 


Hg.144759 


302201 


AJ006276 


Hs.159003 


302202 


AF097159 


Ks.1 59140 


302206 


AI937193 


Hs.41143 


302209 


AF047445 


Hs.159297 


302235 


AL049987 


Hs.166361 


302290 


AL1 17607 


Hs.175563 


302328 


AA354849 


Hs.23240 


302346 


AL039101 


Hs.1 94625 


302360 


AJ010901 


Hs.198267 


302384 


Y08982 


Hs.202676 


302406 


U86751 


Hs.211956 


302409 


AF155156 


Hs.218028 


302423 


AB028977 


Hs.225974 


302432 


AL080068 


Hs.272534 


302435 


AF092047 


Hs.227277 


302437 


AB024730 


Hs.227473 


302455 


M356923 


Hs.240770 


302472 


AA317451 


Hs.6335 • 


302476 


AF182294 


Hs.241578 


302489 


T80660 


Hs.230424 


302490 


AA885502 


Hs.187032 


302S62 


AJ005585 


Hs.48956 


302566 


AA085996 


Hs.248572 


302630 


AB029488 


Hs.272100 


302634 


AB032953 


Hs.173560 


302638 


AA463798 


Hs.102696 


302647 


X57723 


Hs.198273 


302655 


AJ227892 


Hs.146274 


302656 


AW293005 


Hs.70704 


302668 


AA580691 


Hs.180789 


302679 


H65022 




302680 


AW1 92334 


Hs.38218 


302697 


AJ001408 




302705 


U09060 




302711 


L08442 




302719 


W69724 


Hs.288959 


302742 


L12069 




302755 


AW384815 


Hs.149208 


302771 


H98476 


Hs.42522 


302789 


AJ245067 




302795 


AJ245313 


Hs.272838 


302802 


. Y08250 




302803 


AA442824 


Hs.293961 


302812 


N31301 


Hs.152664 


302847 


X98940 




302885 


AL1 37763 


Hs.132127 


302943 


AI581344 


Hs.127812 


302977 


AW263124 


H8.315111 


303006 


AF078950 


Hs.24139 


303011 


AF090405 




303013 


F07898 


Hs.286968 


303061 


AF151882 


Hs.27693 


303077 


AF163305 




303090 


M443259 


Hs.146288 


303091 


AF192913 


Hs.130683 


303094 


AF195513 


Hs.278953 


303095 


AF202051 


Hs.134079 


303131 


AW081061 


Hs.103180 


303195 


AA082211 


Hs.233936 


303196 


AA082298 


Hs.59710 


303216 


AAS81439 


Hs.152328 


303222 


AA333538 


Hs.204501 


303234 


AA132255 


Hs.143951 


303251 


AW340037 


Hs.115897 


303295 


AA205625 


Hs.208067 


303297 


T80072 


Hs.13423 


303316 


AF033122 


Hs.14125 


303467 


AA398801 


Hs.323397 


303506 


M340605 


Hs.105887 


303552 


AA359799 


Hs.224652 


303598 


AA382814 




303837 


AF056083 


Hs.24879 


303655 


AA504702 


Hs.258802 


303756 


A1738488 


Hs.115838 


303856 


AA968589 


Hs.180532 


303893 


N88597 


Hs.1 13503 


303907 


AW467774 


Hs.171880 


303946 


AW4741S6 


Hs.306637 


303978 


AW513315 




303981 


AW513804 


Hs.278834 


303990 


AW515465 




303998 


AW516449 




303999 


AW516611 




304006 


AW517947 





ESTs 



UDP-Gal:betaGlcNAc beta 1,4- galactosyl* 
phosphoinosttlde-specific phosphollpase 
killer cell lectin-Iike receptor subfami 
Homo sapiens mRNA; cDNA DKFZp564F112 (fr 
Homo sapiens mRNA; cDNA DKFZp564N0763 (f 
Homo sapiens cDNA FU13496 fis, clone PL 
dynein, cytoplasmic, light intermediate 
mucin 4, tracheobronchial 
synaptonemaJ complex protein 2 
C03-epsilon-assaciated protein; antisens 
adaptor-related protein complex 4, epsil 
KIAA1054 protein 

Homo sapiens mRNA; cDNA DKFZp564J062 (fir 
sine ocults homeobox (Drosophita) homolo 
UDP-N-acetylgtucosamine:a-1 ,3-CMnannosld 
nuclear cap binding protein subunit 2, 2 
SW1/SNF related, matrix associated, act] 
U6 snRNA-associated Sm-like protein LSmB 
Homo sapiens cDNA FU13540 te, clone PL 
ESTs 

gap junction protein, beta 6 {connexin 3 
hypothetical protein FU22965 
SMS3 protein 

odd Oz/ten-m homolog 2 (DrosophUa, mous 
MCT-1 protein 

NADH dehydrogenase (ubiquinone) 1 beta s 
ESTs 

Homo sapiens, clone IMAGE:2823731, mRNA, 
S164 protein 

gb:yu66g11.r1 Weizmann Olfactory Epithel 
ESTs 

gb:Homo sapiens mRNA for Immunoglobulin 

gb:Human immunoglobulin heavy chain, V-r 

gb:Human autonomously replicating sequen 

hypothetical protein FU20920 

gb:Homo sapiens (clone WR4.10VH) antMh 

KIAA1555 protein 

ESTs 

gbiHomo sapiens mRNA for immunoglobulin 

hypothetical protein FU 10494 

gb:H.saplens mRNA for variable region of 

ESTs, Moderately similar to putative DNA 

hypothetical protein FU20051 

gb:Rsapiens rearranged Ig heavy chain ( 

hypothetical protein LOC57822 

ESTs, Weakly similar to T17330 hypolhell 

hypothetical protein FL1 12894 

Homo sapiens cDNA: FU23137 fis, clone L 

gb:Homo sapiens clone 2A1 scFV anitbody 

RAB22A, member RAS oncogene family 

peplidyiprolyl Isomerase (cyclophilin)-l 

gb:H.saplens T-cell receptor mRNA 

klnesln family member 13A 

zinc finger protein 180 (HHZ168) 

Pur-gamma 

NM23-H8 

DC2 protein 

myosin, light polypeptide, regulatory, n 

ESTs 

ESTs 

hypothetical protein FU10S34 
ESTs 

protocadherln 12 
ESTs 

Homo sapiens done 24468 mRNA sequence 
p53 regulated PA26 nuclear protein 
ESTs 

ESTs, Weakly similar to Homolog of rat Z 
ESTs, Weakly similar to unnamed protein 
gb;EST96097 Testis I Homo sapiens cDNA 5 
phosphatide acid phosphatase type 2C 
ATPase, (Na+)/K+ transporting, beta 4 po 
ESTs 

glucose phosphate isomerase 
karyopherin (importln) beta 3 
polymerase (RNA) ii (DNA directed) polyp 
Homo sapiens cDNA FU 12363 fis, clone MA 
gb:xo43c12j(1 NCLCGAPJJtl Homo sapiens 
ESTs, Weakly similar to ALU INHUMAN ALU S 
gb:xu71a1 1 Jc1 NCl_CGAP_Kid8 Homo sapiens 
gb:xl68f05.x1 NO.CGAP.Ut2 Homo sapiens 
gb:xp70b11.x1 NCI_CGAP.Ov39 Homo sapiens 
gb:xt66h02.x1 NCI_CGAPJJt2 Homo sapiens 
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1.00 


1.86 


4.48 


0.10 
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0.09 


0.04 


1.00 
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9.14 


2.06 


Z02 


1.00 


1.24 


1.08 


1.43 


1.76 


1.31 


2.30 


2.57 


3.10 


5.79 


5.06 


11.86 


5.14 


7.31 
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1.15 
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9.35 
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6.28 


3.21 


4.07 



125 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

304008 AW518198 Hs.3297 

304009 AW518206 Hs.181165 
304024 T03036 

304026- T03160 

304028 T03266 

304036 T16855 Hs.244621 

304046 T54803 

304061 T61521 



PCT/US02/12476 



304063 T62536 

304097 R26376 

304114 R78946 

304122 H28966 

304155 K68696 

304203 N56929 

304234 W81608 

304267 AA064862 

304270 AA069711 

304287 AA079286 

304348 AA179868 

304415 AA290747 

304430 M347682 

304456 AA411240 

304521 M464716 
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305726 AA828156 
305728 AA828209 
305759 AA835353 
305792 M845256 
305864 AA864374 
305901 AA872968 
305910 AA875981 
306015 AA897116 
306017 M897221 
306020 AA897630 
306063 AA906316 
306065 AA906725 
306104 AA910956 
306109 AA911861 
306148 AA917409 
306242 AA932805 
306288 AA936900 
306325 AA953072 
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306375 AA968650 
306396 AA970223 
306428 AA975110 
306442 AA976899 
306446 AA977348 



Hs.177592 



Hs.73742 
Hs.297753 
Hs.78466 

Hs.169476 



Hs.169476 
Hs.297681 

Hs.111334 
Hs.3463 



Hs.13801 

H3.284136 

Hs.297753 

Hs.282435 

Hs.300697 



Hs.303405 

Hs.275668 

Hs.169476 

Hs.81328 

Hs.163019 
Hs.300697 



Hs.287445 
Hs.303512 

Hs,272572 

Hs.275865 



Hs.73742 



Hs.73742 



Hs. 109058 
Hs.130027 



Hs.288036 



Hs.210546 
Hs.275865 
Hs.276018 

Hs.191228 



ribosomal protein S27a 
eukaryotic translation elongation factor 
gb:FB21 B7 Fetal brain, Stratagene Homo s 
gb:FB26F2 Fetal brain, Stratagene Homo s 
gb:FB7C1 Fetal brain, Stratagene Homo sa 
ribosomal protein S14 
gb:yb42d06.s1 Stratagene fetal spleen (9 
gb:yb73g01.s1 Stratagene ovary (937217) 
gb:yc04c12.s1 Stratagene lung (937210) H 
ribosomal protein, large, PI 
gb:yi87gOZs1 Soares placenta Nb2HP Homo 
gb:ym31a06.s1 Soares Infant brain 1NIB H 
gb:yr78b06.s1 Soares fetal liver spleen 
gb:yy82d08.s1 Soaresjnulfipie_sclerosis_ 
gbtzd88h08.s1 SoaresJetaLriearLNbHH19W 
ribosomal protein, large, P0 



proteasome (prosome, macropaln) 26S sub 
gb:zp38g12.s1 Stratagene muscle 937209 H 
glyceraldehyde~3-phosphate dehydrogenase 
gb:EST54044 Fetal heart II Homo sapiens 
gb:zv26g05.s1 Soares_NhHMPu_S1 Homosapl 
gb:zx82c11.s1 Soares ovary tumor NbHOT H 
gb:zx02c05.s1 SoaresJotaLtetus_Nb2HF8_ 
glyceraldehyde-3-phosphate dehydrogenase 
serine (or cysteine) proteinase Inhlblto 
gb:nh85e08.s1 NCI_CGAP_Br1.1 Homosapien 
ferritin, light polypeptide 
ribosomal protein S23 

gb:nm75h11.s1 NCLCGAP_Co9 Homo sapiens 
gb:nn13g09.s1 NCLCGAP„Co12 Homo sapiens 
KIAA16B5 protein 
PRO2047 protein 
vimentln 
ESTs 

immunoglobulin heavy constant gamma 3 (G 
gbzu89h06.s1 SoaresJestisJJHT Homo sap 
gb:ab99c04.s1 Stratagene lung (937210) H 
gb:nr72a12.s1 NCLCGAP.Pr24 Homo sapiens 
ESTs 

gbirt01g08.s1 NCI_.CGAPJ.ym3 Homo sapiens 
EST, Weakly similarto EF1D_HUMAN ELONG 
gb:ag57d12.8l Gessler Wilms tumor Homo s 
gtyceraldehyde-3-phosphate dehydrogenase 
gb:ag37e01.s1 Jia bone marrow stroma Horn 
nuclear factor of kappa tight potypeptld 
gb2j44f07.s1 SoaresJetal_liver_spleen_ 
EST 

Immunoglobulin heavy constant gamma 3 (G 
gb:al10f08.s1 Soares j)arathyro3d_lumor_N 
gb:nx10c08.s1 NCLCGAPJ3C3 Homo sapiens 
hypothetical protein FU11726 
EST 

gb:nz12eQ5.s1 NCI_CGAP_GCB1 Homo sapiens 
hemoglobin, alpha 2 

gb:a|09h02.8l Soares_parathyroidJumor_N 
ribosomal protein S18 

gb:oe29a12£l NCI CGAP Pr25 Homo sapiens 
gb:oe29c12.s1 NCLCGAP_Pr25 Homo sapiens 
gb;nw31e04.s1 NCI_CGAP_GCB0 Homo sapiens4. 
gb:a!67a05.s1 Soaras_tostis_NHT Homo sap 
ribosomal protein, large, PO 
gb:of34a02.s1 NCLCGAPJ<ld5 Homo sapiens 
gb:ak72b06.s1 Barstead spleen HPLRB2 Horn 
gbak64a08.s1 Barstead spleen HPLRB2 Horn 
ribosomal protein, large, P0 
gb:oh63h08.s1 NCLCGAPJCd5 Homo sapiens 
gb:nx21h02.s1 NCI_CGAP_GC3 Homo sapiens 
gb:am08b07.s1 Soares_NFU.T_GBC.S1 Homosl 
ribosomal protein S6 kinase, 90XD, polyp 
EST 

gb:ok03g03.s1 Soares_NFI_JLGBC_S1 Homos 
gb:ok78g02.s1 NCLCGAPJ3C4 Homo sapiens 
gb:ok85h1 1.s1 NCI_CGAP_Kid3 Homo sapiens 
gb:og21e07.s1 NCi_CGAP_PNS1 Homo sapiens 
tRNA teopentenylpyrophosphate transferas 
gb:oo60g0As1 Na_CGAP_Lu5 Homo sapiens 
gb:o!53h05.s1 NC(_CGAP_HN3 Homo sapiens 
interieukin 21 receptor 
ribosomal protein S18 
EST, Moderately similar to JC4662 ribos 
gb:opQ9d05.s1 NCl_CGAPJ<ld6 Homo sapiens 
hypothetical protein FU20284 
gb:oq35e09.s1 NCLCGAPJ3C4 Homo sapiens 
gb:oq72e12.s1 NCI CGAPJGd6 Homo sapiens 
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308458 


AA978186 




306467 


AA983508 


Hs.163593 


306510 


AA988546 




306555 


M994304 


Hs.276083 


306557 


AA994530 




306572 


AA995686 




306582 


AA996248 




306598 


AI000320 


Hs.1 69476 


306605 


AI000497 


H8.1 19500 


306656 


AI004024 




306676 


AI005603 


Ks.284136 


306686 


AI015615 




306702 


AI022565 


Hs.307670 


306728 


AI027359 


Hs.272572 


306751 


AI032589 




306767 


AI038963 


Hs.249118 


306892 


A1092465 




306897 
306956 


AI093967 
AI125111 




306958 


AI125152 




307035 


AI142774 


Hs.1 19122 


307041 


Al 144243 




307091 


AI167439 




307181 


AI1B9251 




307297 


AI205798 


Hs.1 11334 


307317 


Al 208303 


Hs.1 47333 


307327 


A12U142 


Hs.246381 


307382 


AI223158 


Hs.147885 


307410 


AI241715 


Hs.77039 


307415 


AI242118 




307423 


AI243206 


Hs.1 79573 


307426 


AI243364 




307517 


A1275055 




307551 


AI281556 




307561 


A1282207 




307608 


AI290295 




307657 


Al 306428 


Hs.298262 


307691 


A1318285 




307701 


AI318583 

niv 1 v www 


Hs.276672 


307718 


ai aaadflfi 


Hi 837S^ 


307730 


Al 336092 




307760 


AI342387 




307764 


AI342731 




307783 


AI347274 




307796 


AI350S56 




307807 

ww f Our 


Al 351 799 

nlwv 1 1 ww 




307BQ8 

uwf UvO 


Al 351 826 

rvl WW I W^w 




307820 


Al 355761 




307830 


Al 358722 


Hs.276737 


307852 


AI365541 




307902 


Al 380462 




307997 


Al 434512 


Hs.1 81 165 


308002 


Al 435240 


HS.2B3442 


308011 


A! 439473 




308023 


AI452732 


Hs.251577 


308041 


AI458824 


Hs. 169476 


308059 


AI468938 


Hs.276877 


30B085 


A1474135 


Hs.181165 


30B101 


AI475950 

rij*f * w ww w 


Hs.1 81 165 


308106 


AI476803 




308122 


A! 480 123 


Hs,309411 


308154 


A1500600 




308171 


AI523632 


Hs.298766 


30821 1 


AI557029 


Hs.278572 


308213 


AI557041 




308216 


AI557135 




308219 


AI557246 




308271 


AI567844 


Hs. 252259 


308319 


AI583983 


Hs.181165 


308362 


A1613519 


Hs.1 05749 


308413 


A1636253 


Hs.1 96511 


308450 


AI660860 


Hs.96840 


308464 


AI672425 


Hs.277117 


308588 
308599 


AI718299 
AI719893 




308615 


AI738593 


Hs.101774 


308643 


AI745040 




308673 
308697 


AI760864 
AI767143 




308762 


AI607405 


Hs.259408 


308778 


AI811109 




308762 


A1811767 


Hs.2186 


308808 


A1818289 




308823 


AI824118 


Hs.217493 


308875 


AI832332 





gb:op33c06.s1 Soares_NFLJ_GBC_S1 Homos 
ribosomal prolan L18a 

gb:or84d07.s1 NCI_CGAP_Lu5 Homo sapiens 
EST, Weakly similar to RL23_HUMAN 60S R 
gb:ou57e08.s1 NCl_CGAP_Br2 Homo sapiens 
gb:os25c12.s1 NCLCGAP^WdS Homo sapiens 
gb:os18c10.s1 NC!_CGAP_Kid5 Homo sapiens 
gIycera!dehyde-3- phosphate dehydrogenase 
ribosomal protein, large P2 
gb:ou11b07j(1 Soares_NFL_T_GBC_S1 Homos 
PRO2047 protein 

gb:ov29f10J(1 SoaresJestisJJHT Homo sap 
EST 

hemoglobin, alpha 2 

gb:ow70h12.s1 Soares_fetalJver_spleen_ 
ESTs 

gb:qa75h12.x1 SoaresJetaUujarLNbHH19W 
gb:qa33c06.s1 Soares_NhHMPu_S1 Homosapi 
gb:am66f03.s1 Barstead spleen HPLRB2 Horn 
gb:am55e09.x1 Johnston frontal cortex Ho 
ribosomal protein L13a 

gb:qb85b12j(1 Soares_fetaLhearLNbHH19W 
gb:ox70h06.s1 Soares_NhHMPu_S1 Homo sapl 
gb:qc99g06.x1 Soares_pregnant_uterus_NbH 
ferritin, light polypeptide 
EST 

CD68 antigen 
ESTs 

ribosomal protein S3A 

gb:qh92b02j<1 Soares_NFL_T_GBC_S1 Homos 
collagen, type I, alpha 2 

gb:qh30g11j(1 Soares_NFkJ_GBC_S1 Homos 
gb:ql72d03.x1 Soares_NhHMPu_S1 Homo sapl 
gb:qu52M 1 jc1 NCI_CGAP_Lym6 Homo sapiens 
gb:qp65a12j<1 Soares_fetalJung_NbHL19W 
gb:qm01f02j<1 Soares_NhHMPu_S1 Homo sapi 
ribosomal protein S19 

gb:lb17b01.x1 NO JSGAPJX37 Homo sapiens 
EST, Weakly similar to RL6.HUMAN 60S Rl 
small nuclear ribonucteoproteln polypept 
gb:qt43b07j(1 SoaresJetaLlung_NbHL19W 
gb:qt27f07.x1 Soares_pregnanLuterus_NbH 
gb:qo26a07j(1 NCI_CGAP_Ui5 Homo sapiens 
gb:tc05dQ2jc1 NCLCGAP_Co1 6 Homo sapiens 
gb:qt18f09.x1 NCI_CGAP_GC4 Homo sapiens 
gb:qt09d02.x1 NCI_CGAP_GC4 Homo sapiens 
gb:qt09gO3.x1 NCI_CGAP_GC4 Homo sapiens 
gbxjt94a11jt1 NCI_CGAP_Co14 Homo sapiens 
EST, Weakly similar to R5HU22 ribosomal 
gb:qz08g05.x1 NC1_CGAP_CLL1 Homo sapiens 
gbig02h05.x1 NCLCGAP_CLL1 Homo sapiens 
eukaryotic translation elongation factor 
ESTs 

gb:ti60a08.x1 NCI_CGAP_Lym12 Homo sapien 
hemoglobin, alpha 1 

glyceraMehyde-3-phosphate dehydrogenase 
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AA938663 


Hs.199826 


ESTs 


317598 


AW206035 


Hs.192123 


ESTs 


317627 


A1346110 


Hs.132553 


ESTs 


317650 


A1733310 


Hs.127348 


ESTs 


317659 


AA961216 


Hs.127785 


ESTs 


317674 


AW294909 


H3.13220B 


ESTs 


317686 


AA969051 


Hs.187319 


ESTs 


317692 


AI307659 


Hs.174794 


ESTs 


317701 


AI674774 


Hs.128014 


ESTs 


317711 


AI733015 


Hs.272189 


ESTs 


317722 


A1733373 


Hs.128119 


ESTs 


317756 


AA973667 


Hs.128320 


ESTs 


317777 


AI143525 ' 


Hs.47313 


KIAA0258 gene product 


317799 


A1498273 


Ns.126808 


ESTs 


317803 


AA983251 


Hs.128899 


ESTs 


317821 


AJ368158 


Ks.70983 


PTPL1 -associated RhoGAP 1 


317848 


AI820575 


Hs.129086 


Homo sapiens cDNA RJ12007 fis, clone HE 


317850 


N29974 


Hs.152982 


hypothetical protein FU 131 17 


317861 


AW341064 


Hs.129119 


ESTs 


317865 


AI298794 


Hs.129130 


ESTs 


317869 


AW295184 


Hs.129142 


deoxyribonudease II beta 


317881 


AI827248 


Hs.224398 


Homo sapiens cDNA FU11469 fts, cforte HE 


317890 


A1915599 


Hs.129225 


ESTs 


317899 


AI952430 


Hs.150614 


ESTs, Weakly similar to ALU4J1UMAN ALU S 


317986 


A1005163 


Hs.201378 


ESTs, Weakly similar to T1 2545 hypolheti 


318001 


AW235697 


Hs.130980 


ESTs 


318016 


A1016694 


Hs.256921 


ESTs 


313023 


AW24305B 


Hs. 131 155 


ESTs 


318054 


AW449270 


Hs.232140 


ESTs 


318068 


AI024540 


Hs.131574 


ESTs 


318117 


AI208304 


Hs.250114 


ESTs 


318187 


AI792585 


Hs.133272 


ESTs. Weakly similar to ALUC.HUMAN 111! 


318223 


AI077540 


Hs.134090 


ESTs 


318240 


A1085377 


Hs.143610 


ESTs 


318255 


AI082692 


Hs.134662 


ESTs 


318266 


AI554341 


Hs.271443 


ESTs 


318330 


A1093840 


Hs.143758 


ESTs 


318369 


AI493501 


Hs.170974 


ESTs 


318428 


AI949409 


Hs.194591 


ESTs 


318458 


Al 149783 


Hs.158438 


ESTs 


31 8467 


AI151395 


Hs.144834 


ESTs 


318473 


AI939339 


Hs.146883 


ESTs 


313476 


AI693927 


Hs.265165 


ESTs 


318487 


AI167877 


Hs.1 43716 


ESTs 


318488 


AI217431 


Hs.144709 


ESTs 


318491 


T26477 


Hs.22883 


ESTs, Weakly similar to ALU8.HUMAN ALU S 


318499 


T25451 




gb:PTHH88 HTCDL1 Homo sapiens cDNA 573 


318537 


AA377908 


Hs.1 3254 


ESTs 


313538 


N28625 


Hs.74034 


Homo sapiens done 24651 mRNA sequence 


318547 


R20578 


Hs.90431 


ESTs 


318552 


R18364 


Hs.90363 


ESTs 


318575 


R55102 


Hs.107761 


ESTs, Weakly similar to unnamed protein 


318580 


T34571 


Hs.49007 


pory(A) polymerase alpha 


318587 


AA779704 


Hs.1 68830 


Homo sapiens cONA FU12136 fis, clone MA 


318596 


AI470235 


Hs.172698 


EST 


318622 


T48325 


Hs.237658 


apolipoprotelnA-ll 


318629 


N25163 


Hs.8861 


ESTs 


318637 


AA243539 


Hs.9196 


hypothetical protein 


318648 


T77141 


Hs.184411 


albumin 


318650 


AA393302 


Hs.176626 


hypothetical protein EDAG-1 


318671 


AA1 88823 


Hs.299254 


Homo sapiens cONA: FU23597 fls, clone L 


318679 


T5B115 


Hs.10336 


ESTs 


318711 


AI936475 


Hs.101282 


Homo sapiens cDNA: FU21238 fis, clone C 


318725 


AI962467 


Hs.242990 


ESTs 


318728 


Z30201 


Hs.291289 


ESTs, WeaMy similar to ALU1_HUMAN ALU S 


318740 


NM_002543 


Hs.77729 


oxidised tow density lipoprotein (lectin 


318776 


R24963 


Hs.23766 


ESTs 


318764 


H0014B 


Hs.5181 


proliferatlon-associated 2G4, 38kD 


318816 


F07873 


Hs.21273 


ESTs 


318865 


H10818 




gb:ym04f10.r1 Soares infant brain 1MB H 


318879 


R56332 


Hs.18268 


adenylate kinase 5 


318881 


243224 


Hs.1 24952 


ESTs 


318894 


F08138 


Hs.7387 


DKFZP564B1 16 protein 


318901 


AW368520 


Hs.301528 


L-kynurenlne/alpha-amlnoadipate aminotra 


318925 


243577 


Hs.21470 


ESTs 


316936 


AI219221 


Hs.308298 


ESTs 


318982 


Z44140 


Hs.269622 


ESTs 


318986 


Z44186 


Hs.169161 


ESTs, Highly similar to MAON_HUMAN NADP- 


319041 


Z44720 


Hs.98365 


ESTs, Weakly similar to weak similarity 


319103 


H05896 


Hs.4993 


K1AA131 3 protein 


319170 


R13678 


Hs.285305 


putative selenocysteine lyase 


319196 


F07953 


Hs.1 6085 


putative G-protein coupled receptor 


319199 


F07361 


Hs.1 3306 


ESTs 


319242 


F11472 


Hs.1 2839 


ESTS 



1.00 


243 


5.20 


11.95 


0.33 


1.56 


1.50 


1.39 


0.48 


146 


4.18 


7.14 


2.92 


3.20 


1.00 


1.01 


5.33 


9.59 


1.00 


1.00 


5.13 


7.81 


2.50 


6.03 


1.59 


1.30 


1.00 


2.48 


1.78 


2.11 


0.80 


1.06 


0.17 


0.68 


5.30 


8.16 


1.30 


2.28 


218 


5.93 


4.48 


8.20 


0.44 


0.99 


4.06 


2.23 


4.68 


7.48 


3.14 


3.37 


0.28 


1.66 


5.12 


9.97 


1.86 


4.50 


292 


5.22 


3.92 


6.37 


1.21 


1.27 


0.86 


1.17 


5.90 


6.98 


1.05 


0.90 


3.10 


2.40 


0.02 


1.05 


6.12 


10.55 


4.98 


7.90 


2.48 


5.62 


0.77 


0.45 


3.54 


4.92 


4.56 


5.62 


2.08 


4.05 


4.22 


8.07 


1.47 


1.05 


1.40 


4.14 


1.84 


1.90 


2.58 


5.20 


3.26 


4.18 


0.35 


1.07 


3.22 


4.60 


4.87 


9.06 


1.91 


1.98 


2.74 


6.22 


0.85 


2.46 


4.88 


4.93 


4.80 


12.51 


0.39 


1.04 


1.72 


3.57 


6.27 


9.91 


3.96 


8.84 


1.53 


0.81 


1.00 


2.19 


3.05 


3.18 


1.08 


2.46 


0.77 


1.33 


0.25 


1.49 


1.00 


3.01 


2.70 


3.86 


3.90 


7.13 


2.25 


3.56 


1.78 


5.00 


4.79 


14.13 


5.31 


7.00 


1.03 


0.91 


2.23 


3.80 


1.86 


7.16 


5.84 


9.79 


1.00 


1.00 


3.38 


6.11 


1.00 


1.07 


3.79 


5.03 


1.00 


298 


3.53 


5.66 


5.87 


7.26 
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319263 


T65331 


Hs.81360 


319267 


F11B02 


Hs.6818 


319270 


R13474 


Hs.290263 


319279 


T65094 


Hs.12677 


319282 


AA461358 


Hs.1 2876 


319289 


W07304 


Hs.79059 


319291 


W86578 


Hs.285243 


319293 


F12119 


Hs.12583 


319312 


Z45481 




319370 


H54254 


Hs.325823 


319391 


R06304 


Hs.13911 


319396 


H67130 


Hs.301743 


319398 


AA359754 


Hs.191196 


319407 


R05329 




319425 


T82930 




319433 


R06050 


Hs.191198 


319437 


AA282420 


Hs.1 11991 


319466 


AJ809937 


Hs.116417 


319471 


R06546 


Hs.19717 


319480 


R06933 


Hs.184221 


319484 


T91772 




319486 


AI382429 


Hs.250799 


319508 


T99398 


Hs.270104 


319523 


T69499 


Hs.191184 


319545 


R83716 


Ks.14355 


319546 


R09692 




319552 


M096106 


Hs.20403 




T829BB 


Hs.250154 


319586 


D78808 


Hs.283683 


319604 


R11679 


Hs.297753 


319609 


AW247514 


Hs.12293 


319611 


H14957 




319653 


M770183 


Hs.173515 


319657 


R19897 


Hs.106604 


319658 


R13432 


Hs. 167481 


319661 


H08035 


Hs.21398 


319662 


H06382 


Hs.21400 


319708 


R15372 


Hs.22664 


319742 


T776SJ 


Hs.21162 


319748 


R18178 


Hs.295866 


319772 


R76633 


Hs.22646 


319788 


AA321932 


Hs.11 741 4 


319805 


R92857 


Hs.271350 


319812 


N74880 


Hs.264330 


319834 


AA071267 




319878 


T78517 


Hs. 13941 


319862 


AA258981 


Hs.291392 


319912 


T77559 


Hs.94109 


319935 


H79460 


Hs.271722 


319944 


T79248 


Hs.133510 


319947 


AA160967 


Hs.14479 


319962 


H06350 


Hs.135056 


32Q007 


AA336314 




320018 


T83263 




320030 


H63789 


Hs.296268 


320032 


AI699772 


Hs.292664 


320040 


AA233671 


Hs.87164 


320047 


T86564 


Hs.302256 


320083 


AA074108 


Hs. 120844 


320096 


H58138 


Hs.1 17915 


320099 


AW411307 


Hs.114311 


320112 


T92107 


Hs.188489 


320140 


H94179 


Hs.119023 


320188 


AW419200 


Hs.1 7231 8 


320193 


AA831259 


Hs.17132 


320195 


R62203 


Hs.24321 


320199 


R78659 


Hs.29792 


320203 


AL049227 


Hs.1 24776 


320219 


AA327564 


Hs.1 27011 


320220 


AF054910 


Hs.1 271 11 


320225 


AF058989 


Hs.1 28231 


320231 


H03139 


Hs.24683 


320260 


NM_003608 


Hs.1 31 924 


320267 


AL049337 


Hs.132571 


320268 


K06019 


Hs.151293 


320322 


AF077374 


Hs.1 39322 


320325 


AI167978 


Hs.1 39851 


320330 


AF026004 


Hs.141660 


320339 


H10807 


Hs.281434 


320388 


H16065 


• Hs.31286 


320402 


R22291 


Hs.23366 


320413 


AA203711 


Hs.173269 


320432 


R62786 


Hs.124136 


320438 


AA253352 


Hs.293663 


320438 


W24548 


Hs.5669 



Homo sapiens cDNA; FU21927 fts, done H 

ESTs 

ESTs 

CGM47 protein 
ESTs 

transforming growth factor, beta recepto 
hypothetical protein FU22029 
ESTs 

gb:HSC2QE041 normalized infant brain cDN 

ESTs, Moderately similar to ALU5.HUMAN A 

ESTs 

ESTs 

ESTs 

gb:ye91b04.r1 Soares fetal liver spleen 
gb:yd39f07 s\ Soares fetal Dver spleen 
ESTs 

ESTs, Weakly similar to Y48A5A.1 [Celeg 

ESTs 

ESTs 

ESTs 

gbyd52a10.s1 Soares fetal Over spleen 
ESTs 

ESTs. Moderately similar to ALU8.HUMAN A 
ESTs 

Homo sapiens cDNA FU13207 fis, clone NT 
gb:yf23b1 2.r1 Soares fetal Over spleen 
ESTs 

hypothetical protein FU12973 
chromosome 8 open reacfing frame 4 
vimentin 

hypothetical protein FU21103 
gb:ym19c10.r1 Soares infant brain 1NIB H 
uncharacterized hypothalamus protein HT0 
ESTs 

syntrophin, gamma 1 

ESTs, Moderately similar to A Chain A, H 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp434N1923 <f 
ESTs 

KIAA1320 protein 

likely orthotog of mouse polydom 

N-acylsphingoslne amidohydrolase (acid c 

gbzm61g01 .rl Stratagene fibroblast {937 

ESTs 

ESTs 

Homo sapiens cDNA FU 13634 fis, clone PL 
ESTs, Weakly similar to ALU1.HUMAN ALU S 
ESTs 

Homo sapiens cDNA FU 14199 Ms, clone NT 
Human DNA sequence from clone RP5-85QE9 
gb:EST40943 Endometrial tumor Homo saple 
gb:yd40h09.r1 Soares fetal liver spleen 
ESTs, WeaMy similar to KIAA0638 protein 
ESTs, Weakly similar to A46010 X-linked 
hypothetical protein FU14001 
EST 

FOXJ2forkhead factor 
ESTs 

CDC45 (ceil division cycle 45, S.cerevis 
ESTs 

SMC2 (structural maintenance of chromoso 

ESTs 

ESTs 

Homo sapiens cDNA FU 12028 fis, clone HE 
ESTs 

Homo sapiens mRNA; cDNA OKFZpS64N1 1 1 6 (f 
tubuldnterstitial nephritis antigen 
tektin 2 <tesUcular) 

G antigen, family B, 1 (prostate associa 
ESTs 

G protein-coupled receptor 65 

Homo sapiens mRNA; cONA DKFZp564P016 (fr 

Homo sapiens cDNA FU 10664 fis, clone NT 

small protlne-rich protein 3 

caveolbi 2 

chloride channel 2 

Homo sapiens cDNA FU14028 fis, done HE 
ESTs 

Homo sapiens clone FLC0578 PR02852 mRNA, 

ESTs 

ESTs 

ESTs 

ESTs 



1.81 


1.57 
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4.72 


4.80 


10.40 


1.50 


2.11 


1.00 


1.00 


0.18 


0.68 


0.26 


0.62 


3.13 


4.50 


1.10 


1.00 


0.16 


0.73 


1.26 


2.43 


0.70 


0.76 


2.45 


3.59 


2.00 


3.54 


4.28 


8.81 


6.15 


14.13 


3.26 


5.68 


1.76 


5.65 


4.29 


4.84 


1.00 


1.00 


2.81 


4.88 


2.08 


2.82 


2.80 


4.39 


1.55 


3.25 


1.65 


1.19 


5.11 


8.54 


1.89 


3.36 


3.48 


4.82 


0.26 


0.82 


1.68 


3.41 


3.06 


4.24 


2.76 


4.24 


251 


3.55 


5.32 


7.68 


3.35 


5.00 


5.18 


12.55 


1.58 


1.56 


1.00 


1.22 


2.48 


3.13 


3.02 


4.85 


4.36 


11,61 


2.56 


3.68 


4.63 


6.56 


0.63 


1.32 


0.30 


0.94 


3.99 ' 


6.44 


5.09 


7.36 


3.24 


3.21 


4.40 


9.42 


3.31 


5.39 


2.90 


4.95 


1.81 


1.57 


3.42 


6.29 


2.77 


5.14 


4.10 


6.69 


3.27 


3.27 


1.81 


1.64 


3.38 


7.36 


5.90 


16.73 


2.08 


4.47 


1.00 


1.00 


2.27 


2.06 


1.00 


1.00 


1.26 


1.00 


2.58 


6.23 


2.85 


4.53 


0.40 


0.94 


0.84 


1.18 


1.00 


1.17 


0.18 


1.09 


5.26 


13.75 


1.59 


1.93 


1.38 


4.56 


1.00 


1.92 


6.56 


5.70 


1.41 


1.01 


0.05 


0.67 


2.17 


1.26 


1.81 


2.32 


1.00 


3.22 


1.41 


1.36 


2.31 


3.61 


11.25 


20.78 


2.22 


3.49 


3.53 


8.14 
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320448 


AI240233 


Hs.80887 


v-yes-1 Yamaguchi sarcoma viral related 


1.42 


3.46 


320451 


R26944 


Hs.1 80777 


Homo sapiens mRNA; cDNA DKFZp564M0264 (f 


0.87 


0.81 


320484 


AA094436 


Hs.296267 


folllstatin-like 1 


0.65 


1.18 


320499 


R32555 


Hs.24321 


Homo sapiens cDNA FU12028 lis, clone HE 


3.44 


7.15 


320514 


AB007978 


Hs.158278 


KIAAQ509 protein 
hypothetical protein FU20171 


6.44 


13.62 


320521 


N31464 


Hs.24743 


1.48 


1.04 


320526 


AW374205 


Hs.111314 


ESTs 


3.66 


7.87 


320527 


R34672 


Hs.324522 


ESTs 


3.16 


5.63 


320536 


AA331732 


Hs.1 37224 


ESTs 


2.83 


5.83 


320556 


AF054177 


Hs.14570 


hypothetical protein FU22530 


1.28 


1.00 


320564 


AF056209 


Hs.159396 


psptidyiglycine alpha-amldaling monooxyg 


1.22 


0.81 


3205B7 


244524 


Hs.167456 


Homo sapiens mRNA full length Insert cDN 


1.84 


2.44 


320635 


R54159 


Hs.80508 


small nuclear ribonucleoprotetn pdypept 


1.00 


6.25 


320639 


AA243258 


Hs.7395 


hypothetical protein FU23182 


260 


2.30 


320648 


N48521 


Hs.26549 


Homo sapiens mRNA for KIAA1708 protein, 


1.00 


1.53 


320651 


AA489268 


Hs.111334 


ferritin, light polypeptide 


0.14 


0.79 


320664 


AS904216 


Hs.91251 


hypothetical protein FU1 1 198 


5.02 


8.84 


320676 


AA132650 


Hs.300511 


ESTs 


3.63 


5.37 


320683 


R59291 


Hs.26638 


ESTs, Weakly similar to unnamed protein 


0.37 


1.31 


320689 


AA334609 


Hs.171929 


ESTs, Weakly similar to A54849 collagen 


1.27 


1.02 


320696 


AW135016 


Hs.172780 


ESTs 


3.53 


4.60 


320714 


AI445591 




gb:yq04a1 0.r1 Soares fetal fiver spleen 


1.06 


0.85 


320727 


U96044 


Hs.181125 


immunoglobulin lambda locus 


1.35 


1.49 


320771 


AI793266 


Hs.117176 


poly{A)-binding protein, nuclear 1 


0.04 


0.82 


320794 


AA281993 


Hs.91226 


ESTs 


Z96 


4.33 


320822 


AF100780 


Ha.194679 


WNT1 Inducible signaling pathway protein 


0.10 


0.79 


320824 


AF120274 


Ks.194689 


artemtn 


1.16 


1.11 


320830 


AJ132445 


Hs.266416 


claudin 14 


1.06 


1.75 


320843 


AA317372 


Hs.34744 


Homo sapiens mRNA; cDNA DKF2p547C136 (fr 


1.36 


1.47 


320849 


D60031 


Hs.34771 


ESTs 


5.30 


7.49 


320853 


AI473796 


Hs.135904 


ESTs 


1.00 


1.00 


320896 


AB002155 


Hs.271580 


uroplakin IB 


5.90 


2.55 


320921 


R94038 ' 


Hs.199538 


Inhibln, beta C 


Z20 


1.17 


320927 


AI2057B6 


Hs.213923 


ESTs 


0.18 


1.46 


320957 


AI878933 


Hs.92023 


core histone macroH2A22 


1.67 


218 


320997 


H22544 




gb:yn69f11.rl Soares adult brain N2b5HB5 


3.26 


3.62 


321045 


W88483 


Hs.293650 


ESTs 


2.25 


4.55 


321046 


H27794 


Hs.269055 


ESTs 


2.69 


4.25 


321052 


AW372884 


Hs.240770 


nuclear cap binding protein subunit 2, 2 


114 


256 


321059 


A1092824 


Hs.126465 


ESTs 


1.69 


0.53 


321062 


R87955 


Hs.241411 


Homo sapiens mRNA full length insert cDN 


2.76 


5.20 


321067 


AF131782 


Hs.241438 


Homo sapiens clone 24941 mRNA sequence 


4.79 


7.41 


321102 


AAO18306 




gb:ze40d08.r1 Soares retina N2b4HR Homo 


1.79 


4.27 


321130 


H43750 


Hs.1 25494 


ESTs 


1.00 


3.14 


321142 


AI817933 


Hs.298351 


ASPL protein 


8.73 


15.36 


321155 


AA336635 


Hs.99598 


hypothetical protein MGC5338 


3.04 


5.03 


321158 


AA700289 




gb:yu76f1 1.r1 Soares fetal Bver spleen 


4.62 


8.39 


321170 


N53742 


Hs.172982 


ESTs 


2.21 


4.46 


321199 


AW385512 




gb:yy56d10.s1 Soares_multiple_scteros!s_ 


5.69 


8.01 


321206 


H54178 


Hs.226469 


Homo sapiens cDNA FU 1 241 7 fis, clone MA 


4.00 


7.32 


321225 


AL080073 


Hs.251414 


Homo sapiens mRNA; cDNA DKFZp564B1462 (f 


4.17 


4.63 


321236 


AW371941 


Hs.18192 


Ser/Arg-related nuclear matrix protein ( 


1.00 


1.00 


321244 


AF058654 
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334094 1.30 «-03 

334113 4.55 £63 

334161 ?> 82 ir 



0.47 0.76 



334187 1-36 3.70 

334219 0- 69 104 



1.88 1.70 



334223 4.72 114 

334239 



0.79 0.62 
0.45 1.10 
1.00 3.56 



334378 3.98 5.76 

334382 1-50 1.31 



3.59 4.75 



334562 5.94 15.40 

334588 8 - 14 19 - 53 



1.55 1.56 



334633 5,18 8, 0 7 

334648 



0.59 2.13 
3.70 7.15 
8.13 10.60 



334787 
334866 

334891 °-32 1.14 

334933 1.00 3.84 

334945 1.04 2.96 

334967 °- 29 I' 14 

334990 1.50 1.39 



5.88 18.65 
0.55 1.75 
4.31 8.01 



335015 
335093 

335lls 0.38 1.97 

335179 1 - 24 1- 98 

33 5188 0.46 1.47 

335211 IS I'S 

335288 073 0.97 

335289 O- 20 026 
335361 2 18 158 
335379 O- 60 071 
335414 3.64 14.94 
335416 2.93 3.98 

335496 096 091 

335497 1.71 1.92 

335548 VI 5fL 

335551 3 22 10 - 54 

335558 3.42 4.89 

335586 5.50 12.75 

335619 2 - 99 307 

335620 3 80 8 29 



0.2B 0.57 
0.46 1.17 
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335686 


Z55 


3.81 


335755 


2.24 


1.07 


335784 


0.20 


0.97 


335814 


1.13 


1.48 


335815 


2.45 


3.51 


335823 


1.00 


4.16 


335835 


0.49 


1.70 


335851 


1.66 


1.39 


335868 


Z98 


6.43 


336896 


0.98 


0.99 


335936 


12.10 


21.93 


335948 


1.00 


1.64 


335983 


1.00 


4.21 


335995 


0.37 


1.17 


336021 


1.04 


0.84 


336034 


11.40 


23.54 


336038 


1.19 


1.21 


336066 


0.54 


1.63 


336107 


0.95 


0.70 


336205 


3.13 


6.29 


336275 


3.20 


10.10 


336292 


Z34 


3.09 


336331 


1.00 


1.00 


336419 


0.65 


0.79 


336632 


2.33 


216 


336633 


2.55 


223 


336634 


2.19 


203 


336635 


2.69 


248 


336638 


2.13 


1.83 


336637 


2.43 


2.24 


336638 


2.31 


2.03 


336659 


0.60 


1.31 


336675 


0.31 


1.18 


336684 


1.50 


1.14 


336694 


4.74 


7.10 


336716 


4.43 


6.37 


336721 


2.20 


0.74 


336798 
336900 


1,64 


2.14 


6.14 


12.73 


336948 


1.00 


1.00 


337028 


1.30 


2.09 


337043 


4.01 


11.53 


337046 


1.67 


1.84 


337054 


2.78 


7.35 


337128 


7.20 


16.14 


337162 


3.45 


5.34 


337183 


5.72 


11.41 


337184 


3.72 


5.90 


337192 


1.27 


1.06 


337194 


1.88 


1.68 




0.22 


1.03 


337268 


1.00 


3.31 


337299 


3.23 


5.14 


337325 


2.76 


3.72 


337369 


5.80 


10.42 


337493 


Z06 


6.30 


337497 


7.88 


20.29 


337500 


3.80 


4.48 


337549 


1.66 


2.31 




1.27 


8.54 




5.76 


7.16 


337671 


0.73 


0.97 


337755 


1.54 


0.92 


337786 


5.07 


9.73 


337809 


6.18 


12.87 


337862 


3.78 


12.97 


337871 


2.66 


8.16 


337958 
338008 


0.26 


1.34 


1,48 


1.12 


338033 


2.38 


14.59 


338083 


0.65 


2.16 


338110 


1.00 


1.61 


338112 


5.86 


8.25 


338145 


1.70 


1.97 


338148 


8.07 


18.19 


338158 


1.30 


4.55 


338161 


2.58 


3.57 


338179 


1.00 


1.00 


338182 


3.32 


4.63 


338169 


1.00 


3.34 


338197 


0.99 


1.69 


338199 


4.58 


7.62 


338215 


6.01 


15.85 


338279 


0.53 


0.95 


338316 


20.58 


38.66 
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338322 
338357 
338359 
338366 
338374 
338414 
338418 
338469 
338501 
338506 
338523 
338549 
338561 
338662 
338671 
338576 
338726 
338779 
338804 
338836 
338871 
338872 
338879 
33B937 
338966 
338993 
339047 
339100 
339114 
339121 
339170 
339229 
339264 
339293 



3.23 

4.10 

10.12 

0.69 

0.40 

0.47 

6.12 

3.09 

6.28 

6.97 

3.10 

1.70 

0.79 

1.72 

0.17 

2.10 

1.20 

0.12 

0.99 

1.00 

4.30 

5.02 

0.23 

6.55 

1.76 

1.00 

5.26 

5.10 

1.00 

1.00 

10.36 

4.08 

Z64 

1.73 



7.39 

11.39 

21.59 

1.02 

1.18 

1.06 

13.86 

5.11 

10.32 

12.41 

5.84 

2.70 

0.81 

1.46 

0.91 

15.86 

1.09 

0.57 

1.67 

1.00 

9.81 

12.81 

1.12 

12.26 

5.42 

2.40 

10.81 

6.88 

1.70 

3.75 

19.67 

13.48 

3.83 

1.94 



TABLE 88 shows the accession numbers for those Pkeys In Table 8A lacking unlgenel Vs. For each probeset we have listed ihe gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Ctuslering and Alignment Tools (DoubleTwIst. Oakland California), The Genbank accession numbers for sequences comprising each cluster are listed in the 



322044 


187363.1 


AW340926 AA249063 N86075 


322060 


44320 1 


AI341937AW003063 U34725 M904742 


321430 


42705J 


X57414X57415 


321467 


43034.1 


X13075X13076 


322125 


46779.1 


R93901 AF075073R93902 


322166 


46861.1 


H69434AF085958H69846 


322173 


46873.1 


H52567 H52557 AF085970 H52164 


322178 


46882.1 


H56535AF085980 H56712 


322179 


46885.1 


H92891 AF085982H92777 


321577 


1615102J 


H84849 H84252 H84260 H86664 H85320 


321587 


1615333.1 


H95531 H95521 H84529 


313723 


111953.1 


AA07041 2 AA102346 AA081 885 


320997 


627492J 


H22544 H46842AI204929 


322278 


47271 1 


W69304 AF0862B3 W69200 


321687 


218439.1 


AA625149 AA3 1 3030 AA3 13052 H97463 


313883 


129439.1 


AA665089 AA135130 AA484059 AA10241 9 AW877765 


322320 


47422.1 


W79150AF086419 


322339 


814684.1 


A1668646A1734214W17348 


314648 


293660.1 


AW979268 AA878419 AA431342 AA431628 


300201 


682222.1 


A1308300A1308296 


306897 


25196.-2 


AJ093967 


323155 


979809.1 


AL120701 AL135041AL121824 


322527 


38927.1 


AF147359T58511T58560 


322585 


473768.2 


W88919 W89125 


300362 


1574395.1 


242308 H23514 


322635 


82296.1 


AA005129 AA679084 AA694399 


322664 


85042.1 


AA011522AA702841 AA011691 AA330797 


315454 


380580.1 


AI239464 AI239473 AA62581 2 AI208703 


322687 


37372.1 


AF074666 AM 10759 AF090902 


314852 


327472.1 


A1903735 AA491283 AI694953 AW976903 AA761362 


307783 


697809.1 


AI347274AW844024 


324072 


269032J 


AA381 722 AA381829 AW9B3906 AW963902 AA381242 


300527 


221345.1 


AA488472 W27363 AA317053 BE082689 AW967036 BE079872 


323505 


196389.1 


AW970512 AA280251 A16522B7 BE466438 AI650725 AA551854AA281574 AW571481 


315791 


403558J 


AA678177AA677034 


324303 


233842.1 


AL118754AA333202 H38001 


316519 


442885J 


AAB47835AA768376 


300926 


333127.1 


AA5O486OAA504911 



"Accession* column. 



Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT number Accessions 
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324580 
301882 
324804 
324889 
302697 
302711 
302742 
318499 
310824 
302847 
304122 
303598 
311409 
312094 
319312 
319407 
319425 
320007 
320018 
319484 
318865 
312220 
319546 
312389 
319611 
312437 



311896 
319834 
321102 
321158 
321199 

305528 
321270 
314126 
320714 
306442 
306446 
306458 
306510 
306557 
306572 
306582 
306656 
306686 
306751 
308011 
3Q6892 
308106 
308154 
306956 
306958 
308213 
308216 
308219 



308643 
308673 
308697 
308778 
308808 
308875 
308886 
308898 
308966 
308979 
303011 
303077 
305016 
305034 
305072 
305148 
305190 
303978 
303990 



305235 
305312 
305413 
305447 
321244 
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275087.1 
398093J 
1515978J 
43219J 
45419J 



364430J 

34624.4 

458.105 

77271.-5 

270283J 

837264J 

797889.1 

1540116.1 

1688823.1 

16B9571.1 

229683.1 

1815987J 

1691 553 J 

1535937.1 

1671607J 

24330SJ 

902067.1 

1566863.1 

291472.1 



57919^1 

112523J 

80531.1 

410938.1 

212379.1 

28832L-3 

1662057.1 

177666.1 

743644.1 

AA976899 

M977348 

AA978186 

AA988546 

AA994530 



T78054 T79888 AA398185 
AI692552 AI393343 AI800510 AI37771 1 F24263 AA661876 
D31010 D30991 031168 D31166 D31465 
AJ001409AJ001410 
L08442 D51348 
L12061 

T25451 AA585296 AA585305 

U88896 U88898 AA916056 T03285 AI341594 AI359534 AI634031 U88897 

X98941 X98942 X98943 X98953 X98949 

H28966 

AA382814 AA402411 AA412355 

AI698839AI909260AI909259 

Z78390T97427 

Z45481 F12393 T74437 

R05329R01555 R08276 

T82930R02424T85145 

AA336314 T82938 AA327744 AW9673SB AA639967 T10753 

T83263 T85731 T85730 

T91772 R07257 R07098 

H10818F07831Z43072 

N74613 T98756T98589 

R09692 R09414AA346353 

A1863140W80703R43474 

H14957R56522R11908 

BE0801 80 AW827313 AW231970 AA995028 AA428584 AW87271 6 AW892508 AW854593 AA578441 AW975234 AA664937 AA9841 31 
AA528743 AA552874 AA564758 AW063245 AI267534 AW0701 90 AW893483 AA770330 AA906928 AA906582 AA758746 AA551717 
AW063311AA429538 

AW206447 A1248530 AI084433 AI400976 R16553 
M071267 T65940 T64515 AA071334 
AA018306H38925AA001221 
H79670H47798AA700289 

N34524 AA305071 AW964803 AA502335 AI433430 AI203597 AW026670 AW265323 AW850787 AA31 7554 AW993643 AW835572 

AW385512 A1334966 W32951 H62656 H53902 R88904 AW835732 

AA769156 

N59537 N78278 R83560 
AA226431 AA226569 AA488748 
R91883AI445591 



AA996248 

AJ004024 

AI015615 

AI032589 

AI439473 

A1092465 

AI476803 

A!500600 

A1125111 

AI125152 

AI557041 

AI557135 

AI557246 

AI718299 

A1719893 

AI745040 

AI760864 

A1767143 

AI811109 

AI818289 

AI832332 

A1833240 

AI858845 

A1870704 

A1873111 

41689.1 

4406OJ 

AA626876 

AA630128 

AA641012 

AA654070 

AA665955 

AW513315 

AW515465 

AW516449 

AW516611 

AA670480 

AA700201 

AA724659 

AA737856 

29327J 



AF090405 AF090407 AF090406 
AF163305 AF163307 AF163303 



AF088654 AF068656 AF068655 
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305614 


AA782866 


30S637 


AA806124 


305639 


AA806138 


305650 


AA807709 


305690 


AA813477 


305728 


AA828209 


305759 


AA835353 


305792 


AA845256 


307041 


A1144243 


307091 


AI167439 


307181 


AM 69251 


305901 


AA872968 


305910 


AA875981 


307415 


A1242118 


307426 


AI243364 


307517 


A1275055 


307551 


A1281556 


307581 


A1282207 


307608 


A1290295 


307691 


AI318285 ' 


307730 


A1336092 


307760 


Ai342387 


307764 


A1342731 


307796 


AI350556 


309045 


AI910902 


309051 


AI911975 


307807 


A1351799 


307808 


AI351826 


307820 


AI355761 


307852 


AI365541 


309122 


AI928178 


309164 


AI937761 


309177 


AI951118 


307902 


AI380462 


309299 


AW003478 


309303 


AW004823 


309476 


AW129368 


309532 


AW151119 


309747 


AW2648B9 


309769 


AW272346 


309799 


AW276964 


309866 


AW299916 


302679 


311853J H65022AA186889 


309923 


AW340684 


309928 


AW341418 


309931 


AW341683 


309933 


AW341936 


302705 


31765J U09060 U09061 


302789 


34161J AJ245067AJ245070 


304006 


AW517947 


304024 


T03036 


304026 


T03160 


304028 


T03266 


304046 


T54803 


304061 


T61521 


304063 


T62536 


302802 


34487 J Y0B250Y08245 


304114 


R78946 


304155 


H68696 


304203 


N56929 


304234 


W81608 


304348 


AA179868 


304430 


AA347682 


304456 


AA411240 


304521 


AA464716 


304526 


AA476427 


304607 


AA513322 


304735 


AA576453 


304760 


AA580401 


306015 


AA897116 


306063 


AA906316 


306065 


AA908725 


306104 


AA910956 


306109 


AA911861 


306242 


AA932805 


306288 


AA936900 


306398 


AA970223 


330568 


NOTJ=OUND_entrez U56244 


330599 


15323.-12 U90437 


331131 


genbanK_R54797 R54797 


331203 


NOT_FOUND.entrez T82310 


331531 


genbank_N51343 N51343 


331547 


467396.1 AA826597 N54811 


332074 


genbanK_AA5990l2 AA599012 



PCT/US02/12476 
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TABLE 8C shows the genomic position for (hose Pteys in Table 8A lacking unigene ID'S and accession numbers. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et a)." refers to the publication entitled The DMA 

sequence of human chromosome 22/ Dunham L et al., Nature {1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkev 
* 


Ref Strand 


NL_posJ(ion 




332792 


Dunham, I. etal. 


Plus 


73381-73768 


332816 


Dunham, I. et.al. 


Plus 


359844*360030 


332906 


Dunham, I. etaL 


Plus 


1923101-1923205 


332911 


Dunham, (. et.al. 


Plus 


1961767-1961856 


332912 


Dunham, 1. etal. 


Plus 


1962120-1962246 


332922 


Dunham, 1. etaL 


Plus 


2009620-2009738 


332956 


Dunham, 1. et.al. 


Plus 


2510528-2510558 


332959 


Dunham, 1. etal. 


Plus 


2518145-2518213 


333138 


Dunham, ). etal. 


Plus 


3369205-3369323 


333139 


Dunham, 1. etal. 


Plus 


3369495-3369571 


333221 


Dunham, ). etal. 


' Plus 


3978070-3978187 


333380 


Dunham, 1. etal. 


Plus 


4904775-4904846 


333387 


Dunham, 1. etal. 


Plus 


49109354910997 


333512 


Dunham, 1. etaL 


Plus 


5560510-5560664 


333524 


Dunham 1. eta). 


Plus 


5612620-5612780 


333585 


Dunham 1 at al 


■Plus 


6234778-6234894 


333618 


Dunham ) At fil 


Pius 


6562391-5562566 


333627 


Dunham t fit fil 
vuwiaiii, ■■ cum. 


Plus 


662059441620903 


333628 


Dunham t fit al 

UUilllaJU, I. Ol.dl. 


Plus 


66290044)629233 

ulKvu UfUU 


3316.-50 


Dunham 1 of al 


Plus 


679fifl , i n -fi79712fi 


333678 


Dunham 1 at al 
UUIwiaill, I. Cl.oJ. 


Plus 


f UOOfcur f U UUcuU 


JJO/OU 


Dunham, t. et.al. 


rlUS 


7fiflfi1 R«L7finfl9T4 
/QUO 103*1 QUOcu-t 




Dunham, t. etal. 


Phi* 
rlUS 


f 09449 Wu9a0JU 




Dunham 1 at al 

uunnam, i. Di.ai. 


Plim 
rlUo 


7RQ440T.7fi 

1 O 9*H» VI f - 1 O 9-t0£d 


JO J/ DO 


f"tivnh-im 1 a4 al 
UUnndJll, I. ol.aJ. 


Pine 


7RQR44/L7PQ < \RQ7 
/09^*WW 03303/ 


043709 


Dunham, 1. eLai. 


Pins 
riua 


7RQfifl9R.7AQR7rf7 
/ 090040* f OaOf U/ 


WX770 
<X30f ft 


uunnam, i. etai. 


Ptiie 
rlU5 


77nfi77*L77nfiQn9 

r fuof f o- » (uoauz 


000/ If 


Hiinnflfn k at at 

uunnam, i. m.ai. 


Plus 


774fiflnR-774fiQ1 fi 


333846 


Dunham t at at 
UUiinam, I. ULdJ. 


Pius 


QUUOOurQUUui Jl 


333884 


Dunham I at al 


Plus 


O l«M90w*0 1 1 0 1 


333887 


Dunham 1 at al 


Plus 


fl1R4flR?.fi1 ( i5nS"i 


333891 


Dunham 1 at al 

uunnam, i. audi. 


Plus 


R1HfiaT7-fi1-iR7nq 

U 1 OU1UI"0 1 OV 1 \J*J 


333892 


Dunham t fit fit 
uuiiiiaiii, i. ouaJ. 


Plus 


81568254)157001 


3HQ4R 


Dunham 1 at fit 
UUllllalll, I. Dual. 


Pius 


658349741583677 


333954 


nitnham t at al 

uunnam, i. ouai. 


Pius 


RRR3 1 RR-fiRR3'?3fi 


333966 


nnnham 1 At al 


Plus 


86556434)655826 


333968 


ntinhiim 1 fit at 


Plus 


86810044)681241 


334061 


Dunham 1 At si 


Pius 


9686941-9687077 


334094 


Dnnhfim 1 At a) 


Plus 


9889953-9890105 


334113 


Dnnhfim 1 At fit 


Plus 


10282459-1 0282597 


334161 


Dunham, 1. etal. 


Pius 


10599033-10599180 


334219 


Dunham, 1. etal. 


Plus 


12716160-12716384 


334239 


Dunham, ). etal, 


Plus 


13056569-13056693 


334333 


Dunham, 1. etal. 


Plus 


13603544-13603657 


334378 


Dunham 1 At al 


Plus 


13907239-13907370 


334382 


Dnnhfim 1 Atfll 


Plus 


13915866-13916036 


www 


Dunham 1 at al 


Plus 


14Qfl7R47.14Q.ft7q40 

l*t9U# 0**/" l**90f 9*tU 


334588 


Dunham t fit fit 


Plus 


1 5032740-1 503281 7 


334616 


Drtnhnm 1 at al 
1/UiHlalfl, r. audi. 


Pius 


15176123.15176470 

Ivifu 4 lJ" Iwlr 11*11 U 


334633 


Dunham 1 fit al 
uunndm, i. ouai. 


Plus 


1 5333206.1 53333G5 


334866 


Dunham 1 at at 

uunnam, i. yutu. 


Pius 


1RR72214.1R872317 


334891 


Dunham 1 at fil 
uunnam, i. oloi. 




1Q29Q770.1 92QQQ44 


334934 


Dunham 1 At fit 
uuiiiiaiii, I. auaJ. 


Plus 


201 03970-20 1 0405 8 


335015 


Dunham, 1. eta). 


.Plus 


20682792-20682945 


335120 


Dunham, (. etal. 


Plus 


21436286-21436384 


335125 


Dunham, 1. etal. 


Plus 


21441390-21441471 


335179 


Dunham, 1. eta). 


Plus 


21634405-21634526 


335186 


Dunham, 1. etai. 


Plus 


21669118-21669326 


335211 


Dunham, 1. etal. 


Plus 


21774611-21774680 


335361 


Dunham, 1. etal. 


Plus 


22807292-22807445 


335379 


Dunham, 1. et.al. 


Plus 


22899306-22899420 


335414 


Dunham, 1. etal. 


Plus 


23235546-23235684 


335416 


Dunham, 1. eta). 


Plus 


23237354-23237465 


335496 


Dunham, 1. etaL 


Plus 


24164386-24164545 


335497 


Dunham, L etal. 


Plus 


24167666-24167869 


335558 


Dunham, 1. eta). 


Plus 


24740167-24740347 


335586 


Dunham, 1. etal. 


Plus 


24990333-24990497 


335686 


Dunham, 1. etal. 


Plus 


25439839-25439920 


335784 


Dunham, 1. etal. 


Plus 


25942710-25942792 


335823 


Dunham, 1. etal. 


Plus 


26365925-26366004 


335983 


Dunham, 1. etal. 


Plus 


27938968-27939070 


335995 


Dunham, I. etal. 


Phis 


28009044-28009184 


336021 


Dunham, 1. etal. 


Plus 


28686482-28686559 
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336034 


Dunham, I. etat 


Plus 


29014404-29014590 


336038 


Dunham, ). eta). 


Plus 


29022963-29023165 


336107 


Dunham, 


.eta). 


Plus 


29987731-29987869 


336632 


Dunham, 


.etal. 


Plus ' 


983890-985529 


336633 


Dunham, 


.eta!. 


Plus 


985591-986221 


336634 


Dunham, 


.eta). 


Pius 


986296-986670 


336635 


Dunham, 


.eta). 


Plus 


987908-988364 


336636 • 


Dunham, 


.eta). 


Plus 


988418-989185 


336637 


Dunham, 


.eta!. 


Plus 


989276-990813 


336638 


Dunham, 


.etal. 


Plus 


991906-993240 


336659 


Dunham, 


.eta). 


Plus 


1896402-1896478 


336694 


Dunham, 


.eta). 


Plus 


2420546-2420616 


336721 


Dunham, 


.eta). 


Plus 


3371522-3371586 


336900 


Dunham, 


.eta). 


Rus 


10236423-10236523 


336948 


Dunham, 


.eta). 


Pius 


12692290-12692381 


337028 


Dunham, 


.eta). 


Plus 


16644817-16644942 


337054 


Dunham, 


.eta). 


Plus 


17821742-17821922 


337162 


Dunham, 


.eta). 


Plus 


23478943-23479145 


337183 


Dunham, I 


.eta!. 


Plus 


23943606-23943696 


337184 


Dunham, 


.eta!. 


Plus 


23973949-23974016 


337268 


Dunham, 


.eta). 


Plus 


28011979-28012034 


337299 


Dunham, 


.eta). 


Plus 


29022656-29022775 


337389 


Dunham, 


.eta). 


Plus 


31401509-31401579 


337493 


Dunham, 


.eta). 


Plus 


33330760-33330981 


337549 


Dunham, 


.etal. 


Plus 


34474472-34474531 


337755 


Dunham, 


.etal. 


Plus 


3971764-3971900 


337809 


Dunham, 


.eta). 


Plus 


4449069-4449193 


337871 


Dunham, 


.eta). 


Plus 


5443027-5443101 


337958 


Dunham, 


.eta). 


Plus 


6969162-6969270 


338008 


Dunham, 


.eta). 


Plus 


7697068-7697236 


338033 


Dunham 


. eta). 


Plus 


6092128-8092271 


338110 


Dunham, 


.eta). 


Plus 


10384481-10384621 


338112 


Dunham. 


.eta). 


Plus 


10391398-10391600 


338145 


Dunham 

L/UJ U ICU II, 


.eta). 


Rus 


11386629-11386692 


338148 


Dunham, 


.etal. 


Rus 


11448985-11449085 


338179 


Dunham, 


.eta!, 


Plus 


12B08775-12808833 


338197 


Dunham, 


.etal. 


Plus 


13638107-13638181 


338279 


Dunham, 


.eta). 


Plus 


16168944-16169091 


338316 


Dunham, 


.etal. 


Plus 


17089711-17089988 


336322 


Dunham, 


.etal. 


Plus 


17132477-17132547 


338357 


Dunham, 1 


.etal. 


Plus 


18062184-18062402 


338359 


Dunham, 


.eta). 


Plus 


18074402-18074501 


338366 


Dunham, 


.eta). 


Plus 


18262026-18252189 


338374 


Dunham, 1 


. eta). 


Plus 


18371200-18371282 


338414 


Dunham, 1 


. etal. 


Plus 


19345573-19345660 


338418 


Dunham 1 

UUi 11 IOI 1 T, 1 


.eta). 


Plus 


19435506-19435596 


338501 


Dunham 1 


.etal. 


Plus 


21244713-21244828 


338506 


Dunham 1 

Uul JJ IOJII, 1 


. eta). 


Plus 


21221871-21221953 


338523 


Dunham 1 

b/UI IJIOJ II, 1 


.etal. 


Plus 


21509763-21509B64 


338662 


Dunham 1 


.eta). 


Plus 


24404720-24404899 


338804 


Dunham 


. eta). 


Plus . 


27236005-27236108 


338836 


Dunham 1 

UUI1I10III, 1 


. eta). 


Plus 


27792166-27792272 


338879 


Dunham 1 


.eta). 


Plus 


28410653-28410734 


338937 


Dunham 1 

UUIIIIOIII, 1 


.etal. 


Plus 


29160655-29160725 


338993 


Dunham 1 


. eta). 


Plus 


30077787-30078184 


339047 


Dunham, 1 


. eta). 


Rus 


30760793-30760968 


339100 


Dunham, 1 


. eta). 


Plus 


31141580-31141765 


339114 


Dunham, i 


. etal. 


Ptus 


31456454-31456519 


339121 


Dunham, 1 


.eta). 


Rus 


31583467-31583536 


339170 


Dunham 1 

UUIJJ IOJII, 1 


.eta). 


Plus 


32216399-32216527 


339293 


Dunham 1 

WIIIIOJII, 1 


. etal. 


Plus 


33223671-33223819 


332858 


Dunham 1 

WUIIIIdll, 


.etal, 


Minus 


1339607-1339397 


332982 


Dunham 1 

UUIIJ ICUJI, 1 


.eta). 


Minus 


2628296-2628109 


3329B4 


Dunham, 1 


.eta). 


Minus 


2632606-2632457 


332998 


Dunham, 1 


. etal. 


Minus 


2711704-2711565 


333058 


Dunham, 1 


.eta). 


Minus 


3028925-3028811 


333097 


Dunham, 1 


.eta). 


Minus 


3204124-3204036 


333121 


Dunham, 1 


.eta). 


Minus 


3308446-3308358 


333122 


Dunham, 1 


.etal. 


Minus 


3309596-3309531 


333123 


Dunham, 1 


etal. 


Minus 


3310817-3310749 


333140 


Dunham, 1 


.eta). 


Minus 


3377220-3376309 


333260 


Dunham, 1 


. eta). 


Minus 


4308400-4308304 


333603 


Dunham, 1 


.etal. 


Minus 


6466335-6465727 


333604 


Dunham, 1 


etal. 


Minus 


6467090-6466768 


333904 


Dunham, 1 


etal. 


iVEnus 


8217374-8217261 


333906 


Dunham, 1 


eta). 


Minus 


8218238-8218063 


334183 


Dunham, 1 


etal. 


Minus 


11832582-11832508 


334187 


Dunham, 1 


etal. 


Minus 


1 1 921400- 1 IsZllUO 


334222 


Dunham, 1 


eta). 


Minus 


12732417-12732289 


334223 


Dunham, 1 


eta). 


Minus 


12734365-12734269 


334255 


Dunham, 1 


etal. 


Minus 


13200776-13200692 


334492 


Dunham, 1 


etal. 


Minus 


14478333-14478172 


334648 


Dunham, 1 


etat. 


Minus 


15363301-15363222 


334787 


Dunham, 1. etat 


Minus 


16299093-16298937 


334933 


Dunham, 1. eta). 


Minus 


20078117-20077991 
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334945 


Dunham, I. elal. 


Minus 


20138885-20138637 


334967 


Dunham, I. elal. 


Minus 


20173311-20173218 


334990 


Dunham, 1. etal. 


Minus 


20341159-20341087 


335093 


Dunham, 1. elal. 


Minus 


21^7367-21297214 


335288 


Dunham, 1. eta!. 


Minus 


22304275-22303770 


335269 


Dunham, 1. elal. 


Minus 


22305950-22305708 


335548 


Dunham, 1. ete). 


Minus 


24662773-24662673 


335551 


Dunham, 1. etal. 


Minus 


24679828-24678961 


335619 


Dunham, 1. etal. 


Minus 


25082677-25082498 


335620 


Dunham, 1. etal. 


Minus 


25092561-25092434 


335621 


Dunham, I. eLa). 


Minus 


25098878-25098767 


335682 


Dunham, ). et.at. 


Minus 


25421215-25421093 


335755 


Dunham, 1. eLa). 


Minus 


25763806-25763747 


335814 


Dunham, 1. etal. 


Minus 


26320043-26319845 


335815 


Dunham, 1. etal. 


Minus 


26320518-26320421 


335835 


Dunham, 1. etal 


Minus 


26393311-26393245 


335851 


Dunham, 1. eta). 


Minus 


26604863-26604742 


335868 


Dunham, I. eUl 


Minus 


26711437-26711300 


3358S6 


Dunham, 1. eta). 


Minus 


26977639-26977558 


335936 


Dunham, 1. eta). 


Minus 


27360474-27360400 


335948 


Dunham, 1. eta). 


Minus 


27555924-27555788 


336066 


Dunham, ). eta). 


Minus 


29241080-29240842 


336205 


Dunham, 1. eta). 


Minus 


30477456-30477311 


336275 


Dunham, 1. eta). 


Minus 


32086675-32085536 


336292 


Dunham, 1. eta). 


Minus 


3281 803 5-3281 7927 


336331 


Dunham, ). etal 


Minus 


33594527-33594371 


336419 


Dunham, 1. etal. 


Minus 


34052568-34052445 


336675 


Dunham, 1. etal. 


Minus 


2020758-2020664 


336684 


Dunham, 1. eta). 


Minus 


2158060-2157993 


338716 


Dunham, t etal. 


Minus 


3259952-3259862 


336798 


Dunham, (. etal. 


Minus 


5888954-5888757 


337043 


Dunham, 1. etal. 


Minus 


17407330-17407251 


337046 


Dunham, 1. etal. 


Minus 


17610892-17610821 


337128 


Dunham, 1. etal 


Minus 


22215251-22215034 


337192 


Dunham, I. eta). 


Minus 


24591853-24591771 


337194 


Dunham, 1. eta). 


Minus 


24610510-24610359 


337229 


Dunham, ). eta). 


Minus 


26716579-26716481 


337325 


Dunham, ). eta). 


Minus 


30015948-30015800 


337497 


Dunham, 1. etal. 


Minus 


33371317-33371258 


337500 


Dunham, t. etal. 


Minus 


33376212-33376158 


337603 


Dunham, ). eta). 


Minus 


1299296-1299194 


337605 


Dunham, ). eta). 
Dunham, ). eta). 


Minus 


1346555-1346397 


337671 


Minus 


3260634-3260547 


337786 


Dunham, 1. eta). 


Minus 


4133203-4133081 


337862 


Dunham, 1. etal. 


Minus 


5347658-5347550 


338083 


Dunham, ). eta). 


Minus 


9318438-9318301 


338158 


Dunham, 1. eta). 


Minus 


11794465-11794343 


338161 


Dunham, 1. eta). 


Minus 


12124716-12124658 


338182 


Dunham, 1. etal. 


Minus 


12824919-12824827 


338189 


Dunham, 1. etal. 


Minus 


12878594-12878478 


338199 


Dunham, t eta). 


Minus 


13760865-13760780 


338215 
338469 


Dunham, 1. etal. 
Dunham, 1. etal. 


Minus 
Minus 


14055447-14055355 
20520387-20520242 


338549 


Dunham, 1. etal. 


Minus 


22049171-22049081 


338561 


Dunham, I. eta). 


Minus 


22311966-22311856 


338671 


Dunham, 1. etal. 


Minus 


24508421-24508346 


338676 


Dunham, ). etal. 


Minus 


24637427-24637369 


338726 


Dunham, ). etal. 


Minus 


25926206-25925618 


338779 


Dunham, 1. etal. 


Minus 


27030151-27029795 


338871 


Dunham, 1. etal. 


Minus 


28301708-28301611 


338872 


Dunham, I. etaJ. 


Minus 


28300921-28300790 


338966 


Dunham, 1. eta). 


Minus 


29614876-29614749 


339229 


Dunham, ). eta). 


Minus 


3272233W2722199 


339264 


Dunham, 1. eta). 


Minus 


32975145-32975053 


325228 


6381940 Plus 


2630-2694 




325235 


6381943 Minus 


162154-162264 


329588 


3962484 Plus 


1169-1619 




329560 


3962491 Plus 


2095-2990 




329541 


3983503 Minus 


2765-3059 




325328 


5866875 Plus 


86780-86854 




325340 


6017033 Minus 


166656-166819 


325373 


5866920 Minus 


1136686-1136777 


325367 


5866920 Minus 


922881.922958 


325389 


5666921 Plus 


239672-239759 


325436 


5866939 Minus 


29778-29907 




325498 


5866967 Plus 


173372-173930 


325471 


6017034 Minus 


289268-289342 


325557 


6056302 Plus 


50921-51050 




325559 


6249595 Minus 


11859^119172 


325560 


6249595 Minus 


133794-133981 


325569 


6249599 Plus 


79927-80217 




3255B7 


6682462 Plus 


126724-126967 


325585 


6682462 Phis 


73476-73574 




325597 


5866992 Plus 


1065020-1065089 


325639 


5867002 Plus 


253525-253608 
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325739 


5867036 


Minus 


205138-205269 


325740 


6867038 


Minus 


207533-207690 


325792 


6469828 


Minus 


1018-1176 


325735 


6552447 


Minus 


269122-269190 


325685 


6682468 


Plus 


117397-117483 


325686 


6682468 


Plus 


118337-118439 


325819 


6682490 


Minus 


130314-130370 


329764 


6048195 


Minus 


109733-109968 


329703 


6065793 


Minus 


139994-140138 


329643 


6448539 


Plus 


53403-53537 


329816 


6624888 


Minus 


70295-70423 


329860 


6687260 


Minus 


163474-163605 


325883 


5867087 


Plus 


22498-22663 


325895 


5867097 


Plus 


35B317^58476 


325925 


5867124 


Plus 


115749-115962 


325932 


6867127 


Plus 


7369-7441 


325941 


5867133 


Minus 


64228-64402 


325969 


5867153 


Plus 


101911-102061 


325971 


5867153 


Plus 


105841-106035 


329993 


4567166 


Minus 


101307-101434 


330020 


6671887 


Plus 


172397-172491 


326163 


5887168 


Minus 


7831-8035 


326274 


5867171 


Minus 


410289-410404 


326025 


5867176 


Plus 


70854-70915 


326046 


5887182 


Minus 


62663-62825 


326099 


58671 B8 


Minus 


661381-661510 


326108 


5867187 


Minus 


23784-23903 


326165 


5B67208 


Minus 


62787-62929 


326189 


5867212 


Plus 


69288-69413 


326204 


5867218 


Minus 


148088-148200 


326230 


5867230 


Minus 


301868-301972 


330052 


4567182 


Plus 


352560-352963 


330036 


6042048 


Plus 


117120-117216 


326360 


5867293 


Plus 


13627-13844 


326589 


5867320 


Plus 


22760-22919 


326393 


5857341 


Plus 


41702-41841 


326505 


5867435 


Minus 


6818-8949 


326515 


5867439 


Plus 


36683-36809 


326592 


6138928 


Plus 


23689-23828 


330107 


6015249 


Minus 


100091-100282 


330106 


6015249 


Minus 


99443-99778 


330100 


6015253 


Plus 


21166-21301 


330093 


6015278 


Plus 


1043-1199 


330088 


6015293 


PIU3 


37517-37638 


330085 


6015302 


Minus 


59613-59770 


330120 


6671864 


Minus 


127553-127656 


330123 


6671869 


Minus 


35311-35406 


328742 


5867611 


Minus 


95187-95248 


326605 


5867637 


Plus 


24656-24749 


326818 


6117831 


Minus 


15199-15309 


326720 


6552456 


Plus 


84525-84677 


326770 


6598307 


Minus 


513603^13668 


326692 


6682502 


Plus 


117697-117899 


326693 


6682502 


Minus 


335002-335095 


326983 


5867657 


Minus 


16023-16581 


326S91 


5867660 


Plus 


18147-18339 


326936 


6004446 


Minus 


10217-10357 


326964 


6469836 


Plus 


75340-75456 


327040 


6531965 


Plus 


783670-783817 


327053 


6531965 


Plus 


2247267-2247437 


327075 


6531965 


Plus 


4041318-4041431 


327085 


6531965 


Plus 


4734947-4735059 


327038 


6531965 


Pius 


319951-320040 


327130 


6531976 


Pius 


20247-22343 


327156 


5866841 


Minus 


2462-2620 


327288 


5867481 


Plus 


48583-48773 


327332 


5667516 


Minus 


56361-56532 


327220 


5867525 


Minus 


65701-65781 


327224 


5867534 


Pius 


188468-188544 


327321 


6249562 


Minus 


99745-99836 


327381 


6552412 


Minus 


61013-62130 


327396 


5867743 


Plus 


8702-8820 


327414 


5867750 


Plus 


102461-102586 


327442 


5867759 


Plus 


111483-111618 


327467 


5867772 


Pius 


88030-88151 


327473 


5867775 


Plus 


75101-75181 


327483 


5867783 


Plus 


181573-181662 


327377 


5867793 


Minus 


37610-37676 


327562 


5867804 


Minus 


343989-344474 


327568 


5867811 


Minus 


46152-46287 


327608 


6004463 


Plus 


200262-200495 


327611 


686786B 


Minus 


175063-175392 


327642 


5867891 


Minus 


2513-2743 


327654 


5867910 


Minus 


97564-97710 


327734 


5867940 


Minus 


31003-31583 
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327775 


5867964 


Minus 


130791-130871 


327796 


5867982 


Plus 


85267-85405 


327840 


6249578 


Minus 


73065-73206 


330208 


6013599 


Pius 


66517-66931 


330263 


6671884 


Minus 


101503-101634 


328004 


5867993 


Minus 


157407-157887 


328101 


5868020 


Plus 


289920-290014 


328100 


5868020 


Minus 


263545-263635 


328113 


5866024 


Mnus 


60378-60491 


328157 


5868064 


Plus 


73326-73615 


328196 


S868080 


Minus 


16551-16729 


328197 


5868081 


Minus 


42133-42438 


327940 


5868197 


Minus 


95240-95428 


327984 


5668216 


Plus 


66611-66677 


328021 


5902482 


Plus 


713478-714590 


328068 


6117819 


Plus 


253903-254022 


328264 


6381912 


Plus 


55086-55404 


330300 


2905862 


Minus 


3246-3302 


328608 


5868222 


Minus 


8777047953 


326600 


5868229 


Minus 


38889-40010 


328616 


5868239 


Plus 


293920-294224 


328623 


5868246 


Minus 


120020-120126 


328632 


5868247 


Plus 


76734-76853 


328666 


5868254 


Minus 


778-901 


328698 


5868264 


Minus 


625555425633 


328700 


5668264 


Plus 


764089-764203 


328708 


5868271 


Minus 


68114-68854 


328735 


6868289 


Plus 


89389-89455 


328743 


6868289 


Plus 


274638-274726 


328806 


5B68324 


Plus 


29408-29684 


328299 


5868366 


Minus 


149708-149889 


328342 


5868383 


Plus 


5995540094 


328365 


5668387 


Minus 


270724-270798 


328369 


5868388 


Plus 


75371-75583 


328381 


6868392 


Plus 


662758462848 


328451 


5868425 


Minus 


217275-217336 


328481 


5868449 


Minus 


8987-9180 


328500 


5668464 


Plus 


5909849481 


328530 


5868482 


Plus 


3349734335406 


328664 


6004473 


Plus 


1193739-1193866 


328861 


6381928 


Minus 


108317-108403 


328908 


5868493 


Pius 


117002-117059 


328933 


5868500 


Plus 


771755-771889 


328934 


5868500 


Plus 


846342-846448 


328949 


6456765 


Minus 


43552-43619 


330313 


6042030 


Minus 


33642-33775 


329005 


5868542 


Plus 


85470-65673 


330366 


2944106 


Plus 


151837-151914 


330372 


6580495 


Minus 


317461-317688 


329033 


5868561 


Minus 


5390-5479 


329037 


5868562 


Minus 


3246642562 


329067 


5868591 


Minus 


146417-147652 


329134 


5B83679 


Pius 


2995940018 


329157 


5868687 


Minus 


145940-146155 


329178 


5868704 


Plus 


179177-179463 


329192 


5868716 


Plus 


166936-167020 


329194 


5868716 


Minus 


304450404559 


329204 


5868720 


Minus 


3050-3190 


329224 


5868728 


Plus 


27422-27664 


329228 


5B68728 


Minus 


5011B40287 


329288 


5868771 


Plus 


25554-26299 


329337 


5868806 


Minus 


467155467222 


329011 


6682532 


Plus 


4865848741 
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TABLE 9A: Potential Therapeutic, Diagnostic and Prognostic targets for Therapy of Lung Cancer 

Table 9A shows about 1312 genes up-regu!ated in lung tumors (including squamous cell carcinomas, adenocarcinomas, small ceil carcinomas, granulomatous and carcinoid 
tumors) relative to normal body tissues. These genes were selected from about 59680 probesetB on the Eos/Afiymetrix Hu03 Genechip array. 

Table 9B show the accession numbers for those Pke/s lacking UntgenelDs for table 9A. For each probeset we have listed the gene cluster number from which the 
oHgonudeotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed In the 
■Accession" column. 

Table 9C show the genomic positioning for those Pkey's lacking Unlgene ID's and accession numbers In table 9A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unlgene number 

Unlgene Title; Unlgene gene title 

R1 : Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 

average of normal lung samples 

Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 



R2: 

Pkey 

400195 

400205 

400220 

400277 

400285 

400288 

400289 

400298 

400301 

400303 

400328 

400419 

400512 

400517 

400560 

400664 

400665 

400666 

400749 

400763 

401027 

401093 

401203 

401212 

401411 

401435 

401464 

401714 

401747 

401760 

401760 

4017B1 

4017B5 

401797 

401961 

401985 

401994 

402075 

402260 

402265 

402297 

402408 

402420 

402674 

402802 

402994 

403137 

403306 

403329 

403381 

403478 

403485 

403627 

403715 

404044 

404076 

404101 

404140 

404165 

404185 

404210 

404253 



ExAccn 



X06256 

X0782O 

AA032279 

X03635 

AA242758 

X87344 

AF084545 

AF242388 



UnlgenelD 



Hs. 149609 

Hs.2258 

Hs.61635 

Hs.1657 

Hs.79136 

Hs. 180062 



AF039241 



AF053004 



NMJW6825 



Unlgene Title ' 


R1 


R2 


NMJ)07057*:Homo sapiens ZW10 interactor 
NrVL006265*:Homo sapiens RAD21 (S. pombe) 


1.00 


1.00 


15.80 


396.00 


Eos Control 


2.26 


2.84 


Eos Control 


7.68 


9.72 


Eos Control 


1.00 


1.00 


integrin, alpha 5 (fibronectin receptor, 


1.04 


2.24 


matrix metalloproteinase 10 (stromslysin 


132.45 


4.00 


six transmembrane epithelial an Cg en of 


43.86 


74.00 


estrogen receptor 1 


1.00 


1.00 


UV-1 protein, estrogen regulated 


1J5 


1.65 


transporter 2, ATP-binrilng cassette, sub 


0.87 


1.80 


Target 


156.55 


253.00 


NM~030878*:Homo sapiens cytochrome P450, 


1.00 


2.00 


lengsin 


3.67 


87.00 


NMl030878":Homo sapiens cytochrome P450, 


1.00 


1.00 


NM_002425:Homo sapiens matrix metailopro 


20.26 


45.00 


NM^002425".Homo sapiens matrix metailopro 


1.36 


1.07 


NM w 002425;Homo sapiens matrix metailopro 


3.26 


3.22 


NM^003105*:Homo sapiens sortiiin-related 


1.00 


91.00 




7.63 


24.00 


Target Exon 


1.00 


1.00 


C12000586*:gt|6330167|dbj|BAA86477.1|(A 


1.00 


155.00 


Target Exon 


1.00 


86.00 


C12000457*:gi|7512178lpir||T30337 polypr 


1.00 


400.00 


ENSP00000247172*:HYPOTHETIGAL 126.2 kDa 


1.00 


72.00 


C14000397*:giJ7499898|pirffT33295 hypoth 


1.00 


64.00 


hlstone deacetylase 5 


3.82 


49.00 


ENSP00000241802*:CDNA FU 11007 FIS, CLON 


2.02 


40.00 


Homo sapiens keratin 17 (KRT17) 


12B.43 


68.00 


Target Exon 


1.74 


35.00 


NM_005557*:Homo sapiens keratin 16 (foca 


26.47 


10.50 


Target Exon 


10.33 


4.61 


NM_002275*:Homo sapiens keratin 15 (KRT1 


4.13 


2.70 


Target Exon 


1.44 


2.10 


NM_021626:Homo sapiens serine carboxypep 


1.41 


1.86 


class 1 cytokine receptor 


1.00 


177.00 


Target Exon 


61.84 


47.00 


ENSP00000251056*:Plasma membrane calcium 


1.00 


1.00 


NM_001436*:Homo sapiens fibriHarin (FBL 


1.58 


1.39 


Target Exon 


2.09 


35.00 


Target Exon 


1.00 


92.00 


NM_030920*:Homo sapiens hypothetical pro 


28.87 


13.00 


C1000873^gf|104324O0femb|CAC10290.1| (A 


1.00 


1.44 


Target Exon 


7,44 


243.00 


NM_001397:Homo sapiens endotheHn conver 


1.00 


70.00 


NMJ)02463*:Homo sapiens myxovirus {influ 


1.37 


1.43 


NM_005381 # :Homo sapiens nucleolln (NCL), 


1.00 


19.00 


transmembrane protein (63kD), endoplasmi 


1.00 


43.00 


Target Exon 


1.00 


61.00 


ENSP00000231844 # :Ecotroplc virus Integra 


1.00 


119.00 


NrvL022342:Homo sapiens kinesin protein 9 


28.13 


136.00 


C3001813*:g1|12737279|refpcp_012163.1| k 


20.23 


76.00 


Target Exon 


6.30 


29.33 


Target Exon 


1.30 


35.00 


ENSP00000237855*:DJ398G3.2 (NOVEL PROTEl 


1.00 


54.00 


NM_016020*:Homo sapiens CGI-75 protein ( 


14.29 


91.00 


C8000950:gil423560lplr|lA47318 RNA-trfndi 


1.00 


1.00 


NM_006510:Homo sapiens ret finger protel 


1.42 


1.44 


ENSP00000244562:NRH dehydrogenase [quino 


1.00 


54.00 


Target Exon 


1.00 


117.00 


NWL005936:Homo sapiens myeloloVlymphold 


5.93 


13.77 


NM_021058*:Homo sapiens H2B hlstone fami 


1.00 


1.00 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



404877 
404927 
404996 
405449 
405568 
405572 
405646 
405676 
405770 
405932 
406137 
406360 
406399 
406467 
406621 
406642 
406663 
406671 
406673 
406676 
406678 



406687 
406690 
406698 
406815 
406851 
406964 
406967 
406974 
407103 
407128 
407137 
407168 
407239 
407242 
407244 
407269 
407300 
407366 
407378 
407430 
407453 
407577 
407634 
407710 
407720 
407746 
407756 
407758 
407782 
407788 
407790 
407811 
407839 
407944 
408000 
408031 
408063 
408070 
408101 
408122 
408212 
408243 
408349 
408353 
408354 
408369 
408380 
408482 
408522 
408538 
408545 
408572 
408633 
408660 
408761 
408771 



BE336714 



X57809 


Hs.181125 


AJ245210 




U24683 


Hs. 233441 


AA1 29547 


Hs.285754 


M34996 


Hs. 198253 


X58399 


Hs.81221 


U77534 




Ml 8728 




M31126 


Hs.272822 


M29540 


Hs.220529 


X03068 


Hs.73931 


AA833930 


Hs.288036 


AA609784 








M24349 








AA4248B1 


Hs 256301 


R83312 


Hs.237260 


T97307 




R45175 


Hs 117183 


AA076350 


Hs 67846 


m 18728 




M10Q14 


Hs.75431 






AA102616 


Hs 120769 


AF026942 


Hs. 271 530 


AA2992G4 


Hs 57776 


AF 169351 




AJ 132087 
rut i wiUOi 




AW131324 


Hs 246759 


AWQ16569 


Hs. 136414 


AW022727 


Hs 23616 


AB037776 


Hs 38002 


AK001962 




AA116021 


Hs.38260 


D50915 


Hs.38365 


AA608956 


Hs.1 12619 


BE514982 


Hs.38991 


A1027274 


Hs.288941 


AW190902 


Hs.40098 


AA045144 


Hs.161566 


R34008 


H3.239727 


L11690 


Hs.620 


AA081395 


Hs.42173 


BE086548 


Hs.42346 


AW148852 




AW968504 


Hs. 123073 


AI432652 


Hs.42B24 


AA297567 


Hs.43728 


Y00787 


Hs.624 


BE546947 


Hs.44276 


BE439838 


Hs.44298 


AI382803 


Hs.159235 


R38438 


Hs.182575 


AF123050 


Hs.44532 


NMJW0676 


Hs.45743 


AI541214 


Hs.46320 


AW381532 


Hs.135188 


AW235405 


Hs.253690 


AA055611 


Hs.226568 


AW953372 


Hs.46677 


AA525775 




AA057264 


Hs.238936 


AW732573 


Hs.47584 



C6Q01909:gl|704441|dbj]BAA18909.1| (D298 
C600123B*:gi|121715|sp|P26697|GTA3_CHICK 
Target Exon 

Nftl_021048:Homo sapiens melanoma antigen, 
NM_005596*:Homo sapiens nuclear factor I 
choJesteryl ester transfer protein, plas 
Target Exon 

NMJ)05365:Homo sapiens melanoma antigen, 
Target Exon 
Target Exon 

CY000047*:gil11427234|refIXP^009399.1| z 
NMJ)31413*:Homo sapiens cat eye syndrome 
Target Exon 

C12000200:gi|4557225|reflNP_000005.1| a) 
cytochrome c-1 

NM_002362:Homo sapiens melanoma antigen, 
C15000305.gi|3806122|gbJAAC69198.1) (AF0 
NM.000179*:Homo sapiens mutS (E. coli) h 
Target Exon 

NMJ)03122*:Homo sapiens serine protease 
Target Exon 

immunoglobulin lambda locus 
gb:Homo sapiens mRNA for immunoglobulin 
immunoglobulin heavy constant mu 
met proto-oncogene (hepatocyte growth fa 
major histocompatibility complex, class 
Human L2-9 transcript of prearranged Im, 
gb:Human clone 1A11 immunoglobulin varia 
gb:Human nonspecific crossreacting arttig 
pregnancy specific beta- 1 -glycoprotein 9 
carclnoembryonic antigen-related cell ad 
major histocompatibility complex, class 
tRNA Isopentenylpyrophosphate transferas 
major histocompatibility complex, class 
gb:Human alpha satellite and satellite 3 
gb: Human parathyroid hormone-like protel 
gb.Human parathyroid hormone-related pep 
hypothetical protein MGC13170 
EST 

gb:ye53h05.s1 Soares fetal liver spleen 
ESTs 

leukocyte immunogiobulin-like receptor, 
gb:Human nonspecific crossreacting antig 
fibrinogen, gamma polypeptide 
Homo sapiens cDNA FU121 49 fls, clone MA 
gb:zn43e07.s1 Stratagene He La cell s3 93 
gb:Homo sapiens cig33 mRNA, partial sequ 
ESTs, Moderately similar to 138022 hypot 
gb:Homo sapiens protein tyrosine phospha 
gb:Homo sapiens mRNA for axonemal dynein 
hypothetical protein MGC12538 
UDP-GlcNAcbetaGat beta-1,3-N-scetylgluc 
ESTs 

KIAA1355 protein 

hypothetical protein FU 1 1 100 

ubiquitin specific protease 18 

KIAA0125gene product 

ESTs. Moderately similar to PURKINJE CEL. 

S100 calcium-binding protein A2 

Homo sapiens cONA FU14866 fis. clone PL 

cysteine knot superfamily 1, BMP antagon 

ESTs 

desmocollin 2 

bullous pemphigoid antigen 1 (230/240kD) 
Homo sapiens cDNA FL) 10366 fis, clone NT 
caicineurin-bindlng protein calsarcln-1 
gb:xf05d05.x1 NCLCGAP_Bm35 Homo sapien 
CDC2-retated protein kinase 7 
hypothetical protein FU10718 



interleukin 8 
homeo box C10 

mitochondrial ribosoma) protein S17 
ESTs 

solute carrier family 15 (H77? transport 
diubiquitin 

adenosine A2b receptor 

Small pratine-rich protein SPRK (human, 

ESTs 

ESTs 

ESTs, Moderately sfmflar to ALU4_HUMAN A 
PRO2000 protein 

ESTs, Moderately similar to PC4259 ferri 
ESTs, Weakly similar to (defline not ava 
potassium voltage-gated channel, delayed 



29.71 


42.00 


1.30 


1.00 


1.00 


1.00 


1.00 


15.00 


1.G0 


60.00 


1.07 


1.38 


1.61 


2.01 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


78.00 


0.76 


1.14 


1.01 


1.28 


1.13 


2.89 


45.52 


37.00 


1.99 


1.99 


2.77 


2,38 


1.00 


35.00 


1.00 


39.00 


1.00 


1.00 


1.41 


1.74 


2.16 


3.91 


2.07 


2.93 


15.00 


51.00 


0.98 


3.09 


1.30 


1.53 


1.33 


1.45 


1.46 


2.85 


8.61 


8.50 


226.37 


350.00 


1.01 


Z52 


20.25 


32.00 


0.75 


1.91 


38.15 


1114.00 


1.00 


1.00 


1.00 


1.00 


1.77 


1.10 


1.00 


1.00 


142.70 


135.00 


2.16 


18.00 


1.10 


1.57 


1.12 


2.85 


3.24 


15.38 


3.53 


3.68 


19.74 


73.00 


0.06 


8.25 


1.00 


26.00 


1.00 


25.00 


1.00 


75.00 


1.00 


1.00 


111.20 


228.00 


1.00 


28.00 


1.89 


1.31 


1.00 


1.00 


4.51 


5.00 


1.00 


28.00 


0.97 


1.14 


7.88 


3.83 


3.63 


42.00 


89.96 


109.00 


173.91 


108.00 


111.30 


70.00 


151.17 


8.00 


9.91 


93.00 


195.78 


231.00 


1.00 


1.00 


37.84 


61.00 


0.85 


1.71 


5.88 


7.91 


4.27 


9.98 


3.79 


3.46 


1.88 


1.65 


1.00 


73.00 


1.41 


16.50 


15.19 


37.22 


1.65 


1.19 


1.98 


1.24 


1.55 


1.50 


1.00 


1.00 


1.00 


44.00 


107.16 


56.00 


1.00 


1.00 


52.24 


141.00 


3.05 


109.00 
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Hs.47860 

Hs.48269 

Hs.256862 

Hs.182278 

Hs .250822 

Hs.71642 

Hs.344096 

Hs.49767 

Hs.50002 

Hs.50081 

Hs.190721 

Hs.50441 

Hs.112208 

Hs.50758 

Hs.50966 

Hs.271695 

Hs.27206 

Hs.1 87579 

Hs.22972 

Hs.54416 

Ks. 129056 

Hs.54451 

Hs.346735 

Hs.67688 



Hs.190721 
Hs.55279 
Hs.56023 
Hs.108681 
Hs.56145 
Hs.56265 



Hs.57101 
Hs.156351 
Hs.278025 
Hs.727 
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Hs.58169 
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Hs.15830 
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NPC1 (Niemann-Pick disease, type C1> gen 
neurotrophic tyrosine kinase, receptor, 
vaccinia related kinase 1 
ESTs 

calmodulin 2 (phosphorylase kinase, delt 
serine/threonine kinase 15 
guanine nucleotide binding protein (G pr 
glycoprotein (transmembrane) nmb 
NM_004553:Homo sapiens NADH dehydrogenas 
small inducible cytokine subfamily A (Cy 
Hypothetical protein, XPJ351860 (KIAA119 
ESTs 

CGI-04 protein 
XAGE-1 protein 

SMC4 (structural maintenance of chromoso 
carbamoyl-phosphate synthetase 1, mitocn 
ESTs, Weakly similar to 210926QA B cell 
ESTs 
ESTs 

hypothetical protein FU 13352 

sine oculls homeobox (Drosophita) homoto 

ESTs 

laminin, gamma 2 (nicein (100kO), kalin) 
splicing factor, arginlne/serine-rich 5 
ESTs 

NCK adaptor protein 1 

gb2m87b03.s1 Stratagene ovarian cancer 

ESTs 

serine (or cysteine} proteinase fnhfbfto 

brain-derived neurotrophic factor 

Homo sapiens brain tumor associated prot 

thymosin, beta, identified in neuroblast 
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cystatinSN 
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mlnlchromosome maintenance deficient (S. 
ESTs 
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hyaluronan synthase 1 
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KIAA0918 protein 

highly expressed in cancer, rich In leuc 

proline oxidase homolog 

ESTs 
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hypothetical protein FU 12691 
hypothetical protein FU 10514 
zinc finger protein 281 
hypoxia-inducible protein 2 
ESTs 

hypothetical protein FU13149 
ESTs 

carbonic anhydrase IX 

transmembrane protease, serine 4 

hypothetical protein KIAA1335 

hypothetical protein MGC14128 

a disintegrin and metailoproteinase doma 

Homo sapiens cDNA: FU22044 fis, clone H 
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ESTs 

leukocyte immunogfobuMke receptor, 
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MD-2 protein 
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NRAS-related gene 
KIAA1077 protein 
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microtubute-associated protein 1B 
v-myc avian myelocytomatosis viral oncog 
hypotheticat protein FU10901 
RAB6 interacting, Wnesin-like (rabkines 
macrophage mlgralion inhibitory factor ( 
ESTs, Weakly similar to 15521 4 salivary 
hypothetical protein FU13346 
nuclear transcription factor Y, alpha 
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20.00 


20.41 


16.00 


7.09 


22.00 


29.75 


19.00 


1.00 


1.00 


2.06 


1.81 


3.02 


131.00 


1.00 


131.00 


1.23 


1.02 


30.63 


171.00 


1.00 


1.00 


1.01 


1.45 


1.26 


1.11 


6.33 


5.04 


4.30 


7.82 


1.00 


1.00 


13.75 


103.00 


2oan 


1.00 


1.00 


1.00 


1.14 


1.10 


1.00 


48.00 


162.25 


189.00 


2.12 


1.80 


1.15 


1.94 


1.00 


1.00 


16.42 


84.00 


0.57 


1.78 


2.71 


4.15 


46.37 


47.00 


1.12 


1.35 


1.55 


2.51 


1.00 


55.00 


4.36 


3.18 


1.00 


1.32 


1.00 


22.00 


Z06 


1.88 


1.00 


1.00 


7.00 


9.85 


1.00 


3.00 


1.31 


2.01 


1.98 


1.71 


12.60 


63.00 


1.00 


1.00 


1.00 


237.00 
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4.58 



90.00 



TABLE 9B 

Pkey: Unique Eos probeset Identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
407746 



CAT Number 
10125J 



408070 


1036688J 


408660 


107294J 


409522 


11 3735 J 


409866 


1156522L.1 


410032 


1170435_1 


411089 


12317^.1 


411152 


1234028.1 


412537 


1304.1 



Accession 

AK001962 R69415 BE464605 AA418699 AA053293 AA149075 AA058396 AW338226 AW272659 AA454607 AI139535 AW469852 AI275461 
AW271982 AA730033 AA576507 AA991217 AA782067 AI985851 AA805864 AA505598 AW469857 R69546 AA988279 AW001647 N63320 
D82661 T27343 AA306950 AA360989 R58778 
AW148852BE350895 

AA525775 AA056342 AJ538978 AW975281 AA664986 

AA075382AA075431 

AW502152H41202 H29772 

BE065985 BE065944 BE066008 BE066083 6E066093 

AA456454 AA713730AA091294 AA584921 N86077 AW836781 AA601031 AA579876AA551108AA633188 AW905577AJ955808 AJ679386 
A1679895 AA514764 M454562 AI082382 AA595822 AA551 351 AA586369 AA66B384 AA1 88934 AA666398 AA551297 AA5651 88 
BE069199 AW936012 AW877466 AW819782 AW935798 AW835546 AW936042 BE069121 AW835625 AW877536 AW935885 BE069202 
AW820019 AW935937 BE160180 AW935946 BE069101 BE069125 AW877527 BE160316 BE160398 AW935794 AW835701 AW935784 
AL031 778 X59711 NM 002505 M59079 A1870439 Af494259AW664010 AA405063 AA436132 BE174516 AA412691 AI400314 AA436024 
T29403 BE079412 BE079428 N9Q322 A1631202 A1141758 A\016793 AH 67566 AI862075 AI375230 W208445 AW235763 AL0441 13 AA382556 
AW953918 AA927051 AA889823 BE003094 AW3901 55 AW360805 AW360823 AW360810 AA425472 A1694282 AL0441 14 A1684577 A1809865 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 
412811 132943J 



413690 1383256.1 
414883 15024_1 



415989 156454J 
417324 166714.1 
418574 17690 1 



418712 
419443 
419502 



419936 
421582 



422128 
423034 
423616 
424200 
424999 
426966 
426991 
427260 
428023 



429220 
429978 



430439 
430935 
431089 
431322 
432407 



17841 25J 
184788J 
18535.1 



189181.1 
2041.1 



211994.1 

224122.1 

23234.1 

236595.1 

245835.1 

273896.1 

27415.1 

276598.1 



301384.1 
31150.1 



31808.1 
325772.1 
327825.1 
331 543 J 
34624.1 



434414 38585.1 



436608 42361.3 



PCT/US02/12476 

AM78773 AI160445 AI674630 N69088 AW665529 N49278 All 29239 AI457890 AI621264 AW297152 A1268215 AA907787 AI286170 AI017982 
AI963541 AI469807 AI969353 BE552356 N66509 AA736741 AA382555 AW07581 1 AW292026 

H06382 AW957730 AA352014 R13591 AA121201 D60420 BE263253 BE047862 Z41952 AI424991 AI693507 AIB63108 AA599060 A1091 148 
AA598689 R39887AA813tf2 AVW16452 H06383 R41807 AI364268M^ 
AA121202R17734 
BE157489 BE157560 

AA926960 AA926959 W76521 W24270 W21 526 AA0371 72 BE267636 H831 86 AA469909 N86396 AA001348 BE535736 AA081745 BE566245 
AA082436 H72525 H77575 N49786 W80565 H78746 BE569085 W04339 R981 27 T55938 BE279271 AW960304 T29812 AA476873 BE297387 
AA292753 AA177048 NM.001826 X54941 BE314366 AA908783 AI719075 BE270172 BE269819 AA889955 AI204630 W25243 AI935150 
AA872039 W72395 T99530 AI422691 H98460 N31428 BE255916 H03265 A1857576 AA776920 AA910644 AA459522 AA293140 AW514667 
R75953 AW662396 AA662522 AI865147 AI4231 53 AW262230 AA584410 AA583187 AW024595 AW069734 AI828996 AA282997 AA876046 
AW613002 AA527373 AW972459 AIB3 1360 AA621 337 AA100926 AA772418 AA594628 A1033892 W95096 A1034317 AA398727 AI085031 
N9521 0 AI459432 A1041437 AA932124 AA627684 AA935829 AI004827 AI423513 A1094597 H42079 R54703 A1630359 AA617681 AA978045 
AA643280 W44561 Al 991 988 AI537692 AI090262 AA740817 AI31 2104 Al 91 1 822 AA41 6871 AM 85409 AA129784 AA701 623 AI075239 
AI139549 AA633648 AI339996 AI336880 AA399239 AI078708 AI085351 AI362835 AI34661 8 AM 46955 AI989380 AI34B243 N92892 AA765850 
AI494230 AI278887 AA962598 AI492600 W80435 AA001979 R97424 AJ129015 N24127 AA157451 AA235549 AA459292 AA037114 AA129785 
AI494211 AW059601 AW886710 R92790 N59755 AI361128 AW589407 H47725 H97534 H48076 H48450T99631 AW300758 H03431 R76789 
AA954344 H77576 R96823 AI457100 N92845 N49682 H42038 BE220898 BE220715 H99552 AA701624 N74173 R54704 H79520 H72923 
H03266 BE261919 AA769633 M480310 AA507454 AA910586 A1203723 AW104725 W25611 W25071 T88980 K03513 T77589 R99156 
W95095 R97470 AA702275 T77551 AA91 1952 H82956 N83673 AA2S3872 

A1267700 A1720344 AA191424 AI023543 AI469633 AA172056 AW958465 AA172236 AW953397 AA355086 
AW265494AA455904AA195677AW265432AW991605AA456370 

N20754 N28747 AI568146 AI979339 AA322671 AA322672 AW955043 AI990326 AA776406 AI01 6250 AA843678 AW451882 N23137 N231 29 
W70051 AI038748 AA831327 AI925845 AW945895 
Z42183 T31621T97478 
D82703AA242966 D79798 

AU076704 T74854 T74860 T72098 T73265 T73873 T691 80 T74658 T5B786 T60385 T7341 0 T68781 T67845 T67593 T73952 T67864 T60630 
T68367 T68401 T53959 T72360 T72099 T60377 T58961 T71712 T7282I T64738 T74645 T72037 T68688 T72063 T73258 T72826 T64242 
T68220 T74673 T71BQD T68355 T61227 T62738 T69317 T53850 T64692 T73768 T73962T73382 T6B914 T70975 T73400 T60631 T73277 
T73203 T70498 T61409 T58925 NM.000508 M64982 T68301 T73729 T69445 T60424 T67922 T67736 T68716 T67755 T74765 T73819 T58719 
T74756 T60477 T74863 T61109 T68329 T58850 T71857 T73425 T53736 T68607 T58898 T64309 T72031 T72079 T64305 T71908T68107 
T71916 T73787 T56035 T64425T71870 T60476 T61376 T67820 T71895 T41006 T69441 T68170 T74617 T71958 T69440 T61875 R06796 
H48353 T71914 T53939 T64121 AA693996 T72525 T67779 T68078 AA01 1465 AA345378 AV654847 AV654272 AV656001 AI064740 T82897 
N33594 AA344542 AW805054 AI207457 T61743 AA026737 K94389 AA382695 AA918409 T68044 S82092 T39959 AI017721 AA312395 
AA312919 T40156 H66239 AV652989 H38728 R98521 AV655200 R95790 W03250 W00913 AA344136 AV660126 R97923 AA34359B 
AW470774 AV651266 N54417 AA812862 AW182929 AH 1 1 192 H61463 H72060 AA344503 H38639 A127751 1 AV661 108 AI207625 T47810 
AA235252 T27853 T47778 R95746 H70620 AA701463 AW827166 R98475 C20925 AV657287 T71959 T71313 T73920 T73333 T61618 T69293 
T69283 T73931 T721 78 T72456 AV645639 AV653476 T72957 T72300 T58906 T71 457 T70494 T72958 T70495 T68267 T74407 T85778 
AA344726 T27854T74485T74101 T73868T71518T72304 AA343853 T73909 T68070 T72065 H72149T73493T73495 AV645993 R02293 
T70475 T64761 AA344441 AA343657 AA345732 M344328 AI110639 AA344603 AF06351 3 T64696 T68516 T72223 T60507 T67633 R29500 
T72517 R02292 T60599 T69206 T70452 T74677 R29366 T61 277 T74914 T60352 R29675 T74843 AV645792 AA344408 T691 97 T72057 
T69368 T69358 T6825B AV650429 T73341 T61 702 T74598 T40095 K02272 T401 06 AA343045 AA341908 AA341 907 AA342807 AA341 964 
T53747 T72042 T62764 AI064899 AA343060 T67832 T72440 T71770 T68091 T69108 T72449 T691 67 T71289 T68251 AV654844 T64375 
AA345234 T67598 AA01 141 4 T6B036 H48262 A1207557 T68219 W86031 T69081 T64232 R93196 T62136 AV650539 H67459 T72978 
AA344583 T60362 H58121 T9571 1 T72803 T68055 T7171 5 R29038 T72793 T69122 T64595 T62888 T69139 T68291 T64652 T67971 T46862 
AA693592 AI248502 R29454 T64764 T57001 773052 T71429 T51 1 76 T58866 AV655414 H90426 AA342489 T73666 T67848 T72512 T53835 
T67837 T7331 7 T74273 T69420 T68245 T74380 T67862 T74474 T56068 
AI792788 BE142230 AA252019 

A191 0275 X00474 X52003 X05030 NM.003225 AA314326 AA308400 AA5067B7 AA314825 AI571 948 AA507595 AA61 4579 AA58761 3 R83B18 
AA568312 AA614409 AA307578 AI925552 AW950J55 AI910083 M 1 2075 BE074052 AW004668 AA578674 AA582084 BE074O53 BE074126 
BE0741 40 AA514776 AA588034 BE074051 BE074068 AW009769 AW050590 M858276 R553B9 AI001 051 AW050700 AW75021 6 AA61 4539 
BE074045 AI307407 AW602303 BE073575 AI202532 AA524242 AI970839 AI909751 BE076078 AI909749 R55292 
AW881145 AA490718 M85637AA304575 T06067 AA331991 
AL1 1 9930 AA320696 AW752565 

AL031985 AL137241 AI792386 AI733664 AI857654 AI049911 
AA337221 AA336756AW966196 
AW953120 R56325 AA349562 

A14931 34 AI498691 AW771 508 AI498457 AI768408 AI783624 AI3839B5 AI5B0267 D7981 3 AA393768 
AK001536 AA191092 AW510354 AI554256 AL353968 AA134266 
AA653848 AA400100 AA401424 

AL038843 AA161338 BE26821 3 AA425597 N87306 AA092969 BE566038 AA247451 N47392 AI928802 AW1 82584 AW027872 AI81 983 1 
AI936994 W56258 AI653448 AI27861 1 AI283557 A1824306 AW33B658 AW150899 AA68751 4 N47393 N29B85 AA973469 AIO38904 A1292064 
A1034339 AW674593 N72156 AI079733 AI038683 AI291 616 AA491 599 AA993675 AA837380 BE006554 BE006473 A10B7090 T33044 
AA652043 A1203503 AA583959 W35283 AI129926 Z41844 AW020925 AW575848 AI684603 AA493297 AI140689 AI277175 AA425444 
AI932767 W02632 BE396786 R37261 
AW207206 AW341473 AA448195 AI951341 

AA249027 AL038984 AK001 993 AL080066 AV652725 BE566226 AA345557 AA315222 AA090585 AA375688 AA301 092 AA298454 W05762 
AW607939 H51 658 083880 N84323 BE296821 AW947007 061461 AW079261 AA329482 AW901 780 A1354442 AA772275 R31 663 A1354441 
AI767525 H92431 AI916735 H93575 AI394255 AW014741 AI573090 C06195AW612857 AW265195 AI339558AI377532AJ308821 AI919424 
AI589705 AW055215 AI336532 AI338051 AA806547 C75509 C00618 AW071 172 AW769904 AA630381 AI67801 8 Al 863985 D79662 BE221 049 
AW265018 A1589700 AW196655 N76573 A13709Q8 BE042393 N75017 A1698870 AW9601 15 
AL133561 AL041090 AL117481 AL1 22069 AW439292 AI968826 

AW072916A1184913AA489195AW466994AW469044 N59350 AI819642 AI280239 AI220572 AA789302 AI473611 AW841126 D60937 

BE041395AA491826AA621946AA715980AA666102 

AW970622 AA503009 AA502998 AA502989 AA502805 T921B8 

AA221036 R87170 BE537068 BE544757 C1B935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627 AW869639 
BE061833 BE000620 AW961 170 AW847519 AA308542 AW821633 AW945688 C04699 AA205504 AA377241 AW821 667 AA055720 
AW81 7981 AW856468 AA155719 M1 79928 T03007 AW754298 AA227407 AA1 1 3928 AA307904 C1 6859 

AI798376 S46400AW81 1617 AW81 1616 W00557 BE142245 AW858232 AW861851 AWB58362AA232351 AA218567 AA055556 AW85B231 
AW857541 AW814172 H66214 AW814398 AF1 341 64 AA243093 AA1 73345 AA199942 AA2233B4 AA227092 AA227080 T1 2379 AA0921 74 
T61139M149776 M699829AW879188AW813567AW813538Al267168AA157718AA15ni9AA100472AA100774AA130756AA157705 
AA157730M157715AA053524AWB49581 AW854568 C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE1 56909 AA226824 
AI829309 AW991957 N66951 AA527374 H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 
AW817705 AW817703 AW817659 BE0B1531 H59570 
AA628980 AI126603 BE504035 
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439000 
439285 

433760 
441128 

443068 
443947 
447636 



449305 
451105 
451320' 



451807 . 
452410 



454241 
455175 
456237 



467716.1 
47065.1 

47673.1 
51021.2 

558874.1 
5861 60_1 
7301J 



448993 79225.1 



804424.1 
859083.1 
86576.1 



8865.1 
9163.1 



1067807.1 
1257335.1 
168730.1 
47395.1 
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AW373062T55662AI299190 BE174210AW579001 H01811 W40186R67100AI923886AW952164AA628440AW898607AW898616 
AA709126 AW898628 AW898544 M947932 AW898625 AW898622 AI276125 AI185720 AW510698 AA987230 T62522 BE467708 AW243400 
AW043642 A!268245 AI186932 D52654 D55017 D52715 D52477 D53933 D54679 AI298739 AI14B984 A1922204 N98343 BE174213 AA845571 
AI813854 AI21451 8 AI635262 AH 39455 Ar707807 AI698085 AW884528 At024768 AI004723 AW087420 AI565133 N94964 A(268939 
AW513280 AI061126 AI435818 AI859106 A1360506 AI024767 AA513019 AA757598 X56196 AA902959 AI334784 A1860794 AA010207 
AW890091 AW513771 AI951391 A1337671 T52499 AA890205 AI640908 H75966AA463487AA358688 AI961767 AIB66295AA780994 
AJ98591 3 BE1741 96 AA029094 AW5921 59 T55581 N79072 A161 1201 AA91 081 2 AI220713 AW149306 A175841 2 M04571 3 R79750 N76096 
AW979121 AA847986 AA829098 

AL1 3391 6 N791 1 3 AF0861 01 N76721 AW950B28AA364013AW955684AI346341 A1B67454 N54784 AI655270 AI421 279 AW01 4882 
AA775552 N62351 N59253 M626243 AI341407 BE175639 AA456968 AI358918 AA457077 
AL109688 R23665 R26578 

AA570256AW014761 AA573721 A1473237 AI022165AA554071 AA1 27551 N90525 AW973623 AA447991 AA243852 BE328850 A1148171 
A1359627AI005068AI3S6567AA232991 AW016855AA906902AA233101 AA1 27550 BE512923 
AI188710AI032142AW078833 N30308 AW675632 AI21 9028 A! 341 201 N22181 H95390 
W24187W24194R17789 

Y10043 NM_005342 L05085 ALD34450 BE61 4226 AW749053 AA3791 73 AA248230 BE514634 AA334622 R70656 AA367593 AA214649 
AA369318 AW957081 R05760 AA039903 AI886597 AW630122 AA906264 AA041527 R01145 A1088688 BE463637 AA398795 AI354883 
AI768938 AI569996 AI452952 AI168582 AI1 89869 AI086670 AW262560 AW61 3854 AA862839 AA435840 AA670197 AI024032 AI990659 
AI990089 NB1095 AA847919 AW960150 AA21 1075 AA044704 M367594 AW582587 AW85B854 AW818630 AW818281 AW818433 AW582595 
AA096002N83992 

AI471630 BE540837 BE265481 AW407710 BE513882 BE546739 AA053597 BE140503 BE218514 AW956702 AI656234 AI6362B3 A1567265 

AW340858 BE207794 AAD53085 R69173 AA292343 AA454908 AA293504 AI659741 AI927478 AA399460 AI760441 AA345416 BE047245 

AA730380 AA394063 AA454833 AI982791 A1567270 AI81 3332 A1767858 AA427705 D20284 AI221 458 BE048537 AI263048 M346417 

AA911497BE537702 

AI638293AW813561 

A1761324 AW880941 AW880937 

AW1 18072 AI631982 T15734 AA224195 AI701458 W20198 F26326 AA890570 N90552 AW071907 AI671352 AI375892 T03517 R88265 
AI124088 AA224388 AI08431 6 A13S4686 T33652 A1140719 A172021 1 T03490 A1372637 T15415 AW205836 AA630384 T03515 T33230 
AA017131 AA443303 T33623 AI222556 T33511 T33785AI419606 D55612 

W52854AL1 17600 BE208116 BE208432 BE206239 BE082291 AW953423 AA351619BE180648 BE1 40560 W60080 AA865478 N90291 
AW450652 AW44951 9 AA993634 AI806539 AA351618 AW449522 A1827626 AA904788 AA380381 AA886045 AA774409 BE003229 Z41756 
AL133619 AA4681 18 AA383064 A1476447 T09430 AI673758 AA524895 AI581345 AI300820 AW498812 AA256162 AI559724 A1685732 
AA602400AA905453AI204595AW166541 AA1 57456 AA1 56269 AA383652 AA431 072 AW592707 A143541 0 AW272464 AI2 1 5594 AA622747 
R74039 N35031 AIB04128AW513621 AA868351 A1026826 A1493388 AA614641 W81604AI557080 AI214351 AA730140 AI125754 AI200813 
AI269603 AI565082 AI807095 AI476629 AA505909 AI368449 AI686077 AI582930 AW085038 AA757863 AA7301 54 AI767072 AA46B31 6 
AI734130A1734138AA426284AA433997AI741241AW043563AI732741 AI732734 AA437369 AA425820 AA664048 R74130 
BE144666 BE1 84942 AW238414 BE164946 
AW993247AW861464 
AA2036B2R11958 

BE550224 AA832519 N45402 AW885857 N29245 BE465409 W07677 AW970089 AI299731 AA482971 BE503548 H18151 W79223 AF086393 
AA461301 W74510 R34182 AI090689 N46003 BE071550 R28075 AW1 34982 AJ240204 AI138906 AW026179 AI572316 BE466182 AI205395 
A1276154AI273269A1422817A1371014AI421274AI188525AA939164BE549810AW137865AI694996BE503841AA459718BE327407 
BE467534 BE218421 BE467767 AA989054 BE467063 AI797130 BE327781 



TABLE 9C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et aL" refers to the publication entitled "The DNA 

sequence of human chromosome 22." Dunham 1. et aL, Nature (1999) 402489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposilion: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400512 


9796593 


Minus 


1439-1616 


400517 


9796686 


Minus 


49996-50346 


400560 


9843598 


Plus 


94182-94323,97056-97243,101095-101236,102824-103005 


400664 


8118496 


Plus 


13556-13721,13942-14090,14554-14679 


400665 


8118496 


Plus 


16879-17023 


400666 


8118496 


Plus 


17982-18115,20297-20456 


400749 


7331445 


Minus 


9162-9293 


400763 


8131616 


Minus 


35537-35784 


401027 


7230983 


Minus 


70407-70554,71060-71160 


401093 


8516137 


Minus 


22335-23166 


401203 


9743387 


Mnus 


172961-173056,173868-173928 


401212 


9858408 


Plus 


87839-88028 


401411 


7799787 


Mnus 


144144-144329 


401435 


8217934 


Minus 


54508-55233 


401464 


6682291 


Minus 


170688-170834 


401714 


6715702 


Plus 


96484-96681 


401747 


9789672 


Minus 


118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,1310 








131932,132451-132575,133580-134011 


401760 


9929699 


Plus 


83126-83250,85320-85540,94719-95287 


401780 


7249190 


Minus 


28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573 


401781 


7249190 


Minus 


83215-83435,83531-83656,8374f>63901 i 84237-84393,84955-85037,86290-868t4 


401785 


7249190 


Minus 


165776-165996,166189-166314,166408.166569.167112-167268,167387-167469,168634-166942 


401797 


6730720 


Pius 


6973-7118 


401961 


4581193 


Minus 


124054-124209 


401985 


2580474 


Plus 


61542-61750 


401994 


4153858 


Mnus 


42904-43124,43211-43336,44607-44763,45199-45281,46337-46732 


402075 


8117407 


Plus 


121907-122035,122804-122921,124019.124161,124455-124610,125672-126076 


402260 


3399665 


Minus 


1 13765-1 13910,1 15653-1 16765,116808-1 16940 


402265 


3287673 


Plus 


21059-21168 


402297 


6598824 


Plus 


35279-35405.35573-35659 


402408 


9796239 


Minus 


110326-110491 



169 



WO 02/086443 



PCI7US02/12476 



402420 
402874 
402802 
402994 
403137 
403306 
403329 
403381 
403478 
403485 
403627 
403715 
404044 
404076 
404101 
404140 
404165 
404185 
404210 
404253 
404287 
404298 
404347 
404440 
404721 
404794 
404854 
404877 
404927 
404996 
405449 
405566 
405572 
405646 
405676 
405770 
405932 
406137 
406360 



9798339 
8077108 
3287156 
2996643 
9211494 
8099945 
8516120 
9438267 
9958258 
9966528 
8569879 



9931752 
8076925 
9843520 



4572584 
5006246 
9367202 
2326514 
9944263 
9838195 
7528051 
9856648 
4826439 
7143420 
1519284 
7342002 
6007890 
7622497 
6006906 
3800891 
4914350 
4557087 
2735037 
7767812 
9166422 
9256107 



406467 9795551 



Plus 

Minus 

Minus 

Minus 

Minus 

Plus 

Plus 

Minus 

Plus 

Pius 

Minus 

Plus 

Minus 

Minus 

Minus 

Pius 

Minus 

Minus 

Pius 

Minus 

Plus 

Minus 

Pius 

Plus 

Minus 

Plus 

Pius 

Pius 

Pius 

Phis 

Pius 

Ptus 

Plus 

Phis 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Plus 



129750-129919 
39290-39502 
53242-53432 
4727-4969 

92349-92572,92958-93084,93579.93712,93949^94072,94591-94748.95214-95337 
127100-127251 



26009-26178 
116458-116564 

2888-3001,3198-353*3655-4117 

23868-24342 

851 28-85292 

225757-225939 

384*3967 

125742-125997 

37701-38147 

69025-69128 

12917M29327 

169926-170121 

55675-56055 

53134-53281 

73591-73723 

74493-74829 

80430-81581 

173763-174294 

101619-101898 

14260-14537 

1095-2107 

68690-69563 

37999-38145,38652-38998,39727-39872,40557-40674,42351-42450 

4223642570 

35912-36065 

85230-85938 

741-969 

73195-73917 

61057-62075 

123525-123713 

30487-31058 

7513-7673 

63448-63554 

182212-182958 



TABl£ 10A: Potential Therapeutic. Diagnostic and Prognostic targets for Therapy of Lung Cancer and Non-maBgnant Lung Disease 

Table 2A shows about 307 genes up-regulated In non-mailgnant lung disease relative to lung tumors and normal body tissues and/or down-regulated in lung tumors relative to 
normal lung and non-maflgnant lung disease. These genes were selected from about 59680 probesets on the Eos/AnVmetrix Hu03 Genechtp array. 

Table 108 show the accession numbers for those Pke/s lacking UnfgeneiD's for table 1QA. For each probeset we have fisted the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubteTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
'Accession* column. 

Table 10C show the genomic positioning for those Pkey's tacking Unigene ID'S and accession numbers In table 1QA. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkey: ( Unique Eos probeset identifier number 

ExAcca' Exemplar Accession number, Genbank accession number 

UnlgenelD: Unigene number 

Unigene Title: Unigene gene 



R1: 
R2: 



Average of lung tumors (Including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of norma) lung samples 

Average of non-malignant lung disease samples Onctuding bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 

R2 
3.10 
159.00 
422.00 
2.33 
123.00 
230.00 
128.00 
149.00 
109.00 
13.14 
227 
173.00 
11.90 
2.09 
4.55 
297 
8.97 
1.65 
264 
145.00 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


404394 




ENSP00000241075:TRRAP PROTEIN. 


0.79 


404916 






Target Exon 


1.00 


405257 






Target Exon 


1.00 


407226 


M25079 


Hs.155376 


hemoglobin, beta 


0.47 


407568 


AA740964 


Hs.62699 


ESTs 


1.00 


408562 


A/436323 


Hs,31141 


Homo sapiens mRNA for KfAAt 568 protein, 


1.00 


409031 


AA376836 


Hs.76728 


ESTs 


1.00 


410434 


AF051152 


Hs.63668 


toiWike receptor 2 


39.65 


410467 


AF102546 


Hs.63931 


dachshund (Drosophila) homotog 


1.00 


410808 


T40326 


Hs.167793 


ESTs 


1.14 


412351 


AL135960 


HS.7382B 


T-ceil acute lymphocytic leukemia 1 


0.37 


412372 


RB5998 


Hs.285243 


hypothetical protein FU 22029 


1.00 


413795 


AL040178 


Hs.142003 


ESTs 


0.10 


414154 


AW205314 


Hs.323060 


ESTs 


0.62 


414214 


049958 


Hs.75819 


glycoprotein M6A 


0.03 


414998 


NM.002543 


Hs.77729 


oxidised low density lipoprotein (lectin 


0.64 


415122 


D60708 


Hs.22245 


ESTs 


0.07 


415765 


NWL005424 


Hs.78824 


tyrosine kinase with Immunoglobulin and 


0.67 


415775 


H00747 


Hs.29792 


ESTs, Weakly similar to 138022 hypotheti 


0.29 


415910 


U20350 


Hs.78913 


chemokine (C-X3-Q receptor 1 


1.00 
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416319 


AI815601 


Hs.79197 


416402 


NM_000715 


Hs.1 01 2 


417355 


D13168 


Hs.82002 


417421 


AL138201 


Hs.82120 


417511 


AL049176 


Hs.82223 


418489 


U76421 


Hs.85302 


418726 


BE241812 


Hs.87860 


418741 


H83265 


Hs.8881 


416883 


BE387036 


Hs.1211 


419086 


NMJJ0Q216 


Hs.89591 


419150 


T29618 


Hs.89640 


419235 


AW470411 


Hs.288433 


419407 


AW410377 


Hs.41502 


420556 


AA278300 


Ha.1 24292 


420656 


AA279098 


Hs.187636 


420729 


AW964897 


Hs.290825 


421177 


AW070211 


Hs.102415 


422060 


R20893 


Hs.325823 


422426 


W79117 


Hs.58559 


422652 


AW967969 


Hs.11 8958 


423099 


NM-002837 


Hs. 123641 


424433 


H04607 


Hs.9218 


424585 


AA464840 


Hs. 131987 


424711 


NM.005795 


Hs.152175 


424973 


X92521 


Hs.154057 


425023 


AW956889 


Hs. 154210 


425664 


AJ006276 


Hs.159003 


425998 


AU076629 


Hs. 165950 


426657 


NlvL.015865 


Hs. 171731 


426753 


T89832 


Hs. 170278 


427558 


D49493 


Hs.2171 


427983 


M17706 


Hs.2233 


42B467 


AK002121 


Hs.184465 


428927 


AA441837 


Hs.90250 


429496 


AA453800 


Hs.192793 


430468 


NM-004673 


Hs.241519 


431385 


BE178536 


Hs.11090 


431728 


NNL007351 


Hs.268107 


431848 


A1378857 


Hs.126758 


432128 


AA1 27221 


Hs.1 17037 


432519 


AI221311 


Hs.130704 


433043 


W57554 


Hs.125019 


433803 


A1823593 


Hs.27688 


434730 


AA644669 


Hs.193042 


435472 


AW972330 


Hs.283022 


436532 


AA721522 




437119 


AI379921 


Hs.177043 


437140 


AA312799 


Hs.283689 


437211 


AA382207 


Hs.5509 


437960 


A1669586 


Hs.222194 


438202 


AW1 69287 


Hs.22588 


438873 


AI302471 


Hs.1 24292 


438875 


AA827640 


Hs.189059 


441048 


AA913488 


Hs.192102 


441188 


AW292830 


Hs.255609 


441499 


AW298235 


H3.101689 


444513 


AL120214 


Hs.7117 


444527 


NM.005408 


Hs.1 1383 


444561 


NM.004469 


Hs.1 1392 


445279 


R41900 


Hs.22245 


446017 


N98238 


Hs.55185 


446984 


AB020722 


Hs.16714 


446998 


N99013 


Hs.16762 


447357 


AI375922 


Hs.159367 


448106 


AI800470 


Hs.171941 


448253 


H25899 


Hs.201591 


449275 


AW45Q848 


Hs.205457 


450400 


AI694722 


Hs.279744 


450696 


AI654223 


Hs.16026 


450726 


AW204600 


Hs.250505 


451497 


H83294 


Hs.284122 


451533 


NMJ104657 


Hs.26530 


453636 


R67837 


Hs.1 69872 


458332 


AI000341 


Hs.220491 


459580 


AA022888 


Hs.176065 


400269 
403421 






407570 


Z19002 


Hs.37096 


412295 


AW088826 


Hs.117176 


414517 


M24461 


Hs.78305 


417204 


N81037 


Hs.1074 


418307 


U70667 


Hs.83974 


418935 


T28499 


Hs.89485 


421502 


AF 111856 


Hs.105039 


421798 


N74880 


Hs.29877 



CD83 antigen (activated B lymphocytes, i 

complement component 4-blndIng protein, 

endothelin receptor type B 

nuclear receptor subfamily 4, group A, m 

chordin-like 

adenosine deaminase, RNA-speclflc, B1 (h 

protein tyrosine phosphatase, non-fecepl 

ESTs. Weakly similar to S41044 chromosom 

acid phosphatase 5, tartrate resistant 

Kallmann syndrome 1 sequence 

TEK tyrosine kinase, endothelial (venous 

rteurotrimln 

hypothetical protein FU21276 

Homo sapiens cDNA; FU23123 fis, done L 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs, Moderately similar to ALU5JWMAN A 
ESTs 

syntaxin 1 1 

protein tyrosine phosphatase, receptor t 

ESTs 

ESTs 

calcitonin receptor-tike 
matrix metalloproteinase 19 
endothelial differentiation, sphingolipi 
transient receptor potential channel 6 
fibroblast growth factor receptor 4 
solute carrier family 14 (urea transport 
ESTs 

growth differentiation factor 10 
colony stimulating factor 3 (granulocyte 
hypothetical protein FU11259 
ESTs 
ESTs 

angiopoieMke 1 

membrane-spanning domains, subfamily A 
multimsrin 

ESTs, Highly similar to AF1 75283 1 zinc 
ESTs 

ESTs, Weakly similar to BCHUIA S-100 pro 
lymphoid nuclear protein (LAF-4) mRNA 
ESTs 
ESTs 

triggering receptor expressed on myeloid 
gb:nv54h12.r1 NCI_CGAP_Ew1 Homo sapiens 
ESTs 

activator of CREM In testis 
eco tropic viral Integration site 2B 
ESTs 
ESTs 

Homo sapiens cDNA: FU23123 fis, clone L 

ESTs 

ESTs 

ESTs 

ESTs 

glutamate receptor, lonotroplc, AMPA 1 
small inducible cytokine subfamily A (Cy 
c-fos induced growth factor (vascular en 
ESTs 
ESTs 

Rho guanine exchange factor (GEF) 15 

Homo sapiens mRNA; cDNA DKFZp564B2062 (f 

ESTs 

ESTs 

ESTs 



ESTs 

hypothetical protein FU23191 

retinolc add receptor, alpha 

Wnt inhibitory factor-1 

serum deprivation response (phosphatidyl 

ESTs 

ESTs 

ESTs 

Eos Control 

NMJ)16369':Homo sapiens claudin 18 (CLDM 
zinc finger protein 145 (Kruppel-fike, e 
poly(A)-bindlng protein, nuclear 1 
surfactant pulmonary-associated protein 
surfactant pulmonary-associated protein 
solute carrier family 21 (prostaglandin 
carbonic an hydras e IV 
solute carrier family 34 (sodium phospha 
N-acyisphlngosine amidohydrolase (acid c 



15.32 

0.64 

0.01 

38.30 

1.00 

0.02 

1.00 

0.44 

0.96 

0.62 

0.03 

1.48 

37.55 

0.80 

1.65 

2.99 

0.46 

1.00 

0.03 

0.14 

0.01 

0.75 

1.00 

0.43 

0.37 

0.14 

1.00 

0.68 

0.03 

1.00 

1.00 

0.75 

0.76 

0.01 

1.00 

1.00 

1.00 

1.00 

0.34 

0.00 

0.01 

1.00 

1.00 

1.05 

0.83 

1.00 

1.00 

0.67 

1.00 

1.00 

1.00 

0.71 

23.32 

0.77 

3.43 

1.00 

1.00 

46.47 

0.01 

0.60 

0.18 

0.10 

0.01 

0.46 

18.05 

1.00 

0,56 

0.88 

0.52 

0.79 

0.35 

0.13 

1.00 

1.00 

0.20 

0.40 

0.53 

0.01 

0.56 

0.64 

0.33 

0.53 

0.20 

0.78 
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4.00 
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357.00 

179.00 
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113.00 

1.90 

Z04 

Z74 

6.90 

5.13 

336.00 

3.65 

8.07 

25.82 

1.95 

156.00 

7.44 

3.62 

3.16 

141.75 

167.00 

3.01 

19.46 

3.35 

94.00 

1.42 

3.74 

141.00 

117.00 

2.20 

2.25 

3.62 

138.00 

132.00 

167.00 

157.00 

Z2A 

1.15 

2.06 

267.00 

105.00 

3.15 

1.94 

218.00 

133.00 

122.67 

142.00 

147.00 

141.00 

3.66 

370.00 

8.50 

16.36 

167.00 

151.00 

153.00 

3.08 

141.00 

239 

2.16 

Z53 

2.64 

296.00 

141.00 

1.38 

4.33 

208 

2.01 

2.03 

2.25 

116.00 

192.00 

2.98 

240 

1.77 

3.18 

1.74 

1.50 

1.16 

1.55 

1.28 

1.90 

1.54 
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423354 


AB011130 


Hs.127436 


423738 


AB002134 


Hs.132195 


425211 


M18667 


Hs.1867 


425438 


T62216 


Hs.270840 


426828 


NM.000020 


Hs.172670 


427019 


AA001732 


Hs.173233 


428043 


T92248 


Hs.2240 


430280 


AA361258 


Hs.237868 


431433 


X65018 


Hs.253495 


431723 


AW058350 


Hs.16762 


432985 


T92363 


Hs.178703 


441835 


AB036432 


Hs.184 


442275 


AW449467 


Hs.54795 


443709 


AI082692 


Hs.134662 


444325 


AW152618 


Ks.16757 


450954 


A1904740 


Hs^5691 


451558 


NMJ3O1089 


Hs.26630 


453310 


X70697 


Hs.553 


456855 


AF035528 


Hs.153863 


444342 


NM_014398 


Hs.10887 
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408714 


AI219304 


Hs.266959 


406753 


AA505665 


Hs.217493 


406973 


M34996 


Hs.198253 


407248 


U82275 


Hs.94498 


407510 


U 96191 




407731 


NM_000068 


Hs.38069 


407830 
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Hs.587 


408045 


AW138959 


Hs.245123 
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R20723 
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AW025430 


H3.155591 
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Hs.15780 
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Hs.73853 
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AA290712 
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Hs.20631 
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AA131466 


Hs.23767 
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AI129238 
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Hs.32241 
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Hs.72116 
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Hs.38163 
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Hs.283091 
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Hs.34950 
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Hs.949 
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Hs.111030 
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W92445 


Hs.165195 


416030 
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NM.000132 
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Hs.79386 
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Hs.80261 
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Hs.293885 
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Hs.82030 
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T87281 


Hs.16355 
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3.72 


1.00 


79.00 


0.14 


27.35 


1.00 


113.00 


0.64 


2.47 


0.55 


£40 


1.00 


111.00 


1.00 


95.00 


1.00 


87.00 


1.43 


8.07 


0.02 


3.07 


0.93 


1.72 


0.97 


1.51 


0.02 


242 


0.65 


1.50 


20.87 


232.00 


1.00 


80.00 


1.00 


85.00 


1.00 


213.00 


0.02 


3.93 


1.00 


115.00 


0.49 


1.94 


0.03 


3.75 


0.86 


1.95 


1.00 


120.00 


0.60 


2.48 


1.00 


95.00 


1.00 


136.00 


0.02 


8.07 


1.00 


73.00 


0.70 


3.36 


0.06 


6.56 


0.70 


3.66 


1.00 


114.00 


0.85 


1.30 


0.15 


15.54 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



418067 


AI127958 


Hs.83393 


cystatin E/M 


416296 


C01566 


Hs.86671 


ESTs 


418643 


J03798 


H3.86948 


small nuclear ribonucteoprolein D1 polyp 


418832 


X04011 


Hs.88974 


cytochrome b-245, beta polypeptide (chro 


418945 


BE246762 


Hs.89499 


arachldonate 5-lipoxygenase 


419261 


X07876 


Hs.89791 


wingless-type MMTV Integration site fami 


419564 


U08989 


Hs.91139 


solute carrier family 1 (neuronal/epithe 


419574 


AK001989 


Hs.91165 


hypothetical protein 


419968 


X04430 


Hs.93913 


interteukln 6 (interferon, beta 2) 


420256 


U84722 


Hs.76206 


cadherin 5, type 2, VE-cadherin (vascula 


420285 


AA258124 


HS.293B78 


ESTs, Moderately similar to ZN91 .HUMAN Z 


420577 


AA278436 


Hs. 186649 


ESTs 


421262 


AA286746 


Hs.9343 


Homo sapiens cDNA FU 14265 fis, clone PL 


421445 


AA913059 


Hs.1 04433 


Homo sapiens, done IMAGE:4054868, mRNA 


421470 


R27496 


HS.137B 


annextn A3 


421478 


AI683243 


Hs.97258 


ESTs, Moderately similar to S29539 ribos 


421563 


KM.006433 


HS.105B06 


granulysln 


421566 


NM.000399 


Hs.1395 


early growth response 2 (Krox-20 (Drosop 


421855 


F06504 


Hs.27384 


ESTs, Moderately similar to ALU4.HUMAN A 


421913 


A1934365 


Hs.109439 


osteoglycln (osteoinductive factor, mime 


421952 


AA30O90O 


Hs.98849 


ESTs, Moderately similar to AF1 61 51 1 1 H 


422232 


D43945 


Hs.1 13274 


transcription factor EC 


422386 


AF105374 


Hs.1 15830 


heparan sulfate (glucosamine) 3-O-sulfot 


423168 


R34385 


Hs.124940 


GTP-binding protein 


423196 


AK001866 


Hs.125139 


hypothetical protein FU11004 


423387 


AJ012074 




vasoactive intestinal peptide receptor 1 


423424 


AF150241 


Hs.128433 


prostaglandin 02 synthase, hematopoietic 


423456 


AL110151 


H8.12B797 


DKFZP586D0824 protein 


423696 


Z92546 




Sushi domain (SCR repeat) containing 


424027 


AW337575 


Hs.201591 


ESTs 


424212 


NM-005814 


Hs.143131 


glycoprotein A33 (transmembrane) 


425087 


R62424 


Hs.126059 


ESTs 


425175 


AF020202 


Ks.155001 


UNC13(C.e)egansHIke 


425771 


BE561776 


Hs.159494 


Bruton agammaglobulinemia tyrosine kinas 


426486 


BE178285 


Hs.1 70056 


Homo sapiens mRNA; cDMA DKFZp586B0220 (f 


427507 


AF240467 


Hs.179152 


toll-like receptor 7 


427618 


NMJJ00760 


Hs.2175 


colony stimulating factor 3 receptor (gr 


427732 


NIUL002980 


H3.2199 


secretin receptor 


427952 


AA765368 


Hs.293941 


ESTs, Moderately similar to A53959 throm 


428709 


BE268717 


Hs.104916 


hypothetical protein FU21940 


428769 


AW207175 


Hs.1 06771 


ESTs 


428780 


AI478578 


Hs.50636 


ESTs 


428833 


AI928355 


Hs.185605 


ESTs 


429657 


D13626 


Hs.2465 


KIAA0001 gene product putative 6-protel 


430212 


AA469153 




gb:nc67f04.s1 NC1_CGAP_Pr1 Homo sapiens 


430226 


BE245562 


Hs.2551 


adrenergic, beta-2-, receptor, surface 


430376 


AW292053 


Hs.12532 


chromosome 1 open reading frame 21 


430414 


AW365665 


Hs.120388 


ESTs 


430656 


AA482900 


Hs.162080 


ESTs 


430843 


AI734149 


Hs.1 19514 


ESTs 


430998 


AF128847 


Hs.204038 


indolethylamine N-methyl transferase 


431217 


NMJ)13427 


Hs.250830 


Rho GTPase activating protein 6 


431921 


N46466 


Hs.58879 


ESTs 


432176 


AW090386 


Hs.112278 


arresUn, beta 1 


432203 


AA305746 


Hs.49 


macrophage scavenger receptor 1 


432231 


AA339977 


Hs.274127 


CLST 11240 protein 


432485 


N90866 


Hs.276770 


CDW52 antigen (CAMPATH-1 antigen) 


432522 


D11466 


Hs.51 


phosphatidyiinosltol glycan, class A (pa 


432596 


AJ224741 


Hs.278461 


matriiin 3 


432850 


X87723 


Hs.3110 


angiotensin receptor 2 


433138 


AB029496 


Hs.59729 


semaphorin sem2 


433563 


AI732637 


Hs.277901 


ESTs 


433568 


A1056672 


Hs.133386 


ESTs 


434445 


AI349306 


Hs.1 1782 


ESTs 


435496 


AW840171 


Hs.265398 


ESTs, Weakly similar to transformation-r 


435974 


U29690 


Hs.37744 


Homo sapiens beta-1 adrenergic receptor 


436061 


A1248584 


Hs.190745 


Homo sapiens cDNA: FU21326 fis, done C 


437157 


BE048860 


Hs.120655 


ESTs 


437207 


T27503 


Hs.1 5929 


hypothetical protein FU12910 


437311 


AA370041 


Hs.9456 


SWUSNF related, matrix associated, acti 


437439 


H29796 


Hs.269622 


ESTs 


438199 


AW016531 


HS.122147 


ESTs 


439551 


W72062 


Hs.11112 


ESTs 


440515 


AJ131245 


Hs.7239 


SEC24 (S. cerevlsiae) related gene fami) 


440887 


AI799488 


Hs.135905 


ESTs 


441025 


AA913880 


Hs.176379 


ESTs 


441384 


AA447849 


Hs.288660 


Homo sapiens cDNA: FU22182 fis, clone H 


441735 


AI738675 


Hs.1 27346 


ESTs 


442200 


AW590572 


Hs.235768 


ESTs 


442832 


AW206560 


Hs .253569 


ESTs 


442957 


AI949952 


Hs.49397 


ESTs 


443282 


T47764 


Hs.132917 


ESTs 


443547 


AW271273 


Hs.23767 


hypothetical protein FU12666 


443951 


F13272 


Hs.111334 


ferritin, light polypeptide 


444330 


AI597655 


Hs.49265 


ESTs 



0.81 


1.74 


1.00 


99.00 


1.00 


60.00 


Z40 


14.74 


0.67 


3.16 


1.00 


73.00 


1,00 


192.00 


1.00 


94,00 


61.16 


500.00 


0.52 


1.70 


1.00 


172.00 


1.00 


97.00 


1.00 


64.00 


0.88 


1.51 


0.05 


11.26 


1.00 


73.00 


0.82 


2.42 


5.50 


31.57 


1.00 


129.00 


1.00 


101.00 


0.60 


63.60 


1.00 


148.00 


1.40 


3.98 


0.34 


3.59 


0.55 


2.00 


0.09 


2.13 


1.00 


141.00 


1.00 


66.00 


0.73 


1.27 


0.54 


2,58 


0.77 


2.47 


1.00 


74.00 


0.85 


1.96 


1.18 


2.56 


1.00 


76.00 


1.00 


63.00 


0.60 


119 


0.97 


1.42 


1.00 


105.00 


1.00 


80.00 


0.09 


2.55 


1.00 


98.00 


1.00 


113.00 


1.00 


52.00 


1.00 


132.00 


0.11 


15.60 


1.00 


103.00 


0.50 


6.96 


1.00 


70.00 


1.00 


90.00 


0.29 


1.84 


1.00 


79.00 


0.91 


1.67 


0.66 


Z63 


1.00 


76.00 


0.46 


1.46 


0.79 


Z25 


1.93 


4.83 


0.04 


5.79 


1.00 


167.00 


0.04 


9.16 


1.00 


91.00 


120.16 


315.00 


0.60 


1.84 


1.00 


128.00 


1.00 


108.00 


1.00 


91.00 


1.00 


87.00 


1.00 


105.00 


1.00 


71.00 


1.00 


115.00 


1.00 


80.00 


0.30 


3.10 


1.00 


77.00 


1.00 


85.00 


1.00 


82.00 


0.79 


1.89 


1.00 


75.00 


0.78 


5.83 


0.03 


10.68 


1.00 


70.00 


1.00 


197.00 


1.00 


253.00 


0.55 


2.09 


1.00 


90.00 
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444515 


AW20490B 


Hb. 169979 


445769 


AI741471 


Hs.23666 


445908 


R13580 


Hs. 13436 


446291 


BE397753 


Hs. 14623 


446917 


Al 347863 


Hs.156672 


447261 


NMJJ06691 


Hs.17917 


447432 


AW958473 


Hs. 301 957 


447482 


AB033059 


Hs.18705 


447997 


H00656 


Hs.29792 


448299 


AA497044 


Hs.20887 


448762 


AL050295 


Hs.22039 


450575 


NMJJQ5859 


Hs.29117 


450584 


AA040403 


Hs.60371 


450693 


AW450461 


Hs.203965 


450715 


Al 266484 


Hs.31570 


451103 


R52804 


Hs.25956 


451220 


AF 124251 


Hs.26054 


451668 


Z43948 


Hs.326444 


452197 


AW023595 


Hs.232048 


462331 


AA598509 


Hs.29117 


452353 


CI8825 


Hs.29191 


453049 


BE537217 


Hs.30343 


453107 


NM.016113 


Hs.279746 


453355 


AW295374 


Hs.31412 


453390 


AA862496 


Hs.28482 


453531 


AA417940 




454741 


BE154396 




456579 


AA287827 


Hs.284205 


456672 


AK002016 


Hs.114727 


457400 


AF032906 


Hs.252549 


457718 


F18572 


Hs.22978 


459696 


F03027 





esTs 

ESTs 

Homo sapiens clone 24425 mRNA sequence 
interferon, gamma-tnducible protein 30 
ESTs 

extracellular link domain-containing 1 
nudlx (nucleoside diphosphate linked mol 
WAA1233 protein 

ESTe, Weakly similar to 138022 hypotheti 
hypothetical protein FLM0392 
WAA0758 protein 

purine-rich element binding protein A 

ESTs 

ESTs 

ESTs, Weakly similar to KIAA1324 protein 
DKF2P564D206 protein 
novel SH2-containing protein 3 
cartilage acidic protein 1 
ESTs 

purtne-ricti element binding protein A 
epithelial membrane protein 2 
ESTs 

vanHioid receptor-tike protein 1 

Homo sapiens cDNA FU11422 fis, done HE 

ESTs 

ESTs, WeaWy similar to JC5795 CDEP prot 
gb:CM2-HT0342-091299-05O-b05 HT0342 Homo 
up-regulated by BCG-CWS 
Homo sapiens, clone M6C:16327, mRNA, com 
cathepsin Z 

ESTs, Weakly similar to ALU4_HUMAN ALU S 
gb:HSC1KA072 normalized Infant brain cDN 



TABLE 10B 



Pkey. Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



1.00 


64.00 


ao2 


4.38 


1.00 


97.00 


0.93 


1.69 


1.00 


106.00 


0.40 


47.20 


1.00 


100.00 


0.05 


8.21 


0.02 


5.42 


1.00 


79.00 


0.42 


1.56 


0.17 


11.33 


1.00 


94.00 


1.00 


91.00 


1.00 


152.00 


1.00 


86.00 


0.60 


1.30 


0.54 


1.91 


1.00 


67.00 


4.53 


11.07 


0.72 


2.24 


1.00 


68.00 


0.83 


1.70 


1.00 


132.00 


1.00 


72.00 


1.00 


68.00 


0.57 


2.89 


1.00 


82.00 


0.79 


1.96 


1.03 


3.25 


1.00 


113.00 


1.00 


544.00 



Pkey 
408074 

411667 
413533 

423387 



423696 

430212 
436532 
453531 
454741 



CAT Number 
103684J 

1253334J 
1375344J 

22779.1 



23112J 

31 4437 J 
421802J 
97026J 
1232559J 



Accession 

R20723 AA263003 AA333976 AA334725 AA334151 AW965490 AA310513 AI810530 D31302 AW134897 AA830127 AA046953 AI668930 
C06094AW104534 

BE1601 98 AW935898 T1 1 520 AW935930 AW856073 AW861034 

BE146973 BE146972 BE147042 BE147018 BE146783 BE147020 BE146781 BE147019 BE146766 BE147021 BE146952 BE146767 BE147044 
BE146797 BE146776 BE146985 BE146793 BE146768 BE146771 BE146954 BE146760 BE147048 BE147025 BE147030 
AJ01 2074 U1 1087 L13288 X75299 L20295 AW630780 H14880 T28037 A1872991 R72136 AW449839 T81622 T79697 T29519 R94105 T83923 
R73300 AI797007 R73390 AA961010 H74168 A1689932 BE045543 AI808418 AI608912 AI806573 AWB84084 AW872978 AW872985 AA565655 
AI02291 5 R50647 R7321 0 H45098 R46451 AW1 66269 T71 132 AI264547 R52146 AI304920 R73391 AW884059 AW884085 H73241 T6003B 
T79612 R731 45 R50549 A1094557 A1668793 R72302 A1564386 W01956 AA418962 W32571 R72840 H45409 R72085 R46356 R46758 
AA508805 AA418798 T83751 R94072 T16182 AA928785 AA903896 

Z92546 AA3305B6 AI570568 AW341487 AI827050 AW298668 AI792189 AI015693 A1733599 AI572251 A167248B AW193262 A1244716 

A1864375 AI206100 AA912444 AI269365 A1640254 AW772466 AI867336 AA627604 H1 6914 AA358477 AA338009 

M469153 AI718503 AA469225 

AA721522 AW975443 T93070 

AA417940 AA036735 T07025 

BE154396AW817959BE154393 



TABLE 10C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. "Dunham L et al." refers to the publication entitled The DNA 

sequence of human chromosome 22/ Dunham I. et al., Nature (1999) 402:489495. 
Strand: Indicates DMA strand from which exons were predicted. 
NLposltion: Indicates nucleotide positions of predicted exons. 



Pkey 

400754 

401045 

401083 

402474 

402808 

403021 

403421 

403438 

403687 

403764 

404277 

404288 

404394 

404518 

404916 

405106 

405257 

405381 



Ref 

7331445 
8117619 
3242744 
7547175 
6456148 
7547270 
9665041 
9719679 
7387384 
7717105 
1834458 
2769644 
3135305 
8151988 
7341826 
8079395 
7329310 
6006920 



Strand 

Plus 

Plus 

Plus 

Minus 

Minus 

Rus 

Minus 

Rus 

Plus 

Minus 

Minus 

Rus 

Minus 

Rus 

Plus 

Minus 

Rus 



NLpositfon 
144559-144684 
90044-90184,91111-91345 
33192-33360 

53526-53628,55755-55920,6753(^57757 

114964-115136,115461-115585,115931-116047,117666-117771,118004-118102 
120799-120966 

126509-126773,139986-140205 

90792-90938 

9009-9534 

118692-118853 

91665-91946 

3812-3691 

37121-37205,37491-37762,4105341140,41322-41593,4177341919 

84494-84603 

91057-91188 

80877-81418 

73121-73273 

7636-8054 



174 



WO 02/086443 PCT/US02/12476 
406387 9256160 Plus 116229-116371,117512-117651 



175 



WO 02/086443 

TABLE 1 1 A: Genes Distinguishing Adenocarcinoma from Other Lung Diseases and Normal Lung 
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Table 11A shows about 84 genes upregulated to lung adenocarcinomas relative to other lung tumors, non-malignant lung disease, and normal lung. These genes were selected 
from about 59880 probesets on the Eos/Affymetrix Hu03 Genechlp array. 

Table 1 1 B show the accession numbers for those Pkey's lacking UnigenelD's for table 1 1 A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwtet, Oakland California). The Genbank accession numbers for sequences comprising each cluster are Dated m the 
"Accession' column. 

Table 1 1 C show the genomic positioning for those Pkey's lacking Uragene ID'S and accession numbers in table 1 1 A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkey: 
ExAccn: 
UnigenelD: 
Unigene Title: 
R1: 

R2: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unlgene number 

Unigene gene title 

Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of norma! lung samples 

Average of non-maBgnant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 

Unlgene Title 
Target Exon 

NM_0031 22*^01110 sapiens serine p 
cardnoembryonic antigen-related celf ad 
hypothetical protein FU13612 
heparan sulfate (glucosamine) 3-0-suIfot 
serine/threonine kinase 15 
XAGE-1 protein 

carbamoyl-phosphate synthetase 1, mitoch 
hypothetical protein FU13352 
ESTs 

Homo sapiens cDNAFU 14035 lis, done HE 
synuclein, gamma (breast cancer-specific 
cytidine deaminase 

arylsuffatase E (chondrodysplasia puncta 
amiloride binding protein 1 (amine codda 
ESTs, Weakly similar to MUC2.HUMAN MUCIN 
progestagen-associated endometrial prote 
cytochrome P450, subfamily XXIV (vitamin 
fibrinogen, A alpha polypeptide 
popeye protein 3 

small Inducible cytokine S subfamily (Cy 
lysyl oxidase 

mucin 5, subtype B, tracheobronchial 
solute carrier family 1 (glutamate trans 
GalNAc a)pha~2, 6-sialyttransferase ), ) 
trefoil factor 1 (breast cancer, estroge 
trinucleotide repeat containing 9 
hypothetical protein FU22704 
cytokine receptor-like factor 1 
cartilage olfgomeric matrix protein (pse 
breast carcinoma amplified sequence 1 
glutamlne-fructose-S-phosphaie transamln 
lengsln 

dopa decarboxylase (aromatic L- amino acl 
NIMA (never in mitosis gene a}-related k 
5* nucleotidase (CD73) 
KIAA0479 protein 
protease, serine, 1 (trypsin 1) 
nuclear autoanligenlc sperm protein (his 
KIAA0403 protein 
hypothetical protein FU14303 
dual specificity phosphatase 4 
ATP-blndlng cassette, sub-family A (ABC1 
LUNX protein; PLUNC (palate lung and nas 
ESTs 

ESTs, Weakly similar to 178885 serme/th 
novel protein 

J domain containing protein 1 
Novel human gene mapping to chomosome 20 
S100 calcium-binding protein P 
UDP-N-acetyl-aIpha-r>galactosamlne:pofyp 
calclunVcalmodulln-dependent protein kin 
ESTs 

serine (or cysteine) proteinase tnhibito 
Homo sapiens cDNA: FU23523 fis, done L 
ESTs 

fibrlnogen-Jike 1 

lectin, galactoslde-blnding, soluble, 4 
ctaudin 3 

metallothionein 1E (functional) 
mucin 13, epithelial transmembrane 
Williams- Beuren syndrome chromosome regi 
heterochromatin-like protein 1 



Pkey 


ExAccn 


UnigenelD 


403329 






406399 






406690 


M29540 


Hs.220529 


407869 


M827976 


H5.2439V 


407881 


AW072003 


Hs.40968 


408908 


BE296227 


Hs.250822 


409103 


AF251237 


Hs.112208 


409187 


AF154830 


Hs.50966 


409269 


M576953 


Hs.22972 


410076 


T05387 


Hs.7991 


410102 


AW248508 


Hs.279727 


410399 


8E068889 




411908 


L27943 


Hs.72924 


412612 


NM.000047 


Hs.74131 


414075 


U11862 


Hs.75741 


416208 


AW291168 


Hs.41295 


417542 


J04129 


Hs.82289 


419183 


U60669 


Hs.89663 


419502 


AU076704 




419631 


AW188117 


Hs.303154 


420931 


AF044197 


Hs.100431 


421155 


H87879 


Hs.102267 


421190 


U95031 


Hs.102482 


421474 


U76362 


Hs.104637 


421515 


Y11339 


Hs.105352 


421582 


AI910275 




422026 


UB0736 


Hs.1 10826 


422095 


AI868872 


Hs.282804 


422311 


AF073515 


Hs.1 14948 


422867 


L32137 


Hs.1584 


423472 


AF041260 


Hs.129057 


423554 


M90516 


Hs.1674 


424502 


AF242388 


Hs.149585 


424544 


MB8700 


Hs.150403 


424905 


NMJJ02497 


Hs.153704 


424960 


BE245380 


Hs.153952 


425523 


AB007948 


Hs.158244 


426230 


AA367019 


Hs.241395 


427701 


M411101 


HSJ243886 


428585 


AB007863 


Hs.185140 


428758 


AA433988 


Hs.98502 


429170 


NJVL001394 


Hs.2359 


429263 


AA019004 


Hs.198396 


429610 


AB024937 


Hs.211092 


430508 


AI015435 


Hs.104637 


430985 


AA490232 


Hs.27323 


431548 


AI834273 


Hs.9711 


431566 


AF176012 


Hs.260720 


431986 


AA536130 


Hs.149018 


432375 


BE536069 


Hs.2962 


432677 


NM_004482 


Hs.278611 


433558 


W56321 


Hs.1 11460 


433819 


AW511097 


Hs.112765 


434001 


AW950905 


Hs.3697 


434424 


A1811202 


Hs.325335 


434792 


AA649253 


Hs.132458 


436217 


T53925 


Hs.107 


436749 


AA5B4890 


Hs.53Q2 


436972 


AA284679 


Hs.25640 


437866 


AA156781 




437935 


AW939591 


Hs.5940 


438915 


AA280174 


Hs.285681 


439451 


AF086270 


Hs.278554 



R1 


R2 


1.00 


61.00 


1.00 


39.00 


226.37 


350.00 


0.77 


1.18 


1.00 


10.00 


7.76 


1.00 


60.44 


40.00 


1.00 


1.00 


1.00 


1.00 


1.12 


1.50 


9.89 


1.00 


0.92 


1.06 


1.00 


1.00 


1.02 


1.03 


0.84 


1.07 


3.67 


1.00 


1.28 


1.35 


1.00 


1.00 


13.05 


115.00 


1.00 


13.00 


1.00 


8.00 


1.00 


15.00 


1.17 


1.55 


1.46 


1.76 


1.00 


3.00 


1.23 


1.00 


1.00 


52.00 


4.37 


Z34 


1.16 


1.78 


1.69 


3.17 


48.13 


72.00 


1.00 


60.00 


1.00 


1.00 


1.00 


59.00 


21.35 


1.00 


1.00 


1.00 


1.00 


35.00 


1.00 


83.00 


7.41 


34.00 


1.00 


6.00 


1.06 


1.13 


16.18 


105.00 


1.07 


1.00 


1.59 


1.69 


4.75 


7.27 


0.94 


1.28 


5.66 


15.00 


49.76 


37.00 


1.19 


1.47 


1.65 


1.06 


1.00 


48.00 


1.00 


19.00 


3.71 


8.00 


29.31 


72.00 


1.00 


64.00 


8.52 


44.00 


57.97 


31.00 


1.10 


1.41 


1.59 


1.46 


3.62 


101.00 


1.60 


1.39 


1.00 


1.00 


23.28 


52.00 
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439759 


AL359055 


Hs.67709 


Homo sapiens mRNA full length insert cDN 


1.00 


21.00 


441031 


All 10664 


Hs.7645 


fibrinogen, B beta polypeptide 


1.41 


99.00 


441377 


BE218239 


Hs.202656 


ESTs 


22.03 


1.00 


443614 


AV655386 


Hs.7645 


fibrinogen, B beta polypeptide 


1.00 


16.00 


443813 


AA876372 


Hs.93961 


Homo sapiens mRMA; cDNA OKFZp667O095 (fr 


1.20 


1.99 


443991 


NM 002250 


Hs.10082 


potassium intermedlaie/sman conductance 


5.71 


6.87 


444670 


H58373 


Hs.332938 


hypothetical protein MGC5370 


1.98 


38.00 


444931 


AV652066 


Hs.75113 


general transcription factor IHA 


1.00 


54.00 


446102 


AW168067 


Hs.317694 


ESTs 


1.00 


1.00 


446163 


AA026B80 


Hs.25252 


Homo sapiens cDNA FU 13603 fis, clone PL 


1.00 


36.00 


446469 


BE094848 


Hs.15113 


homogentisate 1,2-dioxygenase (homogentl 


1.00 


11.00 


44738B 


AW630534 


Hs.76277 


Homo sapiens, clone MGC'9381, mRNA, comp 


1.24 


1.16 


447532 


AK000614 


Hs.18791 


hypotheticai protein FU20607 


1.23 


1.63 


446243 


AW369771 


Hs.52620 


integrln, beta 8 


15.84 


1.00 


448844 


A1581519 


Hs.177164 


ESTs 


1.00 


31.00 


449444 


AW818436 


Hs.23590 


solute carrier family 16 (monocarboxytic 


1.00 


83.00 


451807 


W52854 




hypothetical protein FU23293 similar to 


1.55 


35.00 


452689 


F33868 


Hs.284176 


transferrin 


1.54 


1.44 


453392 


U23762 


Hs.32964 


SRY (sex determining region Y)-box 1 1 


• 1.00 


16.00 


453464 


A1884911 


Hs.32989 


receptor (calcitonin) activity modifying 


1.55 


2.45 


453735 


A1066629 


Hs. 125073 


ESTs 


1.01 


1.30 



TABLE 11B 

Pkey: Unique Eos probeset Identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey GAT Number Accession 

410399 11995 1 BE068889 BE068882 AF044311 AF01 7256 NM.003087 AF037207 AF0101 26 AA633976 AA872836 BE298825 BE299889 A101 6464 A1684600 

A1936527 AA804675 AA394097 AI139933 AA946606 BE171 313 AA722407 AA293803 AI468480 AA056035 AA055968 AW796957 AI637713 
AM10737H49348AA486472M411094AA235594AA4M624M443638AW452137AA421708AVV265211AI493266M3 

419502 18535 1 AU076704 T74854 T74860T72098 T73265 T73873T69180n4658T58786T60385T73410T68781 T67845 T87593 T73952 T67864 T60830 

T68367 T68401 T53959 T72360 T72099 T60377 T58961 T71712 T72821 T64738 T74645 T72037 T68688 T72063 T73258 T72826 T64242 
T68220 T74673 T71800 T68355 T61227 T62738 T69317 T53850 T64892 T73768 T73962 T73382 T68914 T70975 T73400 T60631 T73277 
T73203 T70498 T61409 T5B925 NMJ3Q0508 M64982 T68301 T73729 T69445 T60424 T67922 T67736 T68716 T67755 T74765 T73819 T58719 
T74756 T60477 T74863 T61 1 09 T68329 T58850 T71857 T73425 T53736 T6B607 T5B89B T64309 T72031 T72079 T64305 T71908 T68107 
T71916 T73787 T56035 T64425 T71870 T60476 T61376 T67820 T71895 T41006 T69441 T68170 T74617 ni958 T69440 T61875 R06796 
H48353 T71914 T53939 T641 21 AA693996 T72525 T67779 T68078 AA01 1465 AA345378 AV654847 AV654272 AV656001 AI064740 T82B97 
N33594 AA344542 AW805054 AI207457 T61 743 AA026737 H94389 AA382695 AA918409 T68044 S82092 T39959 AI01 7721 AA31 2395 
AA312919 T40156 H66239 AV652989 H3872B R98521 AV655200 R95790 W03250 W00913 AA344136 AV660126 R97923 AA343596 
AW470774AV651256 N54417AA812862AW182929AI111192H61463H72060AA344503 H38639AI277511 AV661108AI207625T47810 
AA235252 T27853 T47778 R95746 H70620 AA701 463 AW827166 R98475 C20925 AV657287 T71959 T71 31 3 T73920 T73333 T61 618 T69293 
T692B3 T73931 T72178 T72456 AV645639 AV653476 T72957 T72300 T5B906 T71457 T70494 T72956 T70495 T68267 T74407 T85778 
AA344726 T27854 T74485 T74101 T73868 T71518 T72304 AA343853 T73909 T68070 T72065 H72149 T73493 T73495 AV645993 R02293 
T70475 T64751 AA344441 AA343657 AA345732AA344328 AM 10639 AA344603 AF063513T64696 T68516T72223 T60507T67633 R29500 
T72517 R02292 T60599 T69206 T70452 T74677 R29366 T61 277 T74914 T60352 R29675 T74843 AV645792 AA344408 T691 97 T72057 
T69368 T69358 T68256 AV650429 T73341 T61702 T74598 T40095 K02272 T40106 AA343045 AA341908 AA341907 AA342807 AA341964 
T53747 T72042 T62764 AI064899 AA343080 T67832 T72440 T71770 T68091 T69108 T72449 T69167 T71289 T6B251 AV654844 T64375 
AA345234 T67598 AA01 1414 T66036 H48262 A1207557 T68219 W86031 T69081 T64232 R93198 T62136 AV650539 H67459 T72978 
AA344583 T60362 H58121 T95711 T72803 T68055T71715 R29036T72793T69122T64595T62888 T69139T68291 T64652T67971 T46862 
AA693592 AI248502 R29454 T647B4 T57001 T73052 T71429 T51 1 76 T58866 AV65541 4 H90426 AA342489 773666 T67848 T72512 T53835 
T67B37 T73317 T74273 T69420 T68245 T74380 T67862 T74474 T56068 

421582 2041 1 AI910275 X00474 X52003 X05030 NM 003225 AA314326AA308400AA506787 AA314825 AI571948AA507595AA614579AA587613RB3B18 

AA568312AA614409AA307578AJ925552AW950155AI910083 M12075 BE074052 AW004668 AA578674 AA582084 BE074053 BE074126 
BE074140 AA514776 AA588034 BE074051 BE074Q68 AW009769 AW050690 AA858276 R55389AI001051 AW050700AW750216 AA614539 
BE074045 AI307407 AW602303 BE073575 A1202532 AA524242 AI970839 AI909751 BE076078 A1909749 R55292 

437866 44433 2 AA156781 AW293839 U52054AA024963AA778446 BE073977 AW444904 AW602574 BE 164040 BE164012 BE163972 BE163974 BE163992 

AA837481 AW468444 BE165091 AW468002 AA687333 AA81 1830 AA581606 AI866686 AI572124 AA043777 AA040926 D20160 AI536733 
AA812489 AW874142 AI471883 W84421 AA166850 

451807 8865.1 W52854 AL1 17600 BE2081 16 BE208432 BE205239 BE082291 AW953423 AA351619 BE1 80648 BE140560 W60080 AA865478 N90291 

AW450652 AW449519AA993634 AI806539 AA351618 AW449522 AI827626 AA904788 AA380381 AA886045 AA774409 BE003229 241756 



TABLE 11C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GQ numbers. "Dunham I. et a)." refers to the publication entitled "The DNA 

sequence of human chromosome 21* Dunham I. et a!., Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted axons. 

Pkey Ref Strand NLposition 

403329 8516120 Plus 96450-96598 

406399 9255288 Minus 63448-63554 
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TABLE 12A: Genes Distinguishing Squamous Cell Carcinoma from Other Lung Diseases and Norma) Lung 

Table 12A shows about 72 genes upregulated In squamous cell carcinomas of the lung relative to other lung tumors, non-malignant lung disease, and normal lung. These genes 
were selected from about SS680 probesets on the Eos/Affymetrix Hu03 Genechlp array. 

Table 12B show the accession numbers for those Pke/s lacking UnigenelD's for table 12A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene dusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DouWeTwist, Oakland California). The Genbank accession numbers for sequences comprising each duster are listed in the 
•Accession* column. 

Table 1 2C show the genomic positioning for those Pke/s lacking Unlgene ID'S and accession numbers In table 12A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unlgene number 

Unlgene Title: Unlgene gene title 



R1: 

R2: 

Pkey 

400289 

400666 

401780 

401781 

401785 

401994 

402075 

404996 

407839 

40BO0O 

408522 

410561 

415091 

415817 

416658 

417034 

417366 

418663 

418678 

419121 

420783 

421773 

421948 

421978 

422158 

422440 

423634 

423725 

423738 

424012 

424046 

424098 

424834 

425650 

427099 

427335 

428182 

428645 

428748 

429259 

429538 

429903 

430486 

430890 

431009 

431846 

433091 

434360 

434880 

435505 

435793 

436511 

438403 

439285 

439606 

439670 

439706 

440325 

441525 

443162 

444378 



Average of lung tumors (indudlng squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of norma) lung samples 

Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 

ExAccn UnigenelD Unigene Title 

X07820 Hs.2258 matrix metalloproteinase 10 (stromelysln 

NM_002425:Homo sapiens matrix metallopro 

NMJ)05557*:Homo sapiens keratin 16 (foca 

Target Exon 

NMJ)02275*:Homo sapiens keratin 15 (KRT1 
Target Exon 

ENSP00000251056*:Plasma membrane calcium 
Target Exon 

AA045144 Hs.161566 ESTs 

L1 1690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 

AI541 21 4 Hs.4632 0 Small pro tine-rich protein SPRK [human, 

BE540255 Hs.6994 Homo sapiens cDNA: FL) 22044 fis, done H 

AL044872 Hs.77910 3-hydroxy-3-methylglutaryl-Coenzyme A sy 

U88967 Hs.78867 protein tyrosine phosphatase, receptor-t 

U03272 Hs.79432 fibrillin 2 (congenital contracture] ara 

NM.006183 Hs.80962 neurotensin 

BE185289 Hs.1076 small praline-rich protein 1B (comifin) 

AK001100 Hs.41690 desmocolDn3 

NM.001327 Hs.87225 cancer/tesfls antigen 

AA374372 Hs.89626 parathyroid hormone-like hormone 

AI659838 Hs.99923 lectin, gatactosi de-binding, soluble, 7 

W69233 Hs.1 12457 ESTs 

L42583 Hs.334309 keratin 6A 

AJ243662 Hs.110196 NICE-1 protein 

L10343 Hs.1 12341 protease inhibitor 3, skin-derived (SKAL 

NM.00481 2 Hs.1 16724 aldo-keto reductase family 1, member B1 0 

AW959908 Hs.1 690 heparin-binding growth factor binding pr 

AJ403108 Hs.132127 hypothetical protein LOC57822 

AB002134 Hs.1 32195 airway trypsln-llke protease 

AW368377 Hs.1 37569 tumor protein 63 kDa with strong homolog 

AF027866 Hs. 1 38202 serine (or cysteine) proteinase Inhibit© 

AF077374 Hs.1 39322 small praline-rich protein 3 

AK001432 Hs.1 53408 Homo sapiens cDNA FU 10570 fis, done NT 

NM.001 944 Hs.1 925 desmogleln 3 (pemphigus vulgaris antigen 

AB032953 Hs. 1 73560 odd Oz/terwn homolog 2 (Drosophila, mous 

AA448542 Hs.251677 G antigen 7B 

BE386042 Hs.293317 ESTs, Weakly similar to GGC INHUMAN G ANT 

AA431400 Hs.98729 ESTs, Weakly similar to 201 7205A dihydro 

AW593206 Hs.98785 Ksp37 protein 

AA420450 Hs.292911 ESTs, Highly similar to S 6071 2 band-6-pr 

BE182592 Hs.1 1261 small praline-rich protein 2A 

AL1 34197 Hs.93597 cyclln-dependent kinase 5, regulatory su 

BE062109 Hs.241551 chloride channel, calcium activated, fam 

X54232 Hs.2699 gtyplcan 1 

BE1 49762 Hs.48956 gap {unction protein, beta 6 (connexln 3 

BE019924 Hs.271580 uroplakln 1B 

Y12642 Hs.3185 lymphocyte antigen 6 complex, focus D 

AW015415 Hs.127780 ESTs 

U02388 Hs.1 01 cytochrome P450, subfBmily IVF, polypept 

AF200492 Hs.211238 lnterleukin-1 homdog 1 

AB037734 Hs.4993 KIAA1 31 3 protein 

AA721252 Hs.291502 ESTs 

AA806607 Hs.292206 ESTs 

AL133916 hypothetical protein FU20093 

W79123 Hs.58561 G protein-coupled receptor 87 

AF088076 Hs.59507 ESTs, Weakly similar to AC004856 3 U1 sm 

AW872527 Hs.59761 ESTs, Weakly similar to DAP1JWMAN DEATH 

NM_00381 2 Hs.71 64 a dlsmtegnn and metalloproteinase doma 

AW241867 Hs.127728 ESTs 

T49951 Hs.9029 DKFZP434G032 protein 

R41339 Hs.12569 ESTs 



R1 


R2 


132.45 


4.00 


3.26 


3.22 


26.47 


10.60 


10.33 


4.61 


4.13 


2.70 


61.84 


47.00 


1.00 


1.00 


1.00 


1.00 


173.91 


108.00 


151.17 


8.00 


1.98 


1.24 


10.04 


1.00 


1.00 


30.00 


24.30 


1.00 


53.29 


51.00 


1.00 


1.00 


8.97 


3.27 


112.17 


19.00 


1.18 


1.10 


1.00 


1.00 


3.04 


1.25 


1.12 


1.14 


51.83 


20.25 


1,01 


0.91 


2.37 


1.10 


47.53 


32,00 


76.02 


1.00 


4.20 


1.00 


10.14 


51.00 


233.42 


68.00 


1.00 


1.00 


137.82 


54.00 


56.19 


12.00 


33.45 


1.00 


4.24 


17.00 


51.83 


4.00 


1.00 


1.00 


1.00 


16.00 


1.00 


87.00 


2.01 


1.18 


4.43 


290 


11.80 


1.00 


12.28 


41.00 


1.58 


1.40 


60.25 


28.00 


4.49 


2.51 


1.20 


1.09 


40.98 


27.00 


1.00 


1.00 


1.00 


38.00 


23.68 


42.00 


16.76 


14.00 


1.00 


1.00 


46.23 


139.00 


33.61 


1.00 


1.00 


1.00 


86.55 


11.00 


62.88 


147.00 


1.53 


1.42 


31.11 


38.00 


1.00 


1.00 



178 



WO 02/086443 




446292 


AF081497 


Hs.279682 


Rh type C glycoprotein 


447078 


AW885727 


Hs.9914 


ESTs 


447342 


A1199268 


Hs.19322 


Homo sapiens, Similar to RIKEN cDNA 2010 


449003 


X76342 


Hs.389 


aiconoi asnyorogenase / iciass i vj, mu o 


449101 


AA205847 


Hs.23016 


G protein-coupled receptor 


450832 


AW970602 


Hs.105421 


ESTs 


452240 


A1591147 


Hs.61232 


ESTs 


453317 


NMJJ02277 


Hs.41698 


keratin, hair, acioic.1 


453630 


AA534296 


Hs.20953 


ESTs 


454098 


W27953 


Hs.292911 


ESTs, Highly similar to S60712 band-6-pr 


455601 


A1368680 


Hs.816 


SRY (sex determining region Y)-box 2 



TABLE 12B 

Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



PCTAJS02/12476 



1.55 


1.26 


47.24 


24.00 


28.63 


1.00 


1.00 


1.00 


2.58 


27.00 


25.17 


36.00 


13.42 


1.00 


1.19 


1.27 


24.92 


25.00 


1.26 


1.11 


206.11 


1.00 



439285 47085.1 AL133916 N79113 AF086101 N76721 AW950828 AA364013 AW955684A1346341 AJ867454 N54784 AI656270AI421 279 AW01 4882 

AA775552 N62351 N59253AA626243AI341407 BE175639 AA456968AI358918AA457077 



TABLE 12C 



Pkey: Unique number corresponding to an Eos probeset JirrL Ml4 

Ref: Sequence source. The 7 digH numbers in this column are Genbank Identifier (Gt) numbers. 'Dunham t. et a!.* refers to the publication entitled The ONA 

sequence of human chromosome 22.' Dunham I. et a)., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted 
NLposition: Indicates nucleotide positions of predicted exons. 

reposition 

17982-18115,20297-20456 

28397-28617,28920-29045,29135.29298,29411-29567,29705-29787,30224-30573 
83215-83435,83531-83656,8374(W3901,84237-84393,84955-85037,86290-86814 
165776.165996,166189-1663l4,16640M 66569,167112-167268,167387-167469 t 168634-16B942 
4290443124,43211-43336,44607-44763,4519^5281,46337^6732 
121907-122035,122804-122921.124019-124161,124455.124610,125672-126076 
37999-38145.38652-38998,39727-39872,40557-40674,4235142450 



Pkey 


Ref 


Strand 


400666 


8118496 


Plus 


401780 


7249190 


Minus 


401781 


7249190 


Minus 


401785 


7249190 


Minus 


401994 


4153858 


Minus 


402075 


8117407 


Plus 


404996 


6007890 


Plus 



179 



WO 02/086443 PCT/US02/12476 

TABLE 13A: Genes Distinguishing Non-Malignant Lung Disease from Lung Tumors and Normal lung 

Table 13A shows about 23 genes unregulated in non-malignant lung disease relative to lung tumors and normal lung. These genes were selected from about 59680 probesets on 
the Eos/Affymetrix Hu03 Genechip array. 

Table 13B show the accession numbers for those Pke/s lacking UnigenelD's for table 13A. For each probeset we have listed the gene duster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession* column. 

Table 13C show the genomic positioning for those Pke/s lacking Unigene ID'S and accession numbers in table 13A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 ; Average of lung tumors (including squamous ceil carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of normal lung samples 

R2; Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal tung samples 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


408562 


AI436323 


Hs.31141 


Homo sapiens mRNA for KIAA1568 protein, 


1.00 


230.00 


409031 


AA376836 


Hs.76728 


ESTs 


1.00 


128.00 


412372 


R65998 


Hs.285243 


hypothetical protein FU22029 


1.00 


173.00 


415910 


U20350 


Hs.78913 


chemokine (C-X3-C) receptor 1 


1.00 


145.00 


417511 


AL049176 


Hs.82223 


chordirvlike 


1.00 


179.00 


418819 


AA228776 


Hs.191721 


ESTs 


1.00 


140.00 


422060 


R20893 


Hs.325623 


ESTs. Moderately similar to ALU5JWMAN A 


1.00 


156.00 


424585 


AA464840 


Hs.131987 


ESTs 


1.00 


167.00 


426753 


T89832 


Hs.170278 


ESTs 


1.00 


141.00 


429496 


AA453800 


Hs.1 92793 


ESTs 


1.00 


138.00 


430719 


AA488988 


Hs.293798 


ESTs 


1,00 


133.00 


431089 


BE041395 




ESTs, Weakly similar to unknown protein 


23.32 


941.00 


431385 


BE178536 


Hs.11090 


membrane-spanning 4-domains, subfamily A 


1.00 


157.00 


431728 


NM_007351 


Hs.268107 


multimerln 


1.00 


157.00 


436532 


AA721522 




gb:nv54h12.M NCLCGAP_Ew1 Homo sapiens 


1.00 


218.00 


437960 


AI669586 


Hs.222194 


ESTs 


1.00 


147.00 


438202 


AW1 69287 


Hs.22588 


ESTs 


1.00 


141.00 


441499 


AW298235 


Hs.101689 


ESTs 


1.00 


167.00 


444513 


AL120214 


Hs.7117 


glutamate receptor, tonotropic, AMPA 1 


1.00 


151.00 


448253 


H25899 


Hs.201591 


ESTs 


1.00 


141.00 


453636 


R67837 


Hs.169872 


ESTs 


1.00 


116.00 


458332 


A1000341 


Hs.220491 


ESTs 


1.00 


192.00 


459587 


AA031956 




gb:zk15e04.s1 Soaresjregnantuterus.NbH 


1.00 


154.00 



TABLE 13B 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 

Accession: Genbank accession numbers 

Pkey CAT Number Accession 

431089 327625 1 BE041395AA491826AA621946AA715980AA666102 

436532 421802J AA721522AW975443 T93070 



TABLE 13C 

Pkey: 
Ref: 

Strand: 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. "Dunham I. et at." refers to the publication entlfled "The DNA 
sequence of human chromosome 22." Dunham L et al., Nature (1 999) 402:489495. 
Indicates DNA strand from which exonB were predicted. 



NLposittan: indicates nucleotide positions of predicted exons. 
Pkey Ref Sirand NLposition 

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 



180 



WO 02/086443 PCT7US02/12476 

TABLE 1 4A: Preferred Utility and Subcellular Localization for Potential Lung Disease Targets 

Table 14A shows the subcellular localization and preferred utility for the genes appearing in Tables 9A and 10A. mAb symbolizes monoclonal antibody, diag symbolizes 
diagnostic, s.m. symbolizes small molecule, and CTL symbolizes cytotoxic lymphocytic ligand. These genes were selected from 59680 probesets on the Ecs/Affymstrix Hu03 
Genechlp array. 

Table 14B show the accession numbers for those Pkey's lacking Unig enelffs for table 14A. For each probeset we have listed the gene duster number from which the 
oUgonucleoSdes were designed, Gene dusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwIst, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession* column. 

Table 14C show the genomic positioning for those Pkey's lacking Unlgene ID's and accession numbers In table 14A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: UnJgene number 

Unigene Title: Unigene gene title 

Pre/.Utility: Preferred Utility 

Predloc: Predicted subcellular localization 



Pkey 


ExAccn 


UnlgenelD 


Unigene Title 


Pref Utility 


Pred. Loc 


400289 


X07820 


Hs.2258 


matrix metaUoproteinase 10 (stromelysJn 


mAb & diag & s.m. 


extracellular 


400303 


AA242758 


Hs.79136 


UV-1 protein, estrogen regulated 


mAb 


plasma membrane 


402075 






ENSP00000251056*:Plasma membrane calcium mAb & diag 


secreted 


407811 


AW1 90902 


Hs.40098 


cysteine knot superfamlly 1, BMP antagon 


diag 


secreted 


403243 


Y0Q787 


Hs.624 


Interieukin 8 


diag 


secreted 


408790 


AW580227 


Hs.47860 


neurotrophic tyrosine kinase, receptor, 


mAb & s.m. 


plasma membrane 


408908 


BE296227 


Hs.250822 


serine/threonine kinase 15 


s.m. 


cytoplasm 


40S041 


AB033025 


Hs.50081 


Hypothetical protein, XP 051860 (KIAA1 19 


CTL & diag 


secreted 


409103 


AF251237 


Hs.112208 


XAGE-1 protein 


CTL 


nuclear 


409420 


Z15008 


Hs.54451 


lamlnln, gamma 2 (nicein (100kD), kalini 


diag 


secreted 


409632 


W74001 


Hs.55279 


serine (or cysteine) proteinase Inhibito 


diag 


secreted 


409757 


NM.001898 


Hs.123114 


cystatin SN 


diag 


extracellular 


409893 


AW247090 


Hs.57101 


minlchromosome maintenance deficient (S. 


CTL 


nuclear 


409956 


AW103364 


Hs.727 


inhibin, beta A (activin A, acCvin AB a 


diag 


extracellular 


410001 


AB041036 


Hs.57771 


kallikreln 11 


diag . 


extracellular 


410407 


X66839 


Hs.63287 


carbonic anhydrase IX 


mAb & s.m 


plasma membrane 


410418 


D31382 


Hs.63325 


transmembrane protease, serine 4 


mAb & diag & s.m. 


plasma membrane 


412140 


AA219691 


Hs.73625 


RAB6 Interacting, kinesin-like (rabkines 


s.m. 




412719 


AW016610 


Hs.816 


ESTs 


s.ra 


nuclear 


414774 


X02419 


Hs.77274 


plasminogen activator, urokinase 


diag 


extracellular 


414883 


AA926960 




CDC28 protein kinase 1 


s.m. 




415138 


C18356 


Hs.295944 


tissue factor pathway Inhibitor 2 


CTL & diag 


extracellular 


415669 


NM_005O25 


Hs.78589 


serine (or cysteine) proteinase inhibito 


mAb & diag & s.m. 


secreted 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


mAb & s.ra 


plasma membrane 


416658 


U03272 


Hs.79432 


fibrillin 2 (congenital contracture! ara 


diag 


extracellular 


417034 


NNL006183 


Hs.80962 


neurotensin 


diag 


extracellular 


417079 


U65590 


Hs.81134 


interieukin 1 receptor antagonist 


diag 


extracellular 


417308 


H60720 


Hs.81892 


KIAA0101 gene product 


s.m. 


mitochondrial 


417389 


BE260964 


Hs.82045 


mldkine (neurits growth-promoting factor 


mAb & diag 


secreted 


417433 


BE270266 


Hs.82128 


5T4 oncofetal trophobtast glycoprotein 


mAb 


plasma membrane 


417933 


X02308 


Hs.82962 


thymidylate synthetase 


s.m. 


endoplasmic reticulum 


418478 


U38945 


Hs.1174 


cyclin-dependent kinase inhibitor 2A (me 


s.m. 


cytoplasm 


418506 


AA084248 


Hs.85339 


G protein-coupled receptor 39 


mAb Asm 


plasma membrane 


418678 


NNL001327 


Hs.167379 


cancer/testis antigen (NY-ESO-1) 


CTL 


cytoplasmic 


419121 


AA374372 


Hs.89626 


parathyroid hormone-like hormone 


diag 


secreted 


419171 


NM.002846 


Hs.89655 


protein tyrosine phosphatase, receptor t 


mAb & s.m. 


plasma membrane 


419183 


U60669 


Hs.89663 


cytochrome P450, subfamOy XXIV (vitamin 


CTL&s.m. 


mitochondrial 


419216 


AU076718 


Hs.164021 


small Inducible cytokine subfamily B (Cy 


diag 


secreted 


419235 


AW470411 


Hs.288433 


neurotrimin 


mAb & diag 


plasma membrane 


419452 


U33635 


Hs.90572 


PTK7 protein tyrosine kinase 7 


mAb & s.m. 


plasma membrane 


419556 


U29615 


Hs.91093 


chitlnase 1 (chitotriosldase) 


mAb & diag 


extracellular* 


420610 


AI683183 


Hs.99348 


distal-less homeo box 5 


CTL 


nuclear 


421110 


AJ250717 


Hs.1355 


cathepsin E 


sm&diag 


extracellular 


421379 


Y15221 


Hs.103982 


small Inducible cytokine subfamily B (Cy 


diag 


secreted 


421474 


U76362 


Hs.104637 


solute carrier family 1 (glutamate trans 


mAb & s.m. 


plasma membrane 


421552 


AF026692 


Hs.105700 


secreted frizzled-related protein 4 


diag 


secreted 


421753 


BE314828 


Hs.107911 


ATP-btnding cassette, sub-family B (MDFV 


mAb & s.m. 


plasma membrane 


421817 


AF146074 


Hs.108660 


ATP-binding cassette, sub-family C (CFTR 


mAb & s.m. 


plasma membrane 


422109 


S73265 


Hs.1473 


gastrin-releaslng peptide 


diag 


secreted 


422158 


L10343 


Hs.112341 


protease inhibitor 3, skin-derived (SKAL 


diag 


secreted 


422282 


AF019225 


Hs.1 14309 


apolipoprotein L 


diag 


secreted 


422283 


AW411307 


Hs.114311 


COC45 (cell division cycle 45, S.cerevfs 


s.m. 


nuclear 


422424 


A11B6431 


Hs.296638 


prostate differentiation factor 


diag 


extracellular 


422765 


AW409701 


Hs.1578 


baculoviral IAP repeatcontaining 5 (sur 


s.m. 


cytoplasm 


422809 


AK001379 


Hs.1 21028 


hypothetical protein FU10549 


s.m. 


nuclear 


422867 


L32137 


Hs.1584 


cartilage oligomer^ matrix protein (pse 


diag 


extracellular 


422956 


BE545072 


Hs.1 22579 


ECT2 protein (Epithelial cell transforml 


CTL&s.m. 




423634 


AW959908 


Hs.1690 


hepartn-binding growth factor binding pr 


diag 


secreted 


423673 


BE003054 


Hs.1 695 


matrix metallo proteinase 12 (macrophage 


mAb & diag & s.m. 


423961 


D13666 


Hs.136348 


periosbn (OSF-2os) 


mAb & diag 


extracellular 


424046 


AF027866 


Hs.138202 


serine (or cysteine) proteinase inhibito 


diag 


secreted 


424381 


AA285249 


Hs.146329 


protein kinase Chk2 


s.m. 


nuclear 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



424502 
424503 
424687 
425247 
425322 
425650 
425734 
425776 
425652 
426215 
426427 
426514 
427335 
427747 
428242 
428330 
428450 
428479 
428484 
428664 
428698 
428748 
428758 
428S69 
429211 
429263 
429547 
429610 
429903 
430488 
431462 
431515 
431846 
431958 
432201 
433001 
435505 
43648V 
437016 
437044 
437789 
437852 
439223 
439477 
439606 
439738 
440006 
441362 
442117 
443247 
443426 
443859 
444006 
444371 
444381 
444781 
445537 
446619 
446921 
447033' 
447342 
448243 
448844 



AF242388 

NM.Q022G5 

J05070 

NM.00594Q 

U63630 

NM.001944 

AF056209 

U25128 

AK001504 

AW963419 



449722 
450001 
450375 
450701 
450983 
451668 
452281 
452401 
452747 
452838 
453968 
457489 



AA448542 

AW411425 

H55709 

L22524 

NM.014791 

Y00272 

AF104032 

AK001666 

AA852773 

AW593206 

AA433988 

AF120274 

AF052693 

AA019004 

AW009166 

AB024937 

AL134197 

BE062109 

AW583672 

NM..012152 

BE019924 

X63629 

AI538613 

AF217513 

AF200492 

AA379597 

AU076916 

AL035864 

AI581344 

BE001836 

AW238299 

W69813 

W79123 

BE246502 

AK000517 

BE614410 

AW664964 

BE6143B7 

AF098158 

NM.013409 

BE395085 

BE540274 

BE3B7335 

NM.014400 

AJ245671 

AU076643 

AB012113 

AI357412 

AI199268 

AW369771 

Al 581519 

Z45051 

BE280074 

NMJJ01Q44 

AA009647 

H39960 

AA305384 

243948 

T93500 

NM.007115 

BE153855 

U65011 

AA847843 

AI693B15 



Hs.149585 

Hs.149609 

Hs.151738 

Hs.155324 

Hs.155637 

Hs.1925 

Hs.159396 

Hs.159499 

Ks.159651 

Hs.1 55223 

Hs.169840 

Hs.170195 

Hs.251677 

Hs.180655 

Hs.2250 

Hs.2256 

Hs.184339 

Hs.334562 

Hs.184601 

Hs.189095 

Hs.334838 

Hs.98785 

Hs.98502 

Hs.194689 

Hs.198249 

Hs.198396 

Hs.99376 

Hs.211092 

Hs.93597 

Hs.241551 

Hs.2S6311 

Hs.258583 

Hs.271580 

Hs.2877 

Hs.298241 

Hs.279905 

Hs.211238 

Hs.5199 

Hs.5398 

Hs.69517 

Hs. 127812 

Hs.256897 

Hs.250618 

Hs.58042 

Hs.58561 

Hs.9598 

Hs.6844 

Hs.23044 

Hs.128899 

Hs.333893 

Hs.9329 

Hs.9914 

Hs.10086 

Hs.239 

Hs.283713 

Hs.11950 

Hs.12844 

Hs.313 

Hs.16530 

Hs.157601 

Hs.19322 

Hs.52620 

Hs.177164 

Hs.22920 

Hs.23960 

Hs.406 

Hs.288467 

Hs.25740 

Hs.326444 

Hs.28792 

Hs.29352 

Hs.61460 

Hs.30743 

Hs.62711 

Hs.127179 



lengsln 

Integrin, alpha 5 (fibronectin receptor, 



B 

matrix metatloproteinase 11 (stromelysln 
protein kinase, DNA-activated. catalytic 
desmoglein 3 (pemphigus vulgaris antigen 
peptidylglycine alpha-amldating monooxyg 
parathyroid hormone receptor 2 
death receptor 6, TNF superfamiry member 
stanniocafcln 2 
TTK protein kinase 

bone morphogenetic protein 7 (osteogenic 
G antigen 7B 

serine/threonine kinase 12 

leukemia inhibitory factor (cholinergic • 

matrix metailoproteinase 7 (matritysin, 

KIAA0175gene product 

cell division cycle 2, G1 to S and G2 to 

sotute carrier family 7 (cationic amino 

similar to SALL1 (sal (Drosophila)-like 

KIAA1866 protein 

Ksp37 protein 

CA1 25 antigen; mucin 16 

artemin 

gap junction protein, beta 5 (connexin 3 
ATP-blnding cassette, sub-family A (ABC1 
ESTs 

LU NX protein; PLUNC (palate lung and nas 
cydln-dependent kinase 5, regulatory su 
chloride channel, calcium activated, flam 
granln-like neuroendocrine peptide precu 
endothelial differentiation, lysophospha 
uroplakln 1B 

cadherin 3, type 1, P-cadherin (placenta 
Transmembrane protease, serine 3 
clone HQO310PRO0310p1 
interleukin-1 homolog 1 
HSPC150 protein similar ta ublquitln-con 
guanine monphosphate synthetase 
differentially expressed In FanconPs an 
ESTs, Weakly similar to T17330 hypotheti 
ESTs, Weakly similar to dJ365012.1 IH.sa 
UL1 6 binding protein 2 

ESTs, Moderately similar to GFR3.HUMAN G 
G protein-coupled receptor 87 
sema domain, immunoglobulin domain (Ig), 
NALP2 protein; PYRlN-Containing APAFI-li 
RAD51 (S. cerevlsiae) homolog (E coll Re 
ESTs; hypothetical protein for IMAG&447 
c-Myc target JP01 

chromosome 20 open reading frame 1 
folflstatln 

type I transmembrane protein Fn14 
forkhead box Ml 

ESTs. Weakly similar to S64054 hypotheU 

GPI-ancrtored metastasis-associated prote 

EGF-like-domaln, multiple 6 

secreted phosphoprotein 1 (osteopontin, 

small Inducible cytokine subfamily A (Cy 

ESTs 

Homo sapiens, Similar to RIKEN cONA 2010 

integrin, beta 8 

ESTs 

similar to S68401 (cattle) glucose Indue 
cyclln B1 

sotute carrier family 6 (neurotransmitte 
adlsintegrin and metailoproteinase doma 
hypothetical protein XP.098151 (leucine- 
ER01 (S. cerevlsJae)-like 
cartilage acidic protein 1 
Homo sapiens cDNA FU1 1041 fis. clone PL 
tumor necrosis factor, afoha-lnduced pro 
Ig superfamily receptor LNIR 
preferentially expressed antigen In mela 
High mobility group (nonhistone chromoso 
cryptic gene 



s.m. 


cytoplasmic 


mAb&s.m. 


plasma membrane 


diag 


extracellular 


mAb & diag & s.m. 


secreted 


s.m. 


cytoplasmic 


mAb 


plasma membrane 


s.m. 




mAb & dag 


plasma membrane 


mAb & s.m. 


plasma membrane 


mAb & diag 


secreted 


CTL & s.m. 


nuclear 


mAb & diag 


secreted 


CTL 


cytoplasmic 


s.m. 


cytoplasmic 


diag 




mAb & diag & s.m. 


extracellular 


s.m. 


nuclear 


s.m. 


nuclear 


mAb & s.m. 


plasma membrane 


CTL & s.m. 


nuclear 


mAb 




diag 


extracellular 


diag 


mltochodria* 


diag 


extracellular 


mAb & s.m. 


plasma membrane 


mAb&s.m. 


plasma membrane 


diag 


secreted 


mftD a aiag 




s.m. 




mAb&s.m 


plasma membrane 


diag 


extracellular 


mAb & s.m. 


plasma membrane 


mAb & (flag 


plasma membrane 


mAb & diag 


plasma membrane 


mAb & diag & s.m. 


plasma membrane 


s.m. 


nuclear 
secreted 


diag 
s.m. 




s.m, 


cytoplasm 


CTL 


ER 


CTL 


nuclear 


mAb & s.m. 


plasma membrane 


mAb 


plasma membrane 


mAb&s.m. 




mAb & s.m. 


plasma membrane 


mAb & s.m 


plasma membrane 


s.m. 


nuclear 


s.m. 




mAb&s.m. 


plasma membrane 


CTL 


extracellular* 


CTL 




diag 


extracellular 


mAb 


plasma membrane 


sjn. 


nuclear 


diag 


secreted 


mAb & diag 


plasma membrane 


mAb & diag 


secreted 


diag 


secreted 
extracellular 


diag 

CTL & diag 


secreted 


CTL 




mAb & s.m. - 


plasma membrane 


mAb&s.m. 




mAb 


plasma membrane 


s.m. 


cytoplasm 


mAb & am 


plasma membrane 


mAb & diag & s.m. 


plasma membrane 


mAb&diag 


plasma membrane 
secreted 


(flag 

mAb&diag 


plasma membrane 


diag 




diag 


extracellular 


mAb 


plasma membrane 


CTL 


nuclear 


CTL & s.m. 


nuclear 


diag 


secreted 



TABLE 14B 



Pkey: Unique Eos probeset Identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accession 
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WO 02/086443 
414883 15024J 



450375 83327J 



PCT/US02/12476 

AA926960 AA926959 W76521 W24270 W21526 AA037172 BE267636 H83186 AA469909 N86396 AA001348 BE535736 AA081745 BE566245 
AA082436 H72525 H77575 N49786 W80565 H78746 BE559085 W04339 R98127 T55938 BE279271 AW960304 T29812 AA476873 BE297387 
AA292753 AA177048 NM_CQ1826 X54941 BE314366 AA908783 AI719075 BE270172 BE269819 AA889955 A!204630 W25243 AI935150 
AA872039 W72395 T99630 AI422691 H98460 N31428 BE255916 H03265 AI857576 AA776920 AA910644 AA459522 AA293140 AW514667 
R75953 AW662396 AA662522 AI865147 AI423153 AW262230 AA584410 AA583187 AW024595 AWQ69734 A1828996 AA282997 AA876046 
AW613002 AA527373 AW972459 AI831360 AA621337 AA100926 AA772418 AA594628 A1033892 W95098 AI034317 AA398727 AI085031 
N95210 AI459432 AI041437 AA932124 AA6276B4 AA935829 AIQQ4827 AI423513 AI094597 H42079 R54703 A1630359 AA617681 AA978045 
AA643280 W44561 AI991988 AI537692 AW90262 AA740817 AI312104 AI911822 AA416B71 A1185409 AA129784 M701623 AI075239 
AJ139549 AA633648 AJ339996 AI336880 AA399239 AI078708 AI085351 A1362835 AI346618 AI146955 AI989380 AI348243 N92892 AA765850 
AI494230AI278887AA962596A1492600W80435AA001979 R97424AM29015N24127 AA157451 AA235549 AA459292 AA037114AA129785 
A149421 1 AW059601 AW886710 R92790 N59755 AI36112B AW5B9407 H47725 H97534 H48076 H48450 T99631 AW300758 H03431 R76789 
AA954344 H77576 R96823 AI457100 N92845 N49682 H42038 BE220698 BE220715 H99552 AA701624 N74173 R54704 H79520 H72923 
H03266 BE261 919 AA769633 AA480310 AA507454 AA910588 A1203723 AW104725 W2561 1 W25071 T88980 H03513 T77589 R99156 
W95095 R97470 AA702275 T77551 AA91 1952 H82956 N83673 AA283672 

AA009647AA131254AA374293AW954405 H04410AW606284AA151166BE157467BE157601 H04384 W46291 AW663674 H04021 H01532 
AA190993 H03231 H59605 H01642 AA852876 AA1 13758 AA626915 AA746952 AI161014 AA099554 R69067 



TABLE 14C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (GI) numbers. 'Dunham I. et al," refers to the publication entitled The DNA 

sequence of human chromosome 22.' Dunham I. et a)., Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NLposition 

402075 8117407 Plus 121907-122035,122804-122921.124019-124161,124455-124610,125672-126076 
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TABLE 15A: Information for all sequences In Table 16 

Table 15A shows the Seq ID No, Pkey, ExAccn, UnlgenelD, and Unlgene Title for all of the sequences In Table 16. 

Table 15B show the accession numbers for those Pke/s lacking UnigenelD's for table 15A. For each probeset we have listed the gene cluster number from ^which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Table 15C show the genomic positioning for those Pke/s lacking Unigene IPs and accession numbers in table 15A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Seq ID No: Sequence ID number 
Pkey: Unique Eos probeset Identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigenelD: Unigene number 
Unlgene Title: Unlgene gene title 



Seq 10 No: 

Seq ID No: 1 & 2 
Seq ID No: 3& 4 
Seq lDNo:S&6 
SeqIDNo:7&8 
Seq ID No: 9 & 10 
Seq ID No: 11 & 12 
Seq ID No*. 13 & 14 
Seq ID No: 15 & 16 
Seq ID No: 17 & 18 
Seq ID No: 19 & 20 
Seq ID No: 21 & 22 
Seq ID No: 23 & 24 
Seq ID No: 25 & 26 
Seq 10 No: 27 & 28 
Seq ID No: 29&30 
Seq ID No: 31&32 
Seq ID No: 33 & 34 
Seq ID No: 35 4 36 
Seq ID No: 37 & 38 
Seq ID No: 39 & 40 
Seq ID No: 41 & 42 
Seq ID No: 43 & 44 
Seq ID No: 45 & 46 
Seq ID No: 47 & 48 
Seq ID No: 49 
Seq ID No: 50 & 51 
Seq ID No: 52 & 53 
Seq ID No: 54 & 55 
Seq ID No: 56 & 57 
Seq lDNo:58&59 
Seq ID No: 60 & 61 
Seq ID No: 62 & 63 
Seq lDNo:64&65 
Seq ID No: 66 & 67 
Seq ID No: 68 & 69 
Seq IDNo:70&71 
Seq ID No: 72 & 73 
Seq ID No: 74 & 75 
Seq ID No: 76 & 77 
Seq ID No: 78479 
Seq IDNo:80&81 
Seq ID No: 82 & 83 
Seq ID No: 84 & 85 
Seq ID No: 86 & 87 
Seq ID No: 68 & 89 
Seq ID No: 90&91 
Seq ID No: 92 & 93 
Seq ID No: 94 & 95 
Seq ID No: 96 & 97 
Seq ID No: 98 4 99 
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small proline-rich protein 1B (comifin) 
H2A Nstone family, member X 
gb:Homo sapiens full length insert cDNA 
glycoprotein (transmembrane) nmb 
alpha-fetoprotein 

Integrln, alpha 5 (fibronectln receptor, 
matrix metalloproteinase 10(stromalysin 
matrix metalloproteinase 1 (interstitial 
matrix metalloproteinase 1 (Interstitial 
solute carrier family 7, (catlonic amino 
tissue factor pathway inhibitor 2 
G protein-coupled receptor 39 
periosb'n (OSF-2os) 
monokine induced by gamma Interferon 
5T4 oncofetal trophoblast glycoprotein • 
5T4 oncofetal trophoblast glycoprotein 
cartilage oligomeric matrix protein (pse 
small inducible cytokine subfamily 8 (Cy 
ESTs, Weakly similar to S64054 hypotheti 
LIV-1 protein, estrogen regulated 
Adlican 

KIAA1866 protein 

hypothetical protein FU21080 

secreted frizzled-reJated protein 4 

Ig superfamily receptor LNIR 

a disrntegrin and metalloproteinase doma 

stanniocalcin 2 

matrix metalloproteinase 11 (stromelysln 
Transmembrane protease, serine 3 
collagen, type X alpha 1 (Schmld metaph 
ESTs; hypolhe&cal protein for IMAGE:447 
gap Junction protein, beta 2. 26kD (conn 
ESTs 
ESTs 
ESTs 

c-Myc target JP01 
transmembrane protease, serine 4 
Hypothetical protein, XPJ351860 (KIAA119 
Hypothetical protein, XPJJ51860 (K1AA1 19 
transcription factor 
bone morphogenetic protein 2 
fibrillin 2 (congenital contracturaJ ara 
cysteine knotsuperfarnfly 1, BMP antagon 
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Seq ID No: 462 & 463 437852 

Seq ID No: 464 & 465 402075 

Seq ID No: 466 & 467 421110 

Seq ID No: 468 & 469 451668 

Seq ID No: 470 & 471 451668 

Seq ID No: 472 & 473 451668 

Seq ID No: 474 & 475 422282 

Seq ID No: 476 & 477 425852 

SeqlDNo:47B&479 439738 

Seq ID No: 460 & 481 427747 

Seq ID No: 462 & 483 420281 

Seq ID No: 464 & 465 405932 

Seq ID No: 486 & 487 405932 

Seq ID No: 488 & 489 444342 

SeqjDNo:490& 491 421379 

Seq ID No: 492 & 493 417079 

Seq ID No: 494 & 495 430890 

Seq ID No: 496 & 497 419721 

Seq ID Not 498 & 499 444471 

Seq ID No: 500 & 501 413063 

Seq ID No: 502 & 503 433600 

Seq ID No: 504 & 505 452401 

Seq ID No: 506 & 507 452401 

Seq ID No: 508 & 509 450001 

Seq ID No: 510 & 511 410407 

Seq ID No: 51 2 & 513 309931 

Seq ID No: 514 & 515 412719 

Seq ID No: 516 & 517 417034 

Seq ID No: 518 & 519 430486 

Seq ID No: 520 & 521 413753 

Seq ID No: 522 & 523 425650 

Seq ID No: 524 & 525 423673 

Seq ID No: 526 & 527 418663 

Seq ID No: 528 & 529 418663 

Seq ID No: 530 & 531 429610 

Seq ID No: 532 & 533 406690 

Seq ID No: 534 & 535 431846 

Seq ID No: 536 & 537 422158 

Seq ID No: 538 & 539 431958 

Seq ID No: 540 & 541 437044 

Seq ID No: 542 & 543 428484 

Seq ID No: 544 & 545 429211 

Seq ID No: 546 & 547 417389 

Seq ID No: 548 & 549 431009 

Seq ID No: 550 & 551 417542 

Seq ID No: 552 & 553 449230 

Seq ID No: 554 & 555 410555 

Seq ID No: 556 & 557 410555 

Seq ID No: 558 & 559 424687 

Seq ID No: 560 & 561 418462 

Seq ID No: 562 & 563 410274 

Seq ID No: 564 & 565 439606 

Seq ID No: 566 & 567 404877 

Seq ID No: 568 & 569 444781 

Seq ID No: 570 & 571 418543 

Seq ID No: 572 & 573 415817 

Seq ID No: 574 & 575 415817 

Seq ID No: 576 & 577 415817 

Seq ID No: 578 & 579 415817 

Seq ID No: 580 & 581 415817 

Seq ID No: 582 & 583 415817 

Seq ID No: 584 & 585 421817 

Seq ID No: 586 & 587 418678 

Seq ID No: 588 & 589 418678 

Seq ID No: 590 & 591 409420 

Seq ID No: 592 & 593 332180 

Seq ID No: 594 & 595 408790 

Seq ID No: 596 & 597 408790 

Seq ID No: 598 & 599 439223 

Seq ID No: 600 & 601 409757 

Seq ID No: 602 & 603 428969 

Seq ID No: 604 & 605 426969 

Seq ID No: 606 & 607 428969 

Seq ID No: 608 & 609 426969 

Seq ID No: 610 & 611 450701 

Seq ID No: 612 & 613 450701 

Seq ID No: 614 & 615 414774 

Seq ID No: 616 & 617 407944 

Seq ID No: 618 & 619 407944 

Seq ID No: 620 & 621 457489 

Seq ID No: 622 & 623 429547 

Seq ID No: 624 a 625 407242 

Seq ID No: 626 & 627 407242 

Seq ID No: 628 & 629 407242 

Seq ID No: 630 & 631 444006 



BE001836 Hs.256897 

AJ250717 Hs.1355 
Z43948 Hs.326444 



Z43948 


Hs.326444 


Z43948 


Hs.326444 


AF019225 


Hs.1 14309 


AK001504 


Hs.159651 


BE246502 


Hs.9598 


AW41U25 


Hs.180655 


AI623693 


Hs.323494 


NMJM4398 


Hs.10887 


Y15221 


Hs.1 03982 


U65590 


Hs.81134 


X54232 


Hs.2699 


NM.001650 


Hs.288650 


AB020684 


Hs.11217 


AL035737 


Hb.75184 


AI034361 


Hs.135150 


NMJJ07115 


Hs.29352 


NM 007115 


Hs.29352 


NM.001044 


Hs.406 


X66839 


Hs.63287 


AW341663 




AW016610 


Hs.816 


NKL006183 


Hs.80962 


BE062109 


Hs.241551 


U17760 


Hs.75517 


NM.001944 


Hs.1925 


BE003054 


Hs.1695 


AK001100 


Hs.41690 


AK001100 


Hs.41690 


AB024937 


Hs.211092 


M29540 


Hs.220529 


BE019924 


Hs.271580 


L10343 


Hs.1 12341 


X63629 


Hs.2877 


ALG35864 


Hs.69517 


AF104032 


Hs.184601 


AF052693 


Hs.198249 


BE260964 


Hs.82045 


BE149762 


Hs.48956 


J04129 


Hs.82269 


BE613348 


Hs.211579 


U92649 


H&.64311 


U92649 


Hs.64311 


J05070 


Hs.1 51738 


8E001596 


Hs.85266 


AA381B07 


Hs.61762 


W79123 


Hs.58561 


NM.O1440O 


Hs.1 1950 


NMJ1Q5329 


Hs.85962 


U88967 


HS.7BB57 


U88967 


Hs.78867 


U88967 


Hs,78867 


U88967 


Hs.78867 


U8B967 


Hs.78867 


U88967 


Hs.78867 


AF146074 


Hs.1 08660 


NM.001327 


Hs.1 67379 


NMJJ01327 


Hs.1 67379 


Z15008 


Hs.54451 


AF134160 


Hs.7327 


AW580227 


Hs.47860 


AW580227 


Hs.47860 


AW238299 


Hs.250618 


NM.001898 


Hs.123114 


AF120274 


Hs.194689 


AF120274 


Hs.194689 


AF120274 


Hs.194689 


AF120274 


Hs.194689 


H39960 


Hs.288467 


H39960 


Hs.288467 


X02419 


Hs.77274 


R34008 


Hs.239727 


R34008 


Hs.239727 


AI6S3815 


Hs.127179 


AW009166 


Hs.99376 


M18728 




M18728 




M18728 




BE395085 


Hs.10086 



ESTs, Weakly similar to dJ36501Z1 [H.sa 
ENSP00000251056*:Plasma membrane calcium 
calhepstn E 

cartilage acidic protein 1 
cartilage acidic protein 1 
cartilage acidic protein 1 
apollpoprotelnl 

death receptor 6. TNF superfamiiy member 
sema domain, Immunoglobulin domain (lg), 
serine/threonine kinase 12 
Predicted cation efflux pump 
C15000305:gi|3806122|gb|AAC69198.1| (AF0 
C15000305:gl|3806122|gblAAC69198.1I(AFO 
similar to lysosome-assoclated membrane 
small inducible cytokine subfamily B (Cy 
Interteukln 1 receptor antagonist 
glypican 1 
aquaporin 4 
K1AA0877 protein 

chitinase 3-iike 1 (cartilage glycoprote 
lung type-l cell membrane-associated gly 
tumor necrosis factor, alpha-induced pro 
tumor necrosis factor, alpha-induced pro 
solute carrier family 6 {neurotransmltte 
carbonic anhydrase IX 

gb:hd13d01.x1 Soares_NFU.T_GBC.S1 Homos 
ESTs 

neurotensin 

chloride channel, calcium activated, fam 
laminin, beta 3 (nicein (1 25kD), kalinln 
desmogleln 3 (pemphigus vulgaris antigen 
matrix metailoproteinase 12 (macrophage 
desmocollin 3 
desmocollfn 3 

LUNX protein; PLUNC (palate lung and nas 
card no embryonic antigen-related cell ad 
uroplakin 1B 

protease Inhibitor 3, skin-derived (SKAL 
cadherin 3, type 1, P-cadherin (placenta 
differentially expressed In Fanconfs an 
solute carrier family 7 (cationlc amino 
gap junction protein, beta 5 (connexin 3 
midklne (neurite growth-promoting factor 
gap junction protein, beta 6 (connexin 3 
progestagen-assodaied endometrial prote 
melanoma cell adhesion molecule 
adisintegrin and metailoproteinase doma 
a (Dsintegrtn and metailoproteinase doma 
matrix metailoproteinase 9 (getatinase B 
integrin, beta 4 
hypoxia-inducible protein 2 - 
G protein-coupled receptor 87 
NM_005365:Homo sapiens melanoma antigen, 
GPl-anchored metastasis-associated prote 
hyaluronan synthase 3 
protein tyrosine phosphatase, receptor- 1 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
ATP-bindlng cassette, sub-family C (CFTR 
cancer/testis antigen (NY-ESO-1) 
cancer/testis antigen (NY-ESO-1) 
laminin, gamma 2 (nicein (100kD), kaJina 
daudln 1 

neurotrophic tyrosine kinase, receptor, 

neurotrophic tyrosine kinase, receptor, 

UL16 binding protein 2 

cystatln SN 

artemln 

artemin 

artemln 

artemin 

hypothetical protein XPJJ98151 (leucine- 
hypothetical protein XP.098151 (leucine- 
plasminogen activator, urokinase 
desmocollin 2 
desmocollin 2 
cryptic gene 
ESTs 

gb:Human nonspecific crossreading anflg 
gb:Human nonspecific crossreacling anCg 
gb:Human nonspecific crossreacling antlg 
type I transmembrane protein Fn14 
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Seq ID No: 632 & 633 
Seq ID No: 6346635 
Seq ID No: 636 & 637 
Seq ID No: 638 & 539 
Seq ID No: 640 & 641 
Seq ID No: 642 & 643 
Seq ID No: 644 & 645 
Seq ID No: 6464647 
Seq ID No: 648 & 649 
Seq ID No: 650 & 651 
Seq ID No: 652 & 653 
Seq ID No: 654 & 655 
Seq ID No: 656 & 657 
Seq ID No: 658 & 659 
Seq ID No: 660 & 661 
Seq ID No: 662 & 653 
Seq ID No: 664 & 665 
Seq ID No: 666 & 667 
Seq ID No: 666 & 669 
Seq ID No: 670 & 671 
Seq ID No: 672 & 673 
Seq ID No:674 &675 
Seq ID No: 676 & 677 
Seq ID No: 678 & 679 
Seq ID No: 680 &681 
Seq IDNo:682&683 
Seq ID No: 684 & 685 
Seq ID No: 686 & 687 
Seq ID No: 688 & 689 

TABLE 15B 



Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 



PCT/US02/12476 



429597 


NM.003816 


Hs.2442 


a dfstntegrfn and metafloproteinase doma 


422109 


S73265 


Hs.1473 


gastrirweleasing peptide 


419235 


AW470411 


Hs.288433 


neurotrimln 


449048 


Z45051 


Hs.22920 


similar to S68401 (cattle) glucose indue 


419216 


AU076718 


Hs.1 64021 


small inducible cytokine subfamily B (Cy 


431462 


AW583672 


Hs.256311 


granln-like neuroendocrine peptide precu 


448243 


AW369771 


Hs.52620 


integrin, beta 8 


426427 


M86699 


Hs.1 69840 


TTK protein kinase 


445537 


AJ245671 


Hs.12844 


EGF-like-domain, multiple 6 


422278 


AF072873 


Hs.1 14218 


frizzled (Drosophila) homolog 6 


428450 


NMJM4791 


Hs.184339 


WAA0175gene product 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteoponOn, 


453392 


U23752 


Hs.32964 


SRY (sex determining region Y)-box 1 1 


426514 


BE616633 


Hs.170195 


bone morphogenetlc protein 7 (osteogenic 


425776 


U25128 


Hs.159499 


parathyroid hormone receptor 2 


425776 


U25128 


Hs.159499 


parathyroid hormone receptor 2 


431515 


NM.012152 


Hs.258583 


endothelial differentiation, lysophospha 


419452 


U33635 


Hs.90572 


PTK7 protein tyrosine kinase 7 


432653 


N62096 


Hs.293185 


ESTs, Weakly similar to JC7328 amino acl 


432653 


N6209S 


Hs.293185 


ESTs, Weakly similar to JC7328 amino acl 


432653 


N620S6 


Hs.293185 


ESTs, Weakly similar to JC7328 amino acl 


432653 


N62096 


Hs.293185 


ESTs, Weakly similar to JC7328 amino aci 


410001 


AB041036 


Hs.57771 


kaillkreln 11 


426501 


AW043782 


Hs.293616 


ESTs 


408369 


R38438 


Hs.1B2575 


solute carrier famfly 15 (H??? transport 


445413 


AA151342 


Hs.12677 


CGM47 protein 


422424 


AI186431 


Hs.296638 


prostate differentiation factor 


428330 


L22524 


Hs.2256 


matrix metalloprotelnase 7 (maMrysin, 


420610 


AI683183 


Hs.99348 


distal-less homeo box 5 



Pkey 

309931 

330493 



450375 



CAT Number 
AW3416B3 
33264.5 
47065J 

83327J 



451320 86576.1 



Accession 

M27826 R78416 AA307645 AW957879 AW957800 AA633529 H03662 

AL1 3391 6 N791 13 AF086101 N76721 AW950828 AA364013 AW955684 AI346341 AI867454 N54784 AI655270 AI421 279 AW01 4B82 
AA775552 N62351 N59253 AA626243 AI341407 BE175639 AA456968 AI358918 AA457077 

M009647M131254 M374293AW954405 H04410AW606284AA151166BE157467BE157601 H04384 W46291 AW663674 H04021 H01532 
AA190993 H03231 H59605 H01642 AA852876 AA1 13758 AA626915 AA746952 AI161014 AA099554 R69067 
AW1 18072 AI631982 T15734 AA224195 AI701458 W20198 F26326 AA890570 N90552 AW071907 AI671352 AI375892 T03517 R88265 
AI124088 AA224388 AI08431 6 AI 354686 T33652 AI140719 AI72021 1 T03490 AI372637 T15415 AW205B36 AA630384 T03515 T33230 
AA017131 AA443303 T33623 AI222556T33511 T33785AI4 19606 D55612 



TABLE 15C 



Pkey: 



Strand: 
NLposltion: 

Pkey 

402075 

403329 

403478 

404440 

404877 

405770 

405932 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham L et a).' refers to the publication entitled "The DNA 
sequence or human chromosome 22* Dunham I. et a).. Nature (1999) 402:489-495. 
Indicates DNA slrand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 

Ref Strand NLposltion 

8117407 Plus 121907-122035,122804-122921,124019.124161,124455.124610,126672-126076 

8516120 Plus 96450*96598 

9958258 Plus 116458.116564 

7528051 PIUS B0430-81581 

1519284 Plus 1095-2107 

2735037 Plus 61057-62075 

7767812 Minus 123525-123713 
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Table 16 

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #» NM_001216 

Coding sequence : 43 . . 1422 



PCT/US02/12476 



i 
I 

GCCOGTACAC 
AGCCCCTGGC 
CTGTCACTGC 

AGTGAAGAGG 
GAGGATCTAC 
TCCCTGAAGT 
AATAATGCCC 
CCGCCCTGGC 
CGCCCCCAGC 
CTCCCGCCGC 
CCTCCTGGGC 
CT0CACTG6G 
CCTGCCGAGA 
GGGCGCCCGG 
AGTGCCTATG 
CAGGTCCCAG 
TATGAGGGGT 
CAGACAGTGA 
GGTGACTCTC 
GAGG CCTCCT 
AATTCCTGCC 
ACCAGCGTCG 
GTGAGCTACC 
TGTGAGAAGC 
ATGCCACTTC 



11 

1 

ACCGTGTGCT 
TCCCTCTGTT 
TGCTTCTGAT 
GAGGAGGCTC 
ATTCACCCAG 
CTGGAGAGGA 
TAGAGGATCT 
ACAGGGACAA 
CCCGGGTGTC 
TCGCCGCCTT 
TCCCAGAACT 
TAGAGATGGC 
GGGCTGCAGG 
TCCACGTGGT 
GAGGCCTGGC 
AGCAGTTGCT 
GACTGGACAT 
CTCTGACTAC 
TGCTGAGTGC 
GGCTACAGCT 
TCCCTGCTGG 
TGGCTGCTGG 
CGTTCCTTGT 
GCCCAGCAGA 
CAGCCAGAGG 
CTTTTAACTG 



21 
I 

GGGACACCCC 
GATCCCGGCC 
GCCTGTCCAT 
TTCTGGGGAA 
AGAGGAGGAT 
GGATCTACCT 
ACCTACTGTT 
AGAAGGGGAT 
CCCAGCCTGC 
CTGCCCGGCC 
GCGCCTGCGC 
TCTGGGTCCC 
TCGTCCGGGC 
TCACCTCAGC 
CGTGTTGGCC 
GTCTCGCTTQ 
ATCTGCACTC 
ACCGCCCTGT 
TAAGCAGCTC 
GAACTTCCGA 
AGTGGACAGC 
TGACATCCTA 
GCAGATGAGA 
GGTAGCCGAG 
CATCTGAGGG 
CCAAGAAATT 



31 
I 

ACAGTCAGCC 
CCTGCTCCAG 
CCCCAGAOGT 
GATGACCCAC 
CCACCCGGAG 
GAAGTTAAGC 
GAGGCTCCTG 
GACCAGAGTC 
GCGGGCCGCT 
CTGCGCCCCC 
AACAATGGCC 
GGGCGGGAGT 
TCGGAGCACA 
ACCGCCTTTG 
GCCTTTCTGG 
GAAGAAATCG 
CTGCCCTCTG 
GCCCAGGGTG 
CACACCCTCT 
GCGACGCAGC 
AGTCCTCGGG 
GCCCTGGTTT 
AGGCAGCACA 
ACTGGAGCCT 
GGAGCCGGTA 
TTTTAAAATA 



41 
I 

GCATGGCTCC 
GCCTCACTGT 
TGCCCCGGAT 
TGGGCGAGGA 
AGGAGGATCT 
CTAAATCAGA 
GAGATCCTCA 
ATTGGCGCTA 
TCCAGTCCCC 
TGGAACTCCT 
ACAGTGTGCA 
ACCGGGCTCT 
CTGTGGAAGG 
CCAGAGTTGA 
AGGAGGGCCC 
CTGAGGAAGG 
ACTTCAGCCG 
TCATCTGGAC 
CTGACACCCT 
CTTTGAATGG 
CTGCTGAGCC 
TTGGCCTCCT 
GAAGGGGAAC 
AGAGGCTGGA 
ACTGTCCTGT 
AATATTTATA 



Seq ID NO: 2 Protein sequence! 
Protein Accession #: NP_001207 



MAPLCPSPWIi 



DPQEPQNNAH 
ELLGFQLPPL 
VEGHRFPAEI 
EEGSETQVPG 
DTLWGPGDSR 
GLLFAVTSVA 



11 

I 

PLLIPAPAPG 
SPREEDPPGE 
RDKEGDDQSH 
PELRLRNNGH 
HWHLSTAFA 
LDISALLPSD 
LQLNFRATQP 
FLVQMRRQHR 



21 



I 

LTVQLLLSLL 
EDLPGEEDLP 
WRYGGDPPWP 
SVQLTLPPGL 
RVDEALGRPG 
PSRYFQYEGS 
LNGRVIEASF 
RGTKGGVSYR 



31 
I 

LLMPVHPQRL 
GEEDLPEVKP 
RVSPACAGRF 
EMALGPGREY 
GLAVIiAAFLE 
LTTPPCAQGV 
PAGVDSSPRA 
PAEVAETGA 



41 
I 

PRMQEOSPLG 
KSEEEGSLKL 
QSFVDIRPQL 
RALQLHLHWG 
EGPEENSAYE 
IWTVFNQTVM 
AEPVQLNSCL 



51 
I 

CCTGTGCCCC 
GCAACTGCTG 
GCAGGAGGAT 
GGATCTGCCC 
ACCTGGAGAG 
AGAAGAGGGC 
AGAACCCCAG 
TGGAGGCGAC 
GGTGGATATC 
GGGCTTCCAG 
ACTGACCCTG 
GCAGCTGCAT 
CCACCGTTTC 
CGAGGCCTTG 
GGAAGAAAAC 
CTCAGAGACT 
CTACTTCCAA 
TGTGTTTAAC 
GTGGGGACCT 
GCGAGTGATT 
AGTCCAGCTG 
TTTTGCTGTC 
CAAAGGGGGT 
TCTTGGAGAA 
CCTGCTCATT 
AT 



51 
I 

GGSSGEDDPL 
EDLPTVBAPG 
AAFCPALRPL 
AAGRPGSEHT 
QLLSRLEEIA 
LSAKQLHTLS 
AAGDILALVF 



Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #s BC013923 

Coding sequence: 438-1391 



1 
I 

AGCGGGGTTG 
GTGTTTGCAA 
AGAGGAGAGA 
TAATAATAAC 
TGATCCTGAT 
TCCTCGCGGA 
CCCGCGGGCC 
CCGCGCACAG 
AGCAAACTTC 
AAAACAGCCC 
AGCGGCGCAA 
TGGGCGCCGA 
AGCGGCTGCG 
AAACCAAGAC 
GCGGCAATAG 
AGCGCATGGA 
AGGACCAGCT 
AGCCCATGCA 
CCTACATGAA 
TGGCTCTTGG 
TTACCTCTTC 
GCATGTATCT 
CCCAGCACTA 
CACACATGTG 
GGAAATGGGA 
TCAAAAAAAA 
AACACCAATC 
GAGAGATCCT 
GGGGAGGGCG 
TTTAAAAGTT 
TAATATTTAG 
TTGTACAGTA 



11 

1 • 

TCTATTAACT 
AAGGGGGAAA 
GAAAGAAAGG 
AATCATCGGC 
TCCAGTTTGC 
GCCCTGCGCT 
CCCCAAAGTC 
CGCCCGCATG 
GGGGGGCGGC 
GGACCGCGTC 
GATGGCCCAG 
GTGGAAACTT 
AGCGCTGCAC 
GCTCATGAAG 
CATGGCGAGC 
CAGTTACGCG 
GGGCTACCCG 
CCGCTACGAC 
CGGCTCGCCC 
CTCCATGGGT 
CTCCCACTCC 
CCCCGGCGCC 
CCAGAGCGGC 
AGGGCCGGAC 
GGGGTGCAAA 
AAAAAAAAAA 
CCATCCACAC 
GGACTTCTTT 
GGGGAATGGA 
CTAGTGGTAC 
AGCTAGTCTC 
TTTATCGAGA 



21 
I 

TGTTCAAAAA 
GTAGTTTGCT 
GAGAGAAGTT 
GGCGGCAGGA 
CTCTCTCTTT 
CCCGACACCC 
CCGGCCGGGC 
TACAACATGA 
GGCGGCAACT 
AAGCGGCCCA 
GAGAACCCCA 
TTGTCGGAGA 
ATGAAGGAGC 
AAGGATAAGT 
GGGGTCGGGG 
CACATGAACG 
CAGCACCCGG 
GTGAGCGCCC 
ACCTACAGCA 
TCGGTGGTCA 
AGGGCGCCCT 
GAGGTGCCGG 
CCGGTGCCCG 
AGCGAACTGG 
AGAGGAGAGT 
AAAATCCCAT 
TCACGCAAAA 
TKGGGGGACT 
CCTTGTATAG 
GGTAGGAGCT 
CAAGCGACGA 
TAAACATGGC 



31 
I 

GTATCAGGAG 
GCCTCTTTAA 
TGAGCCCCAG 
TCGGCCAGAG 
TTTTCCCCCA 

CGAGGGTCGG 
TGGAGACGGA 
CCACCGCGGC 
TGAATGCCTT 
AGATGCACAA 
CGGAGAAGCG 
ACCCGGATTA 
ACACGCTGCC 
TGGGCGCCGG 
GCTGGAGCAA 
GCCTCAATGC 
TGCAGTACAA 
TGTCCTACTC 
AGTCCGAGGC 
GCCAGGCCGG 
AACCCGCCGC 
GCACGGCCAT 
AGGGGGGAGA 
AAGAAACAGC 
CACCCACAGC 
ACCGCGATGC 
ATTTTTGTAC 
ATCTGGAGGA 
TTGCAGGAAG 
AAAAAATGTT 
AATCAAAATG 



41 

I 

TTGTCAAGGC 
GACTAGGACT 
GCTTAAGCCT 
GAGGAGGGAA 
AATTATTCTT 
CCCCTCCTCC 
CGGCCGCCGG 
GCTGAAGCCG 
GGCGGCCGGC 
CATGGTGTGG 
CTCGGAGATC 
GCCGTTCATC 
TAAATACCGG 
CGGCGGGCTG 
CCTGGGCGCG 
CGGCAGCTAC 
GCACGGCGCA 
CTCCATGACC 
GCAGCAGGGC 
CAGCTCCAGC 
GGACCTCCGG 
CCCCAGCAGA 
TAACGGCACA 
AATTTTCAAA 
ATGGAGAAAA 
AAATGACAGC 
CGACAAGAAA 
AGAGAAAACC 
AAGAAAGCTA 
TTTGCAAAAG 
TTAATATTTG 
TCCATTGTTT 




CGGGCCGGGC 
CCGGGCCCGC 
GGCAACCAGA 
TCCCGCGGGC 
AGCAAGCGCC 
GACGAGGCTA 
CCCCGGCGGA 
CTGGCCCCCG 
GGCGTGAACC 
AGCATGATGC 
GCGCAGATGC 
AGCTCGCAGA 
ACCCCTGGCA 
CCCCCTGTGG 
GACATGATCA 
CTTCACATGT 
CTGCCCCTCT 
GAAAAACGAG 
CCCGGTACGC 
TGCAAAAGAG 
ACTTTTATGA 
TGGGGAGGGT 
CGAAAAACTT 
TCTTTACCAA 
CAAGCAACTT 
ATAAGCTGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



60 
120 
1B0 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



189 
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AATTTGCCAA TATTTTTCAA GGAGAGGCTT CTTGCTGAAT TTTGATTCTG CAGCTGAAAT 1980 

TTAGGACAGT TGCAAACGTG AAAAGAAGAA AATTATTCAA ATTTGGACAT TTTAATTGTT 2040 

TAAAAATTGT ACAAAAGGAA AAAATTAGAA TAAGTACTGG CGAACCATCT CTGTGGTCTT 2100 

GTTTAAAAAG GGCAAAAGTT TTAGACTGTA CTAAATTTTA TAACTTACTG TTAAAAGCAA 2160 

5 AAATGGCCAT GCAGGTTGAC ACCGTTGGTA ATTTATAATA GCTTTTGTTC GATCCCAACT 2220 

TTCCA TTT TG TTCAGATAAA AAAAACCATG AAATTACTGT GTTTGAAATA TTTTCTTATG 2280 

GTTTGTAATA TTTCTGTAAA TTTATTGTGA TATTTTAAGG TTTTCCCCCC TTTATTTTCC 2340 

GTAGTTGTAT TTTAAAAGAT TCGGCTCTGT ATTATTTGAA TCAGTCTGCC GAGAATCCAT 2400 

GTATATATTT GAACTAATAT CATCCTTATA ACAGGTACAT TTTCAACTTA AGTTTTTACT 2460 

10 CCATTATGCA CAGTTTGAGA TAAATAAATT TTTGAAATAT GGACACTGAA AAAAAAAAAA 2520 

AAAAAAACAA AACAAAAAAA CAAAAAACAA AAACAGAAAA AACAAAAAAA AAAACAAAAC 2580 

CACAACACAA AAACAAAAAA AAAAAAAAGA AACAAACACA CAACACAACA CAACACAAAA 2 640 
CCACAACACA AACAACAACA CACAGAGGG 

15 Seq ID NO: 4 Protein sequence: 
Protein Accession #:CAA83435.1 

1 11 21 31 41 

OA I ' 1 1 1 

2U MYNMMETELK PPGPQQTSGG GGGNSTAAAA GGNQKNSPDR VKRPMNAFMV 

QENPKMHNSE ISKRLGAEWK LLSETEKRPF IDEAKRLRAL HMKEHPDYXY 

KKDKYTLPGG LIiAPGGKSMA SGVGVGAGLG AGVNQRMDSY AHMNGWSNGS 

PQHPGLNAHG AAQMQPMHRY DVSALQYNSM TSSQTYMNGS PTYSMSYSQQ 

GSWKSEASS SPPWTSSSH SRAPCOAGDI* RDMISMYLPG AEVPEPAAPS 

25 GPVPGTAING TLPLSHM 

Seq ID NO: 5 DNA sequence 
Nucleic Acid Accession #: U91618 
Coding sequences 29-541 

1 11 21 31 41 51 

I I I I I I 

CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60 

CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120 

35 AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCACATGT 180 

TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 

AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 300 

TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360 

TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTO ATACTGGAAA 420 

40 TGACAAAAAT GGAAAGGAAG AAGT CAT AAA GAGAAAAATT CCTTATATTC TGAAACGGCA 4B0 

GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTG 540 

AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600 

ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 660 

ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720 
45 TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT 



51 
I 

WSRGQRRKMA 60 

RPRRKTKTLM 120 

YSMMQDQLGY 180 

GTPGMALGSM 240 

RLHMSQHYQS 300 



Seq ID NO j 6 Protein sequence; 
50 Protein Accession #: AAB50564 

1 11 21 31 41 51 

I I I I I I 

MMAGMKI QLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 60 

55 VCSLVNNLNS PAEBTGEVHE EELVARRKDP TALDGPSLEA MLTIYQLHKI CHSRAFQHWE 120 
LIQEDILDTG NDKNGKEEVI KRKI P YILKR QLYENKPRRP YILKRDSYYY 

Seq ID NO: 7 DNA sequence 
Nucleic Acid Accession #» NM_006536.2 
60 Coding sequence: 109-2940 

1 11 21 31 41 51 

I I ! I I I 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

65 ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AG CATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

70 ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600 

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

75 ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

80 TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATT CAT AC CT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

85 TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 



V 
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CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAAC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA AOGGTAATAT TCAGATGAAT 2280 

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT QACCCTATCT 24 60 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2S80 

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAG ATT AT CT TATATTGAAA 2820 

GGAQTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2 940 

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3 000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC QQACCAGTOT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 
TACCTAGGAA A 

Seq ID NO: 8 Protein sequence: 
Protein Accession #: NP_006527.1 

1 11 21 31 41 51 

I I I I I I 

MTQRSIAGPI CNLKFVTLLV AIiSSELPFLG AGVQLQDNGY NGLI*IAINPQ VPENQNLISN 60 

IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 

GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VPVHEWAHLR WGVFDEYNND 180 

KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADPHHSPP MNGTELPPPP 300 

TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVBIHTFVG IASFDSKGEI 360 

RAQLHQINSN DDRKLIiVSYL PTTV8AKTDI SICSGLKKGF EWEKLNGKA YGSVMILVTS 420 

GDDKLLGNCL PTVLSSGSTI HSlAJiGSSAA PNIiEELSRLtT GGLKFFVPDI SNSNSMIDAF 480 

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNFITNLTFR TASIiWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 

AVPPATVEAF VERDSLHFPH PVMXYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFFSFA ANGRY8LKVH VNHSPSISTP AHS IPG SHAM YVPGYTANGN 720 

IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTF3 840 

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 
LZLKGVLTAM GLIGIICLII WTHHTLSRK KRADKKENGT KL»L 

Seq ZD NO j 9 DNA sequence 

Nucleic Acid Accession # t Eos sequence 

Coding sequence: 336-632 

1 11 21 31 41 51 

I I I I I 1 

CTCCCCTCAC CCCGGTCCAG GATGCCCAGT CCCCACGACA CCTCCCACTT CCCACTGTGG 60 

CCTGGGTGGG CTCAGGGGCT GCCCTTGACC TGGCCTAGAG CCCTCCCCCA GCTGGTGGTG 120 

GAGCTGGCAC TCTCTGGGAG GGAGGGGGCT GGGAGGGAAT GAGTGGGAAT GGCAAGAGGC 180 

CAGGGTTTGG TGGGATCAGG TTGAGGCAGG TTTGGTTTCC TTAAAATGCC AAGTTGGGGG 240 

CCAGTGGGGC CCACATATAA ATCCTCACCC TGGGAGCCTG GCTGCCTTGC TCTCCTTCCT 300 

GGGTCTGTCT CTGCCACCTG GTCTGCCACA GATCCATGAT GTGCAGTTCT CTGGAGCAGG 360 

CGCTGGCTGT GCTGGTCACT ACCTTCCACA AGTACTCCTG CCAAGAGGGC GACAAGTTCA 420 

AG CTG AGTAA GGGGGAAATG AAGGAACTTC TGCACAAGGA GCTGCCCAGC TTTGTGGGGG 480 

AGAAAGTGGA TGAGGAGGGG CTGAAGAAGC TGATGGGCAG CCTGGATGAG AACAGTGACC 540 

AGCAGGTGGA CTTCCAGGAG TATGCTGTTT TCCTGGCACT CATCACTGTC ATGTGCAATG 600 

ACTTCTTCCA GGGCTGCCCA GACCGACCCT GAAGCAGAAC TCTTGACTTC CTGCCATGGA 660 

TCTCTTGGGC CCAGGACTGT TGATGCCTTT GAGTTTTGTA TTCAATAAAC TTTTTTTGTC 720 

TGTTGATAAT ATTTTAATTG CTCAGTGATG TTCCATAACC CGGCTGGCTC AGCTGGAGTG 780 

CTGGGAGATG AGGGCCTCCT GGATCCTGCT CCCTTCTGGG CTCTGACTCT CCTGGAAATC 840 

TCTCCAAGGC CAGAGCTATG CTTTAGGTCT CAATTTTGGA ATTTCAAACA CCAGCAAAAA 900 

ATTGGAAATC GAGATAGGTT GCTGACTTTT ATTTTGTCAA ATAAAGATAT TAAAAAAGGC 960 
AAATACCA 

Seq ID NO: 10 Protein sequence: 



191 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 
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Protein Accession #: NJ?_005969.1 

1 11 21 31 41 51 

I I I I I I 

MMCSSLEQAL AVLVTTPHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP 



9eg ID NO i 11 DNA sequence 

Nucleic Acid Accession #> Eos sequence 

Coding sequences 336-626 



PCT/US02/12476 



CTCCCCTCAC 
CCTGGGTGGG 
GAGCTGGCAC 
CAGGGTTTGG 
CCAGTGGGGC 
GGGTCTGTCT 
CGCTGGCTGT 
AG CTGAGTAA 
ATTCCAGAGA 
GAGACTTGAG 
GGAGAAAGTG 
CCAGCAGGTG 
TGACTTCTTC 
GATCTCTTGG 
TCTGTTGATA 
TGCTGGGAGA 
TCTCTCCAAG 
AAATTGGAAA 
GCAAATACCA 



11 

I 

CCCGGTCCAG 
CTCAGGGGCT 
TCTCTGGGAG 
TGGGATCAGG 
CCACATATAA 
CTGCCACCTG 
GCTGGTCACT 
GGGGGAAATG 
ACCATGTGCT 
AAACCAGAGC 
GATGAGGAGG 
GACTTCCAGG 
CAGGGCTGCC 
GCCCAGGACT 
ATATTTTAAT 
TGAGGGCCTC 
GCCAGAGCTA 
TCGAGATAGG 



21 
I 

GATGCCCAGT 
GCCCTTGACC 
GGAGGGGGCT 
TTGAGGCAGG 
ATCCTCACCC 
GTCTGCCACA 
ACCTTCCACA 
AAGGAACTTC 
GTGAGGGCCT 
CCAGAAGGGA 
GGCTGAAGAA 
AGTATGCTGT 
CAGACCGACC 
GTTGATGCCT 
TGCTCAGTGA 
CTGGATCCTG 
TGCTTTAGGT 
TTGCTGACTT 



31 
I 

CCCCACGACA 
TGGCCTAGAG 
GGGAGGGAAT 
TTTGGTTTCC 
TGGGAGCCTG 
GATCCATGAT 
AGTACTCCTG 
TGCACAAGGA 
TCCGAGTCCA 
AAAGTGATTG 
GCTGATGGGC 
TTTCCTGGCA 
CTGAAGCAGA 
TTGAGTTTTG 
TGTTCCATAA 
CTCCCTTCTG 
CTCAATTTTG 
TTATTTTGTC 



41 
I 

CCTCCCACTT 
CCCTCCCCCA 
GAGTGGGAAT 
TTAAAATGCC 
GCTGCCTTGC 
GTGCAGTTCT 
CCAAGAGGGC 
GCTGCCCAGC 
TCTGTTTAAT 
TCCCAAGATC 
AGCCTGGATG 
CTCATCACTG 
ACTCTTGACT 
TATTCAATAA 
CCCGGCTGGC 
GGCTCTGACT 
GAATTTCAAA 
AAATAAAGAT 



51 
I 

CCCACTGTGG 
GCTGGTGGTG 
GGCAAGAGGC 
AAGTTGGGGG 
TCTCCTTCCT 
CTGGAGCAGG 
GACAAGTTCA 
TTTGTGGGGC 
CCTGTCATTG 
ACACAGCACT 
AGAACAGTGA 
TCATGTGCAA 
TCCTGCCATG 
ACTTTTTTTG 
TCAGCTGGAG 
CTCCTGGAAA 
CACCAGCAAA 
ATTAAAAAAG 



Seq ID NO: 12 Protein sequence 1 
Protein Accession # i Eos sequence 

1 11 21 31 41 51 

1)1111 
MMCSSLEQAL AVLVTTPHKY SCQEGDKFKL SKGEMKELLH KELPSFVGHS REPCAVRAFR 
VHLFNPVIGD LRNQSPEGKS DCPKITQHWR KWMRRG 



Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession ft. Eos sequence 

Coding sequence: 58-354 



1 

I 

GTGAGCTCAC 
ATGTGCAGTT 
TGCCAAGAGG 
GAGCTGCCCA 
AGCCTGGATG 
CTCATCACTG 
ACTCTTGACT 
TATTCAATAA 
CCCGGCTGGC 
GGCTCTGACT 
GAATTTCAAA 
AAATAAAGAT 



11 
I 

CATGTGGGGG 
CTCTGGAGCA 
GCGACAAGTT 
GCTTTGTGGG 
AGAACAGTGA 
TCATGTGCAA 
TCCTGCCATG 
ACTTTTTTTG 
TCAGCTGGAG 
CTCCTGGAAA 
CACCAGCAAA 
ATTAAAAAAG 



21 
! 

TGAGGCTGAG 
GGCGCTGGCT 
CAAGCTGAGT 
GGAGAAAGTG 
CCAGCAGGTG 
TGACTTCTTC 
GATCTCTTGG 
TCTGTTGATA 
TGCTGGGAGA 
TCTCTCCAAG 
AAATTGGAAA 
GCAAATACCA 



31 

I 

AGAAAACAAG 
GTGCTGGTCA 
AAGGGGGAAA 
GATGAGGAGG 
GACTTCCAGG 
CAGGGCTGCC 
GCCCAGGACT 
ATATTTTAAT 
TGAGGGCCTC 
GCCAGAGCTA 
TCGAGATAGG 



41 

I 

TACACAGCCA 
CTACCTTCCA 
TGAAGGAACT 
GGCTGAAGAA 
AGTATGCTGT 
CAGACCGACC 
GTTGATGCCT 
TGCTCAGTGA 
CTGGATCCTG 
TGCTTTAGGT 
TTGCTGACTT 



51 

I 

CAGATCCATG 
CAAGTACTCC 
TCTGCACAAG 
GCTGATGGGC 
TTTCCTGGCA 
CTGAAGCAGA 
TTGAGTTTTG 
TGTTCCATAA 
CTCCCTTCTG 
CTCAATTTTG 
TTATTTTGTC 



Seq ID NO: 14 Protein sequence: 
Protein Accession ft: NP_005969.1 

1 11 21 31 41 51 

I I i I I I 

MMCSSLEQAL AVLVTTPHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 
GSLDENSDQQ VDFQEYAVPL ALITVMCNDF FQGCPDRP 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 62-358 



GGAGGGTGTG 
CATGATGTGC 
CTCCTGCCAA 
CAAGGAGCTG 
GGGCAGCCTG 
GGCACTCATC 
CAGAACTCTT 
TTTGTATTCA 
ATAACCCGGC 
TCTGGGCTCT 
TTTGGAATTT 



11 

I 

CCGCTGAGTC 
AGTTCTCTGG 
GAGGGCGACA 
CCCAGCTTTG 
GATGAGAACA 
ACTGTCATGT 
GACTTCCTGC 
ATAAACTTTT 
TGGCTCAGCT. 
GACTCTCCTG 
CAAACACCAG 



21 

! 

ACTGCCTGGG 
AGCAGGCGCT 
AGTTCAAGCT 
TGGGGGAGAA 
GTGACCAGCA 
GCAATGACTT 
CATGGATCTC 
TTTGTCTGTT 
GGAGTGCTGG 
GAAATCTCTC 
CAAAAAATTG 



31 
I 

CATCTGGGCC 
GGCTGTGCTG 
GAGTAAGGGG 
AGTGGATGAG 
GGTGGACTTC 
CTTCCAGGGC 
TTGGGCCCAG 
GATAATATTT 
GAGATGAGGG 
CAAGGCCAGA 
GAAATCGAGA 



41 
I 

TGGAACCTCG 
GTCACTACCT 
GAAATGAAGG 
GAGGGGCTGA 
CAGGAGTATG 
TGCCCAGACC 
GACTGTTGAT 
TAATTGCTCA 
CCTCCTGGAT 
GCTATGCTTT 
TAGGTTGCTG 



51 

I 

GCCACAGATC 
TCCACAAGTA 
AACTTCTGCA 
AGAAGCTGAT 
CTGTTTTCCT 
GACCCTGAAG 
GCCTTTGAGT 
GTGATGTTCC 
CCTGCTCCCT 
AGGTCTCAAT 
ACTTTTATTT 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



192 
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TGTCAAATAA AGATATTAAA AAAGGCAAAT ACCA 

Seq ID NO: 16 Protein sequence: 
Protein Accession #: NP_005969.1 

1 11 21 31 41 51 

I I I I I I 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKQEMKELLH KEIjPSFVGEK VDEEGLKKLM 60 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDP FQGCPDRP 



Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 939-2372 

1 11 21 31 41 51 

I I I I I I 

AAGACGGATT CTCAQACAAG GCTTGCAAAT GCCCCGCAGC CATCATTTAA CTGCACCCGC 60 

AGAATAGTTA CGGTTTGTCA CCCGACCCTC CCGGATCGCC TAATTTGTCC CTAGTGAGAC 120 

CCCGAGGCTC TGCCCGCGCC TGGCTTCTTC GTAGCTGGAT GCATATCGTG CTCCGGGCAG 180 

CGCGGGCGCA GGGCACGCGT TCGCGCACAC CCTAGCACAC ATGAACACGC GCAAGAGCTG 240 

AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACCGCGATT GTAGAAGAGA 300 

CTGTGGTGTG AATTAGGGAC CGGGAGGCGT CGAACGGAGG AACGGTTCAT CTTAGAGACT 360 

AATTTTCTGG AGTTTCTGCC CCTGCTCTGC GTCAGCCCTC ACGTCACTTC GCCAGCAGTA 420 

GCAGAGGCGG CGGCGGCGGC TCCCGGAATT GGGTTGGAGC AGGAGCCTCG CTGGCTGCTT 4 B0 

CGCTCGCGCT CTACGCGCTC AGTCCCCGGC GGTAGCAGGA GCCTGGACCC AGGCGCCGCC 540 

GGCGGGCGTG AGGCGCCGGA GCCCGGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT 600 

AACAAGGAAT CTGCGCCCCA GAGAGTCCCG GGAGCGCCGC CGGTCGGTGC CCGGCGCGCC 660 

GGGCCATGCA GCGACGGCCG CCGCGGAGCT CCGAGCAGCG GTAGCGCCCC CCTGTAAAGC 720 

GGTTCGCTAT GCCGGGGCCA CTGTGAACCC TGCCGCCTGC CGGAACACTC TTCGCTCCGG 780 

ACCAGCTCAG CCTCTGATAA GCTGGACTCG GCACGCCCGC AACAAGCACC GAGGAGTTAA 840 

GAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGG GGGAAAGCGG CCGGTGCAGC 900 

GCGGGGACAG GCACTCGGGC TGGCACTGGC TGCTAGGGAT GTCGTCCTQG ATAAGGTGGC 960 

ATGGACCCGC CATGGCGCGG CTCTGGGGCT TCTGCTGGCT GGTTGTGGGC TTCTGGAGGG 1020 

CCG CT T T CGC CTGTCCCACG TCCTGCAAAT GCAGTGCCTC TCGGATCTGG TGCAGCGACC 1080 

CTTCTCCTGG CATCGTGGCA TTTCCGAGAT TGGAGCCTAA CAGTGTAGAT CCTGAGAACA 1140 

TCACCGAAAT TTTCATCGCA AACCAGAAAA GGTTAGAAAT CATCAACGAA GATGATGTTG 1200 

AAGCTTATGT GGGACTGAGA AATCTGACAA TTGTGGATTC TGGATTAAAA TTTGTGGCTC 1260 

ATAAAGCATT TCTGAAAAAC AGCAACCTGC AGCACATCAA TTTTACCCGA AACAAACTGA 1320 

CGAGTTTGTC TAGGAAACAT TTCCGTCACC TTGACTTGTC TGAACTGATC CTGGTGGGCA 1380 

ATCCATTTAC ATGCTCCTGT GACATTATGT GGATCAAGAC TCTCCAAGAG GCTAAATCCA 1440 

GTCCAGACAC TCAGGATTTG TACTGCCTGA ATGAAAGCAG CAAGAATATT CCCCTGGCAA 1500 

ACCTGCAGAT ACCCAATTGT GGTTTGCCAT CTGCAAATCT GGCCGCACCT AACCTCACTG 1560 

TGGAGGAAGG AAAGTCTATC ACATTATCCT GTAGTGTGGC AGGTGATCCG GTTCCTAATA 1620 

TGTATTGGGA TGTTGGTAAC CTGGTTTCCA AACATATGAA TGAAACAAGC CACACACAGG 1680 

GCTCCTTAAG GATAACTAAC ATTTCATCCG ATGACAGTGG GAAGCAGATC TCTTGTGTGG 1740 

CGGAAAATCT TGTAGGAGAA GATCAAGATT CTGTCAACCT CACTGTGCAT TTTGCACCAA 1800 

CTATCACATT TCTCGAATCT CCAACCTCAG ACCACCACTG GTGCATTCCA TTCACTGTGA 1860 

AAGGCAACCC CAAACCAGCG CTTCAGTGGT TCTATAACGG GGCAATATTG AATGAGTCCA 1920 

AATACATCTG TACTAAAATA CATGTTACCA ATCACACGGA GTACCACGGC TGCCTCCAGC 1980 

TGGATAATCC CACTCACATG AACAATGGGG ACTACACTCT AATAGCCAAG AATGAGTATG 2040 

GGAAGGATGA GAAACAGATT TCTGCTCACT TCATGGGCTG GCCTGGAATT GACGATGGTG 2100 

CAAACCCAAA TTATCCTGAT GTAATTTATG AAGATTATGG AACTGCAGCG AATGACATCG 2160 

GGGACACCAC GAACAGAAGT AATGAAATCC CTTCCACAGA CGTCACTGAT AAAACCGGTC 2220 

GGGAACATCT CTCGGTCTAT GCTGTGGTGG TGATTGCGTC TGTGGTGGGA TTTTGCCTTT 2280 

TGGTAATGCT G TTTCTGCTT AAGTTGGCAA GACACTCCAA GTTTGGCATG AAAGGTTTTG 2340 

TTTTGTTTCA TAAGATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGG 2400 

GGCTGTGGTG CTTGTTGGTT GATGCTGCCA TGTAAGCTGG ACTCCTGGGA CTGCTGTTGG 2460 

CTTATCCCGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCG GTGTTTGGAG 2520 

GCTGTACTAT ATGAAGCCTG CATATACTGT GAGCTGTGAT TGGGGAACAC CAATGCAGAG 2580 

GTAACTCTCA GGCAGCTAAG CAGCACCTCA AGAAAACATG TTAAATTAAT GCTTCTCTTC 2640 

TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCCATT GGATTGTACT TCTCTTCTGA 2700 

AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 

TTGACCTGCA AAGTTAAAAA AAAATTAAAG TTGAGAACAG GTATAAGTGC ACACTGAATA 2820 

GTCTAAT CT A CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 

TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940 

ACCATCACTT TGGGACTTGG TAGTATTATT AAAAGGTTAT TTCCTTCACT GTCAATAAAA 3000 

GTCCAAATGT TTAGCTTAGG TCTGAGAGTC AAACAATGTT AAGGATTGTC TTAAAGTTCC 3060 

TTAGCCAGCA AAACAAAACA AAACAAAACA AACAAATGAA AAACGTTTAA AAAGAAGAAG 3120 

AAGAAAAAAA ACAAGAACAA GCAGCAACAG CTGTTTTGTT GGGGCTATAG ATTTAAGTTA 3180 

GGCATAGTCA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 

TTGCCCTTTT TTATTTGCCC TCTGCGATCC ACCTGCTTTT TAGAAGTCTG CCGAGTGAGA 3300 

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 3420 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 3480 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 3600 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 3 660 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CACCCCACCC CACCCCACCC 3720 

TGTT CCTTTT TTATCAGGAG GACTTCAGAG CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780 

AATCAGCTCT GACAGTTAGA CATGCACACA GACGCCATAG CTGGATTGGA AACATTGATG 3840 

TTTTAAAAAT TTATTTTTTT TGGAAATAGT TGCACAAATG CTGCAATTTA GCTTTAAGGT 3900 

TCTATAGATT TTTAACTAGT CCAACACAGT CAGAAACATT GTTTTGAATC CTCTGTAAAC 3960 

CAAGGCATTA ATCTTAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 4020 

TCCATAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATACG TTATTGCTTG 4080 

ATGAAGACCT TTCACAGAAT CCTATGGATT GCAGCATTTC ACTTGGCTAC TTCATACCCA 4140 



193 



WO 02/086443 

TGCCTTAAAG AGGGGCAGTT TCTCAAAAGC AGAAACATGC CGCCAGTTCT CAAGTTTTCC 4200 

TCCTAACTCC ATTTGAATGT AAGGGCAGCT GGCCCCCAAT GTGGGGAGGT CCGAACATTT 4260 

TCTGAATTCC CATTTTCTTG TTCGCGGCTA AATGACAGTT TCTGTCATTA CTTAGATTCC 4320 

GATCTTTCCC AAAGGTGTTG ATTTACAAAG AGGCCAGCTA ATAGCAGAAA TCATGACCCT 4380 

GAAAGAGAGA TGAAATTCAA GCTGTGAGCC AGGCAGGAGC TCAGTATGGC AAAGGTTCTT 4440 

GAGAATCAGC CATTTGGTAC AAAAAAGATT TTTAAAGCTT TTATGTTATA CCATGGAGCC 4500 

ATAGAAAGGC TATGGATTGT TTAAGAACTA TTTTAAAGTG TTCCAGACCC AAAAAGGAAA 4560 

AATAAAAAAA AAGGAATATT TGTACCCAAC AGCTAGAAGG ATTGCAAGGT AGATTTTTGT 4620 

TTTAAAATGG AGAGAAGTGG A CAGAT AAGG CCATTTAATA TATCAAAGAT CAGTTGACAT 4680 
CTCCTAGGGA ATGATGAAAA CAGCAGGCTA T 

Seq ZD NO i 18 Protein sequence: 
Protein Accession #: CAA53S71 

1 11 21 31 41 51 

I I I I I I 

MSSWIRWHGP AMARLWGFCW IjWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAKKA FLKNSNLQHI , 120 

NPTRNKLTSL SRKHPRHLDL SELILVGNPF TCSCDIMWrK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHPMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 
DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGFVLF HKIPLDG 



Seq ID NO t 19 DNA sequence 

Nucleic Acid Accession #: NM_000228 

Coding sequence: 82-3600 

1 11 21 31 41 51 

I 1 I I I I 

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60 

GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 120 

CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 180 

CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240 

ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300 

CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 360 

TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480 

GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540 

ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600 

CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660 

ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720 

ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780 

CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840 

GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC" CACCGCTGTG 900 

CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 960 

GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGG CGGAGG GCCAGGACGC CCATGAATGC 1020 

CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080 

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200 

GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260 

CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320 

AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440 

CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560 

CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620 

GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680 

TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 1740 

CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800 

CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860 

GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920 

GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG AT CCT AGATG CAAAGAGTAA GATTGAGCAG 1980 

ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040 

GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100 

GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220 

GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280 

GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 2340 

CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400 

ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGGCTTGCA CCCCAATATC ATGC CCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520 

TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580 

CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640 

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG QAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 

AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 

AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 33 00 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 
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CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC 
GAGCTGTTTG GGGAGACCAT GGAGATGATG 
CTGCGGGGCA GCCAGGCCAT CATGCTGCGC 
GTGGAGCAGA TCCGTGACCA CATCAATGGG 
TGCTACAGCT TCCAGCCCGT TGCCCCACTC 
GATTGGGTTG GAATGCTTTC CATCTCCAGG 
GACCACCCCT GGTGTGTAGC TAGTAAGATT 
GGGACAGTTA CACTTGACAG ACAAAGATGG 
CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG 
GGAATCCTGG ACCAAGCACA AAAACTTAAC 
AAAATCTTTG G 

Seq ID NO t 20 Protein sequence i 
Protein Accession #: NP 000219 



PCT/US02/12476 



1 

I 

MRPFPLLCPA 
EWQMKCCKCD 
MEFQGPMPAG 
NARXiNGGKVQ 
VSQLRLQGSC 
PWRPAEGQDA 
HYFRNRRPGA 
TYANPQGCHR 
CACDPHNSPQ 
TEGPGCDKAS 
RLRNATASLW 
TLQGLQIiDLP 
AYEQSAQAAQ 
TPTFNKLCGN 
GFNAQLQRTR 
DPDTDAATIQ 
RLQAEAEEAR 
VIiRPAEKLVT 
IKQKYAELKD 
' MLRSADLTGL 



11 
I 

LPGLLHAQQA 
SRQPHNYYSH 
MLIBRSSDFG 
LNLMDLVSGI 
PCHGHADRCA 
HECQRCDCNG 
SIQETCISCE 
CECNILGSRR 
PTVQPVHRAV 
GRCLCRPGIiT 
SGPGLEDRGL 
LEEETLSLPR 
QVSDSSRLLD 
SRQMACTPIS 
QMIRAAEBSA 
EVSEAVLALW 
SRAHAVEGQV 
SMTKQW3DFW 
RLGQSSMLGS 
EKRVEQIRDH 



21 
I 

CSRGACYPPV 
RVENVASSSG 
KTWRVYQYLA 
PATQSQKIQE 
PKPGASAGPS 
HSETCHPDPA 
CDPDGAVPGA 
DMPCDEESGR 
PCREGFGGLM 
GPRCDQCQRG 
ASRILDAKSK 
DLESLDRSFN 
QLRDSRREAE 
CPGELCPQDN 
SQIQSSAQRL 
LPTDSATVLQ 
EDWGNLRQG 
TRMEELRHQA 
QGARIQSVKT 
INGRVIiYYAT 



CGGATCCAGA 
GACAGGATGA 
TCGGCGGACC 
CGCGTGCTCT 
ATCTGCCGCC 
AGACTTTCAT 
ACCCTGAGCT 
TGGAGATTGG 
TATCCCCCGC 
AAAAGTGATG 



31 
I 

GDLLVGRTRF 
PMRWWQSQND 
ADCTSTFPRV 
VGEITNLRVN 
TAVQVHDVCV 
VFAASQGAYG 
PCDPVTGQCV 
CLCLPMWGP 
CSAAAIRQCP 
YCNRYPVCVA 
IEQIRAVLSS 
GLLTMYQRKR 
RLVRQAGGGG 
GTACGSRCRG 
ETQVSA6RSQ 
KMNEIQAIAA 
TVALQEAQDT 
RQQGAEAVQA 
EAEELFGETM 
CK 



GTGTGAAGAC 
AAGACATGGA 
TGACAGGACT 
ACTATGCCAC 
TTTGCTTTTG 
GCAGCCTAAA 
GCAGCTGAGC 
CATGCCATTG 
CTTTAGTTCT 
TAAAAATGAA 



41 
I 

LRASSTCGLT 
VNPV8LQLDL 
RQGRPQSWQD 
FTRLAPVPQR 
CQHNTAGPNC 
GVCDNCRDHT 
CKEHVQGERC 
KCDQCAPYHW 
DRTYGDVATG 
CHPCPQTYDA 
PAVTEQEVAQ 
EQFEKISSAD 
GTGSPKLVAL 
VLPRAGGAFL 
MEEDVRRTRL 
RLPNVDLVLS 
MQGTSRSLRL 
QQLAEGASEQ 



Seq ID NO: 21 DNA sequence 

Nucleic Acid Accession #: NMJJ03722 

Coding sequence: 145-1491 



1 
I 

TCGTTGATAT 
ACAGTACTGC 
AAAGAAAGTT 
CCAGAGGTTT 
ATTGACTTGA 
AGCATGGACT 
ACGAACCTGG 
AGTCCCTATA 
CCCAGCTCCA 
CCAGGCCCGC 
TGGACGTATT 
CAGATCAAGG 
AAAAAAGCTG 
GAATTCAACG 
CATGCCCAGT 
CCACCCCAGG 
TGTGTTGGAG 
GGGCAAGTCC 
AGGAAGGCGG 
GATGGTACGA 
AAACGAAGAT 
GAAATGCTGT 
ATTGAAACGT 
CTTTCAGCCT 
GACGTCTTCT 
TCTATATTTT 
TGTGTGTGCG 
CCCAACTGCT 
TTACAAGAAA 
GAACCACTGT 
GAAAGGGGCA 
AATTCACAGG 
AAAAAAGTTG 
CCCTTTTAAT 
TACTGCTGGG 
TTTGTGAGAA 
GCTGTGTACC 
CATGAAACCC 
CTCATTTTGT 
TGTTTACCAT 
AATTTGCTTA 
CTGATACTGT 
AGACGTGTTA 



11 

I 

CAAAGACAGT 
CCTGACCCTT 
ATTACCGATC 
TCCAGCATAT 
ACTTTGTGGA 
GTATCCGCAT 
GGCTCCTGAA 
ACACAGACCA 
CCTTCGATGC 
ACAGTTTCGA 
CCACTGAACT 
TGATGACCCC 
AGCACGTCAC 
AGGGACAGAT 
ATGTAGAAGA 
TTGGCACTGA 
GGATGAACCG 
TGGGCCGACG 
ATGAAGATAG 
AGCGCCCGTT 
CCCCAGATGA 
TGAAGATCAA 
ACAGGCAACA 
GCTTCAGGAA 
TTAGACATTC 
AAGTGTGTGT 
TGTGTATCTA 
CAAAGGCACA 
GGATGTTTTC 
GTTTGTCTGT 
TTAAGATGTT 
GAAGCTTTTG 
TTATTGTCTG 
GCTGGTCATG 
CAGCGAGGTG 
CTTGCATTAT 
TGCCTCTGCC 
TGGAAGACCT 
GCTTTTAATA 
TATTCAAAGC 
ATTAGAGCTT 
TCAGTGCATT 
AAATCAGCAC 



21 

I 

TGAAGGAAAT 
ACAT CCAGCG 
CACCATGTCC 
CTGGGATTTT 
TGAACCATCA 
GCAGGACTCG 
CAGCATGGAC 
CGCGCAGAAC 
TCTCTCTCCA 
CGTGTCCTTC 
GAAGAAACTC 
ACCTCCTCAG 
GGAGGTGGTG 
TGCCCCTCCT 
TCCCATCACA 
ATTCACGACA 
CCGTCCAATT 
CTGCTTTGAG 
CATCAGAAAG 
TCGTCAGAAC 
TGAACTGTTA 
AGAGTCCCTG 
GCAACAGCAG 
TGAGCTTGTG 
CAAGCCCCCA 
GTTGTATTTC 
GCCCTCATAA 
AAGCCACTAG 
TGCAGATTTT 
GAGCTTTCTG 
TATTGGAACC 
AGCAGGTCTC 
TGCATAAGTA 
TAATAATATT 
ATCATTACCA 
TTGTGTCCTC 
ACTGTATGTT 
ACTACAAAAA 
GAAAGACAAA 
TCAAAATAGA 
CTATCCCTCA 
TAGCCAGGAG 
TCCTGGACTG 



31 
I 

GAATTTTGAA 
TTTCGTAGAA 
CAGAGCACAC 
CTGGAACAGC 
GAAGATGGTG 
GACCTGAGTG 
CAGCAGATTC 
AGCGTCACGG 
TCACCCGCCA 
CAGCAGTCGA 
TACTGCCAAA 
GGAGCTGTTA 
AAGCGGTGCC 
AGTCATTTGA 
GGAAGACAGA 
GTCTTGTACA 
TTAATCATTG 
GCCCGGATCT 
CAGCAAGTTT 
ACACATGGTA 
TACTTACCAG 
GAACTCATGC 
CAGCACCAGC 
GAGCCCCGGA 
AACCGATCAG 
CATGTGTATA 
ACAGGACTTG 
TGAGAGAATC 
GTATCCTTAG 
TTGTTTCCTG 
CTTTTCTGTC 
AAACTTAAGA 
AGTTGTAGGT 
GCAAGTAGTA 
AAAGTAATCA 
CCCTCATGTG 
GGCATCTGTT 
AACTGTTGTT 
TCCACCCCAG 
ATTTGAAGCC 
AGCCTACCTA 
ACTTACGTTT 
GAAATTAAAG 



41 
I 

ACTTCACGGT 
ACCCAGCTCA 
AGACAAATGA 
CTATATGTTC 
CGACAAACAA 
ACCCCATGTG 
AGAACGGCTC 
CGCCCTCGCC 
TCCCCTCCAA 
GCACCGCCAA 
TTGCAAAGAC 
TCCGCGCCAT 
CCAACCATGA 
TTCGAGTAGA 
GTGTGCTGGT 
ATTTCATGTG 
TTACTCTGGA 
GTGCTTGCCC 
CGGACAGTAC 
TCCAGATGAC 
TGAGGGGCCG 
AGTACCTTCC 
ACTTACTTCA 
GAGAAACTCC 
TGTACCCATA 
TGTGAGTGTG 
AAGACACTTT 
TTTTGAAGGG 
ACCGGCCATT 
GGAGGGAGGG 
TTCTTCTGTT 
TGTCTTTTTA 
GACTGAGAGA 
AGAAACGAAG 
ACTTTGTGGG 
TAGGTAGAAC 
ATGCTAAAGT 
TGGCCCCCAT 
TAATATTGCC 
CTCTCACAAA 
CCATAAAACC 
TGAGTAAGTG 
ATTGAAAGGG 



AGAGGCAGAG 
GTTGGAGCTG 
GGAGAAGCGT 
CTGCAAGTGA 
GTTGGGGGCA 
GTACAGCCTG 
CTGAGCCAAT 
AAACTAAGAG 
CCACTGGGGA 
AAGCCAAATA 



51 
I 

KPETYCTQYG 
DRRFQLQEVM 
VRCQSLPQRP 
GYHPPSAYYA 
ERCAPFYNNR 
EGKNCERCQIi 
DliCKPGFTGIi 
KLASGQGCEP 
CRACDCDFRG 
DLREQALRFG 
VASAILSLRR 
PSGAFRMLST 
RLEMSSLPDL 
MAGQVAEQLR 
IjIQQVRDFI.T 
QTKQDIARAR 
IQDRVAEVQQ 
ALSAQEGFER 
LELLRGSQAI 



51 
I 

GTGCCACCCT 
TTTCTCTTGG 
ATTCCTCAGT 
AGTTCAGCCC 
GATTGAGATT 
GCCACAGTAC 
CTCGTCCACC 
CTACGCACAG 
CACCGACTAC 
GTCGGCCACC 
ATGCCCCATC 
GCCTGTCTAC 
GCTGAGCCGT 
GGGGAACAGC 
ACCTTATGAG 
TAACAGCAGT 
AACCAGAGAT 
AGGAAGAGAC 
AAAGAACGGT 
ATCCATCAAG 
TGAGACTTAT 
TCAGCACACA 
GAAACATCTC 
AAAACAATCT 
GAGCCCTATC 
TGTGTGTGTA 
GGCTCAGAGA 
ACTCAAACCT 
GGTGGGTGAG 
GTCAGGTGGG 
GTTTTTCTAA 
AGAAAAGGAG 
CTCAGTCAGA 
GTGTCAAGTG 
TGGAGAGTTC 
ATTTCTTAAT 
TTTTCTTGTA 
AGCAGGTGAA 
CTTACGTAGT 
ATCTGTGATT 
AGCCATATTA 
AGATCCAAGC 
TAGACTACTT 



3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
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TTCTTTTTTT TACTCAAAAG TTTAGAGAAT 
TTAAGATAAT AGCATAAAGA CTTTAAAAAT 
CACCAGCACT GTATTTTCTG TCACCAAGAC 
TTGTGGATGT GTGATTTTAA TTTTCAATAA 



CTCTGTTTCT TTCCATTTTA AAAACATATT 2640 

GTTCCTCCCC TCCATCTTCC CACACCCAGT 2700 

AATGATTTCT TGTTATTGAG GCTGTTGCTT 2760 
ACTTTTGCAT CTTGGTTTAA AAGAAA 



Seq ID NO: 22 Protein sequence: 
Protein Accession #i NP_003713 



1 11 21 31 41 51 

I I I I I I 

MSQSTQTNEF IiSPEVFQHIW DFLEQPICSV QPIDLNFVDE PSEDGATNKI EISMDCIRMQ 60 

DSDLSDPMWP QYTNLGL.LNS MDQQIQNGSS STSPYNTDHA QNSVTAPSPY AQPSSTPDAL 120 

SPSPAIPSNT DYPGPHSPDV SFQQSSTAKS ATWTYSTELK KLYCQ1AKTC PIQIKVMTPP 180 

PQGAVIRAMP VYKKAEHVTE WKRCPNHEL SREFNEGQIA PPSHLIRVEG NSHAQYVEDP 240 

ITGRQSVLVP YEPPQVGTEF TTVLYNFMCN SSCVGGMNRR PILIIVTLET RDGQVLGRRC 300 

FEARICACPG RDRKADEDSI RKQQVSDSTK NGDGTKRPFR QNTHGIQMTS IKKRRSPDDE 3 60 

LLYIiPVRGRE T YEMLLKI KB SLELMOYLPQ HTIETYRQQQ QQQHQHLLQK HLLSACFRNE 420 
LVEPRRETPK QSDVPPRHSK PPNRSVYP 



Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession #: NM_001944.1 

Coding sequence: 84-3083 

1 11 21 31 41 51 

I I I I I I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGOATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2680 

TAAGGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3 060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 



Seq ID NO x 24 Protein sequence: 
Protein Accession #: NP_001935.1 

1 11 21 31 41 51 
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i r i i 

MMGLFPRTTG ALAIFVWIL VHGELRIETK GQYDEBEMTM 
GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF 
PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ 
ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT 
DKDGEGI*STQ CECNIKVKDV NDNFPMFRDS QYSARIEENI 
WLAVYFFTSG NEGNWPEIQT DPRTNEGILK WKALDYBQL 
SRYRVQ9TPV TIQVINVREG IAFRPASKTF TVQKGISSKK 
SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTPIVMK 
VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTLNNRY 
ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR 
TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT 
GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS 
MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS 
DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG 
LDDSFIiDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG 
LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 56-1642 



PCT/US02/12476 



AGTATCCCAG 
GCAAGGGATC 
CATGTTTGAG 
CTGCTCTGTC 
GGAGAAGGTG 
GGAAGATCAG 
GGACTCTTTT 
CTTTTCCCAG 
GGAGATGGTA 
TAACTCAGGG 
GTCCCTGGCG 
GCCCTTGCTC 
GAAGAAGCTG 
GAGGAGTGTC 
TGGGCTCTCT 
ATGGGCACAG 
GATCTCATTC 
ACAGCGCAAG 
AGATCTCAAC 
TCGTAAGAAC 
CATCTTCTCA 
CGAGCTGTCA 
ACGGTTGAAG 
TGCCCTTCGT 
CAAGTTGACT 
CAATGTGAAT 
CATTGCTAGC 
ACTCGTTCAT 
CAGACACAGG 
AGGAGCTCCT 
AGCTACAGCT 
AACAGCGGGA 
TGTATGAAGA 
TTCAGGAGCG 
AGTCAGTGGC 
CAGCTTCTGC 
CAGAGCTAAA 
CCTCAGCCAA 
TAAGGCTGTT 
CTTGTTGCCA 
TCTTAATCAA 
TGGACCTTCG 
AAGGCCAGGT 
AACAACCACC 
AAAGCTCAAC 
TCAAATCTGG 
GGCCCTGAGG 
TATCAGGAAT 
TATAACCACC 
GCACACAAAA 
GTAGCAAAAT 



11 
I • 

GAGGAGCAAG 
CTTTCTCCGC 
TCCACAGCTG 
GTCTCTACCT 
AAAGTATACT 
GGTTGTGTCC 
GCCCTGAAGA 
ATCTTTGGGC 
AAGGATGTAC 
AAAACCCACA 
CTGATCTTCA 
TCCAATGAGG 
TCCCTGCTAA 
TACATCGAAA 
TCTATCAGTC 
CCAGACACTG 
TTTGAGATCT 
AGGCAGACTT 
TGGATTCATG 
CAGAGCTTTG 
ATCAGGATCC 
CTCTGTGATC 
GAAGCAGGAA 
CAAAACCAGC 
CGAGTGTTCC 
CCCTGTGCAT 
CAGGTGACTT 
CAAGGAACAT 
CCTTGATGAT 
ACAAGTTGTG 
GGAGATGCAT 
ACAGTGGTGC 
AAAACTAAAT 
GGATGAAAAG 
CCATCAGCAA 
CTCCACCCAG 
CTCTACCACT 
GCCCTTCACC 
GCGGACAGAG 
CAGCACTGGG 
ACAGGACCAG 
GAAGAAGGCA 
TTCTGCCAAA 
AGGGAAGAAA 
AGACTGCAGC 
GCCTTTTGGC 
TGGGTCAGCT 
TATATCCAGG 
TATGTAATCT 
ACAGTTATAT 
CATTAAAACA 



21 
I 

TGGCACGTCT 
CAGCGGGCTT 
CAGATTTGGG 
CCCTAGAGGA 
TGAGGGTTAG 
GTATTGAGAA 
GCAATGAACG 
CAGAAGTGGG 
TCAAAGGGCA 
CGATTCAAGG 
ATAGCCTCCA 
TAATCTGGCT 
ATGGAGGCCT 
GTCGGATAGG 
AGTGTACCAG 
CCCCACTACC 
ACAACGAACT 
TGCGGCTATG 
TGCAAGATGC 
CCAGCACCCA 
TACACCTTCA 
TGGCTGGCTC 
ACATTAACAC 
AGAACCGGTC 
AAGGTTTCTT 
CTACCTATGA 
GTGCATGCCC 
AGTCTTCAGG 
GATATTGAAA 
GAAGCCATGA 
CTCCGAGATG 
AGTGAACATT 
ATCCTCAAGG 
ATTGAAGAGC 
TCAGGGTCTG 
CAGCTTCAGG 
GAAGAGTTGC 
ATTGATGTGG 
CTTCAGAAAC 
GCAGGAAAAC 
ACTCTGGCTG 
GCATGTATTG 
AAGCGCCTTG 
CCATTCCTTC 
CCTTATGCCC 
AAAAAGTACT 
ACTCTCCTGA 
ATGCAATACT 
CATGTTGTTG 
TAAAGATATT 
AATTATAAAA 



31 
I 

TCGGACCTAG 
GCTGTCCGAT 
GTCTGTGGTA 
CAAGCAGCAG 
GCCCTTGTTA 
TGTGGAGACC 
GGGAATTGGC 
ACAGGCATCC 
GAACTGGCTC 
TACCATCAAG 
AGGCCAACTT 
AGACAGCAAG 
CCAAGAGGAG 
TACCAGCACC 
CAGTAGCCAG 
TGTCCCGGCA 
GCTTTATGAC 
CGAGGATCAA 
TGAGGAGGCC 
CCTCAACCAG 
GGGGGAAGGA 
AGAGCGCTGC 
CTCTCTACAC 
AAAGCAGAAC 
CACAGGCCGA 
TGAAACTCTT 
CACCTATGCA 
TATCCCCCAG 
ATGAAGCTGA 
AGACACTGCT 
AAATTTGCAA 
TGGACACCCA 
AGTCACTGAC 
TAGAAGCTCT 
AATTGGCCCT 
AGGTTAAAGC 
ATAAGTATCA 
ACAAGAAGTT 
TTGGTGAGTC 
TTCGTCAAGC 
AACTGCAGAA 
CTGAGCAGTA 
GTACCAACCA 
GAAATTTACT 
GGATCCTACG 
AAGGCTGTGG 
AGAAATAGGT 
CAGACACTAG 
TTTTTTTTTA 
ATTGTTCACA 
GGGACAGAAA 



QQAKRRQKRE 
WDKNTGDIN 
IFMGEIEENS 
LTNSLDREQA 
LSSELLRFQV 
QSVKLSIAVK 
LVDYILGTYQ 
TITAEVLAID 
TGPYTFALED 
SLTLEVCQCD 
CDCGAGSTGG 
EVCTNTYARG 
GQSGTMRTRH 
ADATGSPVGS 
IESCGHPIEV 
GSLVQPSTAG 



41 

I 

GCTGCCCCTG 
GACGATGTCG 
CGCAAGAACC 
GTTCCATCTG 
CCTTCAGAGT 
CTTGTTCTAC 
CAAGCCACAC 
TTCTTCAACC 
ATCTATACAT 
GATGGAGGGA 
CATCCAACAC 
CAGATCCGAC 
GAGCTGTCCA 
AGCTTCGACA 
CTGGATGAAA 
AACATTCGCT 
CTATTAGAAC 
AATGGCAATC 
TGGAAGCTCC 
AACTCCAGCC 
GATATAGTCC 
AAAGATCAGA 
ACCCTGGGCC 

GGCCGTTCCT 
CATGTGGCCA 
ACTGGGATTC 
CTTAGAGAAA 
CATCTCCATG 
TTTGAAGGAA 
TGAGATGGTA 
AAAGGAACTA 
AAGTTTTTAC 
CTTGCAGGAA 
ACGGCGGTCA 
TAAATTACAG 
GAAAATGTTA 
AGAAGAGGGC 
TCTCCAATCA 
CTTGACCACT 
CAACATGGTG 
TCATACTGTG 
GGAAAATCAG 
TCCCCGAACA 
CTCACGGCGT 
GGAAAGAGAA 
CTCTTTTATG 
CTTTTTTCTC 
TTTACTTATA 
TTTTTTATTG 
AA 



WVKFAKPCRE 
ITAIVDREET 
ASNSLVMILN 
SSYRLWSGA 
TDLDEEYTDN 
NKAEFHQSVI 
AIDEDTNKAA 
EYTGKTSTGT 
QPVKLPAVWS 
NRGIOGTSYP 
VTGGFIPVPD 
TAVEGTSGME 
STGGTNKDYA 
VGCCSFIADD 
QQTGFVKCQT 
FDPLLTQNVI 



51 

I 

CCGTCATGTC 
TAGTTTCTCC 
TGCTATCAGA 
AGGACAGTAT 
TGGAACGACA 
AAGCACCCAA 
ACAGGTTCAC 
TAACTGTGAA 
ATGGAGTCAC 
TTCTCCCCCG 
CTGATCTGAA 
AGGAGGAAAT 
CTTCCTTGAA 
GTGGCATTGC 
CAAGTCATCG 
TCTCCATCTG 
CGCCTAGCCA 
CCTATGTGAA 
TAAAAGTGGG 
GCAGTCACAG 
CCAAGATCAG 
AGAGTGGTGA 
GCTGTATTGC 
TCCGTGACAG 
GCATGATTGT 
AGTTCTCAGC 
CCATCCCTGC 
GGGGCTAAGG 
TATGGCAAAG 
CGACAGGAAA 
GAACAGATGC 
TTGGAGGAAA 
CAAGAAGAGA 
GCCAGACAAC 
CAAAGGTTGG 
CAGTGCAAAG 
GAACCACCAG 
CAGAAGAATA 
GCAGAGAGAG 
TGTGATGACA 
CTAGTGAAAC 
TTGAAACTCC 
CAACCAAACC 
CCAACCTGCC 
TCCCCTTTAC 
GAGCAGTCAT 
CTTTACCATA 
ACTTTTGTAT 
TGATTTCTAT 
AATTCCAAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



Seq ID NO: 26 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MSQGII*SPPA GLLSDDDVW SPMFESTAAD LGSWRKNLli SDC9WSTSI* EDKQQVPSED 
SMEKVKVYLR VRPLLPSELE RQEDQGCVRI ENVETLVLQA PKDSFALKSN ERGIGQATHR 
FTFSQIFGPE VGQASFFNLT VKEMVKDVLK GQNWLIYTYG VTNSGKTHTI QGTIKDGGIL 
PRSLALIFNS LQGQLHPTPD LKPI*LSNEVI WLDSKQIRQE EMKKLSLLNG GLQEEELSTS 
LKRSVYIESR IGTSTSFDSG IAGLSSISQC TSSSQLDETS HRWAQPDTAP LPVPANIRFS 



60 
120 
180 
240 
300 
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WO 02/086443 

IWISFFEIYN ELLYDLLEPP SQQRKRQTLR LCEDQNGNPY VKDLNWIHVQ DAEEAWKLLK 
VGHKNQSFAS THLKQNSSRS HSIPSIRILH LQGEGDIVPK ISEI^SLCDLA GSERCKDQKS 
GERLKEAGNI NTSLHTLGRC IAALiRQNQQN RSKQNLVPFR DSKLTRVFQG FFTGRGRSCM 
IVNVNPCAST YDETLHVAKF SAIASQVTCA CPTYATGIPI PALVHQGT 

Seq ID HO: 27 DNA sequence 

Nucleic Acid Accession #: Eos sequence ' 

Coding sequence ■ 13-1424 



PCT/US02/12476 



360 
420 
480 



1 
I 

TAGAAGTTTA 
CTTCCCCTGA 
TTAGAAAAAT 
GGAAACTTAA 
GGGCAACTGG 
GTCCATCATT 
AGAATCAATA 
GCTTTCCAAG 
GCTGACATTT 
AAAGGTGGAA 
TTGGATGAGG 
GTTCACGAGA 
TTCCCCACCT 
GGCATTCAGT 
TCAGAACCAG 
AAGATCTTTT 
AGTGTTAATT 
GAAATTGAAG 
AATTTAAGAC 
GTGAAAAAAA 
GATAACCAGT 
CTGATTACCA 
AACAAATACT 
CGTATCACCA 
TGGTTTTTGT 
GTGTACCACT 
TTATATAAAA 
CTCTACTATT 
CTCTGTAAGT 
TAAAATTAAG 



11 
I 

CAATGAAGTT 
ACAGCTCTAC 
TTTATGGCCT 
TGAAGGAAAA 
ACACATCTAC 
TCAGGGAAAT 
ATTACACACC 
TATGGAGTAA 
TGGTGGTTTT 
TCCTAGCCCA 
ACGAATTCTG 
TTGGCCATTC 
ACAAATATGT 
CCCTGTATGG 
CTCTCTGTGA 
TCTTCAAAGA 
TAATTTCTTC 
CCAGAAATCA 
CAGAGCCAAA 
TTGATGCAGC 
ATTGGAGGTA 
AGAACTTCCA 
ACTATTTCTT 
AAACACTGAA 
TAGTTCACTT 
ACTTAGAOAT 
TACATAATAT 
AAGTTTGAAA 
TGCTTCCTAA 
TATATATATT 



21 
I 

TCTTCTAATA 
AAGCCTGGAA 
TGAGATAAAC 
AAT CCAAGAA 
CCTGGAGATG 
GCCAGGGGGG 
TGACATGAAC 
TGTTACCCCC 
TGCCCGTGGA 
TGCTTTTGGA 
GACTACACAT 
CTTAGGTCTT 
TGACATCAAC 
AGACCCAAAA 
CCCCAATTTG 
CAGGTTCTTC 
CTTATGGCCA 
AGTTTTTCTT 
TTATCCCAAG 
TGTTTTTAAC 
TGATGAAAGG 
AGGAATCGGG 
CCAAGGATCT 
AAGCAATAGC 
CAGCTTAATA 
ATGTATCATA 
TTTTCAATTT 
ATAGTTACCT 
CATCCTTGGA 
TTGGCTCAAA 



31 
I 

CTGCTCCTGC 
AAAAATAATG 
AAACTTCCAG 
ATGCAGCACT 
ATGCACGCAC 
CCCGTATGGA 
CGTGAGGATG 
TTGAAATTCA 
GCTCATGGAG 
CCTGGATCTG 
TCAGGAGGCA 
GGCCATTCTA 
ACATTTCGCC 
GAGAACCAAC 
AGTTTTGATG 
TGGCTGAAGG 
ACCTTGCCAT 
TTTAAAGATG 
AGCATACATT 
CCACGTTTTT 
AGACAGATGA 
CCTAAAATTG 
AACCAATTTG 
TGGTTTGGTT 
AGTATTTATT 
AAAATAAAAT 
TGAAAACTCT 
TCAAAGCAAG 
CTGAGAAATT 
TAAAATTG 



Seq ID NO i 28 Protein sequence: 
Protein Accession #i Eos sequence 



MKFLLILLLQ 
KEKIQEMQHF 
YTPDMNREDV 
LAHAFGPGSG 
KYVDINTFRL 
FKDRFFWLKV 
BPNYPKSIHS 
NFQGIGPKID 



11 
I 

ATASGALPLN 
LGLKVTGQLD 
DYAIRKAFQV 
IGGDAHPDED 
SADDIRGIQS 
SERPKTSVNL 
FGFPNFVKKI 
AVFYSKNKYY 



21 
I 

SSTSLEKNNV 
TSTLEMMHAP 
WSNVTPLKFS 
EFWTTHSGGT 
LYGDPKENQR 
ISSLWPTIiPS 
DAAVFNPRFY 
YFFQGSNQFE 



31 
I 

LFGERYLEKF 
RCGVPDVHHF 
KINTGMADII* 
NLFLTAVHEI 
LPNPDNSEPA 
GIEAAYEIEA 
RTYFFVDNQY 
YDFLLQRITK 



Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM_006H5.1 

Coding sequence: 2 3 6.. 17 65 



1 
I 

GCTTCAGGGT 
CGGGACACCC 
ACTCTCTGAG 
GAG AC CT AG A 
ACGAAGGCGT 
CCCACGGAGA 
TGCCGCCCTG 
CGGGAGACAC 
TCTGGGAGTG 
TGGACTTGAT 
GGATTTACGG 
TCTGTACTCA 
TGGTTTGAGC 
CCTCAAGGAA 
GAAAAATGTA 
TATCAAGATG 
TACCTGGAAG 
GCGTAGACTC 
GCAGTATATC 
TGTGGACTCT 
CCCCTTGGAA 
GTCCCAGAGT 
CGATGTAAGT 
CCTGGTCTTT 
GAGCCACTGC 



11 
I 

ACAGCTCCCC 
CACCCGCTTC 
GAAAAACCAT 
AATCCAAGCG 
TTGTGGGGTT 
CTTGTGGAGC 
GAGTTGCTGC 
AGCCAGACCC 
CTGATGAAGG 
GTGCTCCTTG 
AAGAACTCTC 
TTTCCAGAGC 
ACAGAGGCAG 
GGTGCCTGTG 
CTACGCCTGT 
ATCCTGAAAA 
CTACCCACCT 
CTCCTCTCCC 
GCCCAGTTCA 
TTATTTTTCC 
ACCCTCTCAA 
CCCAGCGTCA 
CCCGAGCCCC 
GATGAGTGTG 
TCCCAGCTTA 



21 
I 

CGCAGCCAGA 
CCAGGCGTGA 
T TTGATTATT 
TTGGAGGTCC 
CCATTCAGAG 
TGGCAGGGCA 
CCAGGGAGCT 
TGAAGGCAAT 
GACAACATCT 
CCCAGGAGGT 
ATCAGGACTT 
CAGAAGCAGC 
AGCAGCCCTT 
ATGAATTGTT 
GCTGTAAGAA 
TGGTGCAGCT 
TGGCGAAATT 
ACATCCATGC 
CCTCTCAGTT 
TTAGAGGCCG 
TAACTAACTG 
GTCAGCTAAG 
TCCAAGCTCT 
GGATCACGGA 
CAACCTTAAG 



31 
I 

AGCCGGGCCT 
CCTGTCAACA 
ACTCT CAGAC 
TGAGGCCAGC 
CCGATACATC 
GAGCCTGCTG 
CTTCCCGCCA 
GGTGCAGGCC 
TCACCTGGAG 
TCGCCCCAGG 
CTGGACTGTA 
TCAGCCCATG 
CATTCCAGTA 
CTCCTACCTC 
GCTGAAGATT 
GG ACT CTATT 
TTCTCCTTAC 
ATCTTCCTAC 
CCTCAGTCTG 
CCTGGATCAG 
CCGGCTTTCG 
TGTCCTGAGT 
GCTGGAGAGA 
TGATCAGCTC 
CTTCTACGGG 



41 

1 

AGGCCACTGC 
TGCTATTTGG 
TGACAAAAAT 
TCTTGGGTCT 
CTCGATGTGG 
GGAAACATTA 
TTGACTACGC 
GCAAGATTAA 
ACTTCCATGC 
GCATTGGAGG 
CAAACTTGTT 
GTGATCCAAA 
TCTCTGCTGA 
GCTTGCCAAA 
CTGTCACTAC 
TTTCTGAGAG 
CTGGCATTGA 
ACAAATACTG 
CTTTTGGTTT 
ATAGGACCTA 
TGGACCCTGG 
ATGCAGTCTT 
AATATGACTT 
GTTGAAAATG 
GCATATTTGC 
CTGTAAACCA 
AATTGTCCAT 
ATAATTCTAT 
ATACTTACTT 



41 

I 

YGLEINKLPV 
REMPGGPVWR 
WFARGAHGD 
GHSLGLGHSS 
LCDPNLSFDA 
RNQVFLFKDD 
WRYDERRQMM 
TLKSNSWFGC 



41 
I 

GCAGCCCCTC 
GCAACTTCGC 
GTGCGTGGCA 
CTAAGTCGCT 
AGCATGAGTG 
AAGGATGAGG 
CTCTTCATGG 
TGGCCCTTCA 
ACCTTCAAAG 
AGGTGGAAAC 
TGGTCTGGAA 
ACAAAGAAGC 
GAGGTGCTCG 
ATTGAGAAAG 
TTTGCAATGC 
GAAGATTTGG 
CTGGGCCAGA 
ATTTCCCCGG 
CAGTGCCTCC 
TTGCTCAGGC 
GAAGGGGATG 
CTAAGTGGGG 
GCCTCTGCCA 
CTTGCCCTCC 
AATTCCATCT 



51 
I 

TTCTGGAGCT 
TGAAAGATAC 
GAAATATAGT 
GAAAGTGACC 
AGTCCCCGAT 
TATCACCTAC 
AATCCGGAAA 
CACAGGCATG 
TTTTGATGGC 
GGATGCACAT 
CCTCACTGCT 
GGCCGTAATG 
TGACATACGT 
TCCTGACAAT 
CGTGGGAAAT 
ACCAAAGACC 
AGCTGCTTAT 
GTTAATTAGC 
TCCTAACTTT 
CTTCTTTGTA 
TTATCCCAAA 
CTACTCTAAA 
CCTACTCCAA 
GTGTAATTAA 
TATGTCCTCA 
TAGGXAATGA 
TCTTGCTTGA 
TTGAAGCATG 
CTGGCATAAC 



51 
I 

TKMKYSGNLM 
KHYITYRINN 
FHAFDGKGGI 
DPKAVMFPTY 
VTTVGNKIFF 
ICYWLISNLRP 
DPGYPKLITK 



51 

I 

AGCACCGCTC 
GK3TGTGGT6A 
ACAAGTGACT 
TCAAAATGGA 
TGTGGACAAG 
CCCTGGCCAT 
CAGCCTTTGA 

CTGTGCTTGA 
TTCAAGTGCT 
ACAGGGCCAG 
GAAAAGTAGA 
TAGACCTGTT 
TGAAGCGAAA 
CCATGCAGGA 
AAGTGACTTG 
TGATTAATCT 
AGAAGGAAGA 
AGGCTCTCTA 
ACGTGATGAA 
TGATGCATCT 
TCATGCTGAC 
CCCTCCAGGA 
TGCCTTCCCT 
CCATATCTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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CTTGCAOAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 

TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620 

TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 

TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 

GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 

TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 

ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 

GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980 

GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 

GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 

Seg ID NO: 30 Protein sequence: 
Protein Accession #: NP_006106.1 

1 11 21 31 41 51 

I I I I I I 

GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCGCCTC AGCACCGCTC 60 

CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120 

ACTCTCTGAG GAAAAACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 180 

GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240 

ACGAAGGCGT TTGTGGGGTT CCATTCAGAG CCGATACATC AGCATGAGTG TGTGGACAAG 300 

CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CCCTGGCCAT 360 

TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420' 

CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480 

TCTGGGAGTG CTGATGAAGG GACAACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540 

TGGACTTGAT GTGCTCCTTG CCCAGGAGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 

GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 

TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAGTAGA 720 

TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780 

CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840 

GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 

TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAAGATTTGG AAGTGACTTG 960 

TACCTGGAAG CTACCCACCT TGGCGAAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020 

GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 

GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1140 

TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 

CCCCT TGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 

GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 

CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1380 

CCTGGTCTTT GATGAGTGTG GGATCACGGA TGATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 

GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500 

CTTGCAOAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 

TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620 

. TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 

TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 

GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 

TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 

ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 

GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980 

GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 

GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 

Seq ID NO : 31 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 64-2754 

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 
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AATAGAAATG GCAATGGTTT AAGGTACAAA 
ATTGATGAAA TTTCAGGGTC AATCATAACT 
CCCAAAAATG AGTTGTATAA TATTACAGTC 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT 
GAATATGTAG TCATTTGCAA ACCAAAAATG 
GATGAACCTG TCCATGGAGC TCCATTTTAT 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT 
AATGCTGGAT TTCAAGAATA TACCATTCCT 
GCAACAAAAT TATTGAGAGT TAATCTGTGT 
ACTTCAAGGA GTACAGGAGT AATACTTGGA 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA 
GAAATGATGA AAGGAGGAAA CCAGACCTTG 
ACCCTGGACT CCTGCAGGGG AGGACACACG 
GAGTGGCACA GTTTTACTCA ACCCCGTCTC 
GAAGACCGCA TGCCATCCCA AGATTATGTC 
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA 
AATAATTTGG AACCCAAATT TATTACATTA 
AGTGCTACAA TTAGGTCTTT GTCAGACATT 
TTCAATTTCA ACATGTATGT ATATGATGAT 
CCAA TTTATA TTTTTAAAGC CAGTTGTTGC 
AACAGACAAC TGGTAAATCT CAAACTCCAG 
TCTTTTTTTT TTTTACGGAT ATTTTAGTAA 
ATAGGTAAGT TATGCTAATA TCACATTATT 
ATAAACAAGA AATATTGAGT ATCACTATGT 
ACTGAATTAA ATTAAAAATG TTGCAGCTCA 
ACCAAATTCA TTTGACTTTG GAGGCAAAAT 
TCTATAGGAA TATAGTTGGA AATAAATGTG 
ATTTAAAATG AAATGAGAAC AAAGAGGAAA 
TAGTTTGTCC TACAATAGAA AAAAGAGAGA 
ATTATAACTG AGTCTATGAG GAAATAGTTC 
GTAAATAAAT TAAACTTTTC TGGTTTCTGT 
TAGCTTTGCT TTGCAGTCTG TTT CAAGATT 
GAATACTCGC TGCAGCTGGG GTTCCCTGCT 
TTTTTTTCGG GGAGCTAATA ACAAAAACAT 
CTCTATTGCT GTTTCTATTC TCTCTTATAG 
TAACCATGTC CTCCTAGAGT TTAGAGGCTA 
GCACCCTGGG GAGATTGATT GT CCTTAAAC 
GTCTGGGAGC TACAAAATTT CATTTTTCTC 
CTGAATCAAG GAAAGCCAGG CCTTGTGGGC 
ACCTCCAGCA GAGATTCCCT TAAGTGACTC 
AATTTTTAAT CAGTTTGCTT TCTCCAGAGA 
TTGAATGTAT AAAAGAAAAA GATCAAGTTG 
AAGCAGCCCA AGTAGGTTAT TTGTACAGTC 
GGGCAAGGAG AGGCCACAAG GAATATGGGT 
CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC 
GTCCGGTGAG GGATCAGCCA ACCTCTTCTC 
AAGGAGACAG AGCTGACTGC ATGATGAGTC 
GTTGTGCAGA ACAAACAAGG CATTCATGGG 
CTGGGCACTA AGAAGGTCTA TGAATTAAAT 
TTTTCTGTTT TCTAATTTGA CCCTAAAATC 
CCCCCCCCCC TTTTTTTTTG AGACGGAGTC 
GCTCCGATCT CTGCTCACTG AAAGCTCCGC 
GCCTCCTGAG TAGCTGGGAC TACAGGCGCC 
TTTAATAGAG ACGGGGTTTC ACTGTGTTAG 
ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG 
CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT 
TG AT CAT ACQ AATTGGATCA ATCTTGAAAT 
GGGAGAAAGA ACTCAGGGCA CAAAATATTG 
TTGCTGAAAT TTCCTGCTGT AACCAGAAGC 
ACTGTGTTTT GCTCACTCCC TCACTCACCG 
CTAGTGCCGA TAAACTTTCT CAAAGAGCAA 
TAACCATCTC TTTGTTCTTT GAACATGCTG 
TTTGTAATTC TTTTCTCTCA AATGAAAATT 
CATATGTAGT ATT ATT ATT T CCTTATATGT 
CAAGAAAATA TATTTTTAAA GCTTTCATTT 
TGTAAATATA CAGAATGTTT TTTCTTACTT 
GGGGTTTGTT TTGCAATGTT TTAAACAGAG 
TGCTTTTAAA GAAACTTGGC TGCTTAAAAT 
TACAGATGTG GGGAGATGTA ATAAAACAAT 
TAGAGATTAA ATAATTCTAA GATGATCACT 
AATAGAAATA CTCAATTATG TCTTTGTTGT 
ATTATCAAAT TGTCGACATC ATTAATATAT 
GAAGCACAGC TTTACAGATG AGTATCTATG 
ATTAAAAGTA TTAGAAGGTG GTTATAATTG 
AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC 
GGCAATATTG CAGTCTTGAT TCTGCCACTT 
CAAGATGATC CAACCATAAA GGTGCTCTGT 
AGTGTGCTCC CCTACAAACG TTAAGACTGA 
AGCCTTACAT TTTAATATAG GTTGAACCAA 
CATTATTTTT GTGTATGTCT TCAAGAATGT 
ACCGGATACA TTTCACGTGT CCTTCAGTAT 
TGAGAAGCAT GGACACTAGA GCCAGAATGC 
TTCTGTGTGA CCTTTGAAAG GCTACTTATT 
GAACAATGCC AGCCTCATGG GGTTGTTGAA 



AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GGGTATACOG ACATTTTAGC TGTTGATCCT I960 

TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580 

CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 

AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 

GCAGAAGCAT GCACAAAGAG ATAATGTCAC 27 60 

CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 

TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 

TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 

CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 

TAAATATGCT GGATAAATAT TAGTCCAACA 3060 

ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 

GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 

TAAAGAATTG GGACTCACCC CTACTGCACT 3240 

GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 

TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 

ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 

CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 

GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 

TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 

TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 

TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 

TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 

GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 

CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 

CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 

CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 

CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 

AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 

TCAT TTTAGA ACAGAGGGAA CTTTGGGAGA 42 60 

AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 

GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 

TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 

TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 

TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 

AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 

GCCTATCTAA AATTCTGATT TATTCCTACA 4680 

TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 

TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 

CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 

CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 

CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 

GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 

AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 

ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 

GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 

CAGTTTTATC TAACGGCTAC TGAAACACCC 5280 

ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 

CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 

AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 

TAATTTTAGG GATTCATTTC TATATTTTCA 5520 

GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 

TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 

TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 

TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 

AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 

ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 

TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 

ATTAATGGGG AATATTTTGG ACAATGTTT C 6000 

ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 

ATACATATGT ATAATAAATT TTGATCGGGT 6120 

CAGAGTATTC CATGAATAGT ACACTGACAC 6180 

AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240 

ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 

GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360 

TCATTTCAAA AATCTATTAG CTATATCAAA 6420 

AATTTCAATT CCAGTAACTT CTATTGTAAC 6480 

TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540 

TGATTTGGTT GAATATTGGG TCATAATGGT 6600 

TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 

TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

TGATTAAATT AGTTAATATA CCTAAAGTAC 6780 
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ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840 

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900 
TATATATAAT CCCGAAACAT G 

Seq ID NO: 32 Protein sequence: 
Protein Accession 8: NP_001932.1 

1 11 21 31 41 SI 

I I I I I I 

MAAAGFRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS €0 

ADLIRSSDPD PRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLPLQQVE SDAAQNYTVF YS I SGRGVDK 180 

EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVPTEAIYNP EVLESSRPGT TVGWCATDR DEPDTMHTRIj KYSILQQTPR SPGLPSVHPS 300 

TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360 

EAPVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHPKISTDK ETNEGVLSW 420 

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK IiHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVTi AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YWICKPKMG YTDIIAVDPD 600 

EPVHGAPFYP SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 

MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSB WHSFTQPRLG EKZJIRCNQNE 840 
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NLEPKFITIA EACTKR 



Seg ZD NO: 33 DMA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 64-2583 

1 11 21 31 41 51 

I I I 1 I I 

GGCAGGTCTC GCTCT CGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGG AT C GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGC CAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2580 

TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAATCAG AATGAAG AC C GCATGCCATC 2640 

CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 

CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 

ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 2820 

TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 

TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 

AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000 

TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3060 

GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 

ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3i80 

AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 

ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 
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TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 

GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 

AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 

GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3S40 

GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 

TTCTGQTTTC TQTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 

CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 

GGGGTTCCCT GCT TTTTG GT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 

ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 

TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 

AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 

ATTGTOCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGQ AGCTACAAAA 4020 

TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 

AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 

CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 

CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 

AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 

TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 

AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 

CACTGCCTTT TCCTTT CTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500 

CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 

TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620 

AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 

CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 

TGACCCTAAA AT CT ATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 

TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 

CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 

GACTACAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 

TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220 

GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

AAATGAAAAT TTAATTTTAG GGAT TCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AG CTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820 

CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940 

TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000 

TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATG CTTAT GGCTGGCATG 6060 

GAAATAGAAA TACT CAATT A TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 6120 

TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6360 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AG CT ATATCA 6540 

AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTT CAT T GG A TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

ATGTAGTTGG A T AT ACT AC C GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 
CATATATATA AT CCCGAAAC ATG 



Seq ID NO: 34 Protein sequence i 
Protein Accession ft: NP_077741.1 

1 11 21 31 41 51 
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MAAAGPRRSV RGAVCLHLLL TLVXFSRDGE ACKKVILNVP SKLEADKI IG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLL.EHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

EPLNLFYIER DTGNLFCTRP VDREEYDVFD L I AYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGWCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300 

TGVITTVSHY LDREWDKYS LIMKVQDMDG QPFGL1GTST CIITVTDSND NAPTFRQNAY 360 

EAPVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSW 420 

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGIiRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YWICKPKMG YTDILAVDPD 600 

EPVHGAPPYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 
MMKGGNQTLB SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EE8ZRGHTG 
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Nucleic Acid Accession Jh Eos sequence 
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GGGAGTGGGC GTGGCGGTGC TGCCCAGGTG AGCCACCGCT GCTTCTGCCC AGACACGGTC 60 

5 GCCTCCACAT CCAGGTCTTT GTGCTCCTCG CTTGCCTGTT CCTTTTCCAC GCATTTTCCA 120 

GGATAACTGT GACTCCAGGC CCGCAATGGA TGCCCTGCAA CTAGCAAATT CGGCTTTTGC 180 

CGTTGATCTG TTCAAACAAC TATGTGAAAA GGAGCCACTG GGCAATGTCC TCTTCTCTCC 240 

AATCTGTCTC TCCACCTCTC TGTCACTTGC TCAAGTGGGT GCTAAAGGTG ACACTGCAAA 300 

TGAAATTGGA CAGGTTCTTC ATTTTGAAAA TGTCAAAGAT ATACCCTTTG GATTTCAAAC 360 

10 AGTAACATOG GATGTAAACA AACTTAGTTC CTTTTACTCA CTGAAACTAA TCAAGCGGCT 420 

CTACGTAGAC AAATCTCTGA ATCTTTCTAC AGAGTTCATC AGCTCTACGA AGAGACCCTA 480 

TGCAAAGGAA TTGGAAACTG TTGACTTCAA AGATAAATTG GAAGAAACGA AAGGTCAGAT 540 

CAACAACTCA ATTAAGGATC TCACAGATGG CCACTTTGAG AACATTTTAG CTGACAACAG 600 

TGTGAACGAC CAGACCAAAA TCCTTGTGGT TAATGCTGCC TACTTTGTTG GCAAGTGGAT 660 

15 GAAGAAATTT CCTGAATCAG AAACAAAAGA ATGTCCTTTC AGACTCAACA AGACAGACAC 720 

CAAACCAGTG CAGATGATGA ACATGGAGGC CACGTTCTGT ATGGGAAACA TTGACAGTAT 780 

CAATTGTAAG ATCATAGAGC TTCCTTTTCA AAATAAGCAT CTCAGCATGT TCATCCTACT 840 

ACCCAAGGAT GTGGAGGATG AGTCCACAGG CTTGGAGAAG ATTGAAAAAC AACTCAACTC 900 

AGAGTCACTG TCACAGTGGA CTAATCCCAG CACCATGGCC AATGCCAAGG TCAAACTCTC 960 

20 CATTCCAAAA TTTAAGGTGG AAAAGATGAT TGATCCCAAG GCTTGTCTGG AAAATCTAGG 1020 

GCTGAAACAT ATCTTCAGTG AAGACACATC TGATTTCTCT GGAATGTCAG AGACCAAGGG 1080 

AGTGGCCCTA TCAAATGTTA TCCACAAAGT GTGCTTAGAA ATAACTGAAG ATGGTGGGGA 1140 

TTCCATAGAG GTGCCAGGAG CACGGATCCT GCAGCACAAG GATGAATTGA ATGCTGACCA 1200 

TCCCTTTATT TACATCATCA GGCACAACAA AACTCGAAAC ATCATTTTCT TTGGCAAATT 1260 

25 CTGTTCTCCT TAAGTGGCAT AGCCCATGTT AAGTCCTCCC TGACTTTTCT GTGGATGCCG 1320 

ATTTCTGTAA ACTCTGCATC CAGAGATTCA TTTTCTAGAT ACAATAAATT GCTAATGTTG 1380 

CTGGATCAGG AAGCCGCCAG TACTTGTCAT ATGTAGCCTT CACACAGATA GACCTTTTTT 1440 

TTTTTCCAAT TCTAT CTTTT GTTTCCTTTT TTCCCATAAG ACAATGACAT ACGCTTTTAA 1500 

TGAAAAGGAA TCACGTTAGA GGAAAAATAT TT ATT CATT A TTTGTCAAAT TGTCCGGGGT 1560 

30 AGTTGGCAGA AATACAGTCT TCCACAAAGA AAATTCCTAT AAGGAAGATT TGGAAGCTCT 1620 

TCTTCCCAGC ACTATGCTTT CCTTCTTTGG GATAGAGAAT GTTCCAGACA TTCTCGCTTC 1680 

CCTGAAAGAC TGAAGAAAGT GTAGTGCATG GGACCCACGA AACTGCCCTG GCTCCAGTGA 1740 

AACTTGGGCA CATGCTCAGG CTACTATAGG TCCAGAAGTC CTTATGTTAA GCCCTGGCAG 1800 

GCAGGTGTTT ATTAAAATTC TGAATTTTGG GGATTTTCAA AAGATAATAT TTTACATACA 1860 

35 CTGTATGTTA TAGAACTTCA TGGATCAGAT CTGGGGCAGC AACCTATAAA TCAACACCTT 1920 

AATATGCTGC AACAAAATGT AGAATATTCA GACAAAATGG ATACATAAAG ACTAAGTAGC 1980 

CCATAAGGGG TCAAAATTTG CTG CCAAATG CGTATGCCAC CAACTTACAA AAACACTTCG 2040 

TTCGCAGAGC TTTTCAGATT GTGGAATGTT GGATAAGGAA TTATAGACCT CTAGTAGCTG 2100 

AAATGCAAGA CCCCAAGAGG AAGTTCAGAT CTTAATATAA ATTCACTTTC ATTTTTGATA 2160 

40 GCTGTCCCAT CTGGTCATGT GGTTGGCACT AGACTGGTGG CAGGGGCTTC TAGCTGACTC 2220 

GCACAGGGAT TCTCACAATA GCCGATATCA GAATTTGTGT TGAAGGAACT TGT CTCTTCA 2280 

TCTAATATGA TAGCGGGAAA AGGAGAGGAA ACTACTGCCT TTAGAAAATA TAAGTAAAGT 2340 

GATTAAAGTG CTCACGTTAC CTTGACACAT AGTTTTTCAG TCTATGGGTT TAGTTACTTT 2400 

AGATGGCAAG CATGTAACTT ATATTAATAG TAATTTGTAA AGTTGGGTGG ATAAGCTATC 2460 

45 CCTGTTGCCG GTTCATGGAT TACTTCTCTA TAAAAAATAT ATATTTACCA AAAAATTTTG 2S20 

TGACATTCCT TCTCCCATCT CTTCCTTGAC ATGCATTQTA AATAGGTTCT TCTTGTTCTG 2580 
AGATTCAATA TTGAATTTCT CCTATGCTAT TGACAATAAA ATATTATTGA ACTACC 

Seq ID NO: 36 Protein sequence: 
50 Protein Accession #» NP_00263 0.i 

1 11 21 31 41 51 
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MDALQLANSA PAVDLPKQLC EKEPLGNVLF SPICLSTSLS LAQVGAKGDT ANEIGQVLHF 60 

55 ENVKDIPFGP QTVTSDVNKL SSFYSLKLIK RLYVDKSLNL STBPISSTKR PYAKELETVD 120 

FKDKLEETKG QINNSIKDLT DGHFENILAD NSVNDQTKIL WNAAYFVGK WMKKPPESET 180 

KECPFRLNKT DTKPVQMMNM EATPCMGNID 9INCKIIELP FQNKHLSMPI LLPKDVEDES 240 

TGLEKXEKQIi NSESLSQWTN PSTMANAKVK IiSIPKFKVEK MIDPKACLEN LGLKHIFSED 300 

TSDPSGMSET KGVALSNVIH KVCLEITEDG GDSIEVPGAR ILQHKDELNA DHPFIYIIRH 360 
60 NKTRNIIFFG KFCSP 

Seq ZD NO: 37 DNA sequence 
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GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60 

70 TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 120 

CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180 

ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 240 

ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360 

75 CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 420 

AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540 

TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600 

_ _ TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660 

80 CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720 

AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780 

CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 

AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900 

0 GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960 

85 TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 
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Seq ID NO: 38 Protein sequence: 
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1 11 21 31 41 SI 
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MFQTGGIilVP YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGIiAGS LTNALSNGLL 

SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 

VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 

THSPGSLQIS LLDGLGPIjPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 

DIVNMLIHGL QFVIKV 
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CTCAGGGCAG 
TCCTGGAACT 
TCTCCCTCGG 
TCACTTCTAA 
TTCAATGTCG 
TTTGGCTACA 
GTAATAGGAA 
CCCAATGCAT 
CACGTCATAA 
GAGCTGCCCA 
GTGGCCTTCA 
CAGAGCCTCC 
TTCAATGTCA 
GCCAGGCGCA 
TCCCCTCTAA 
TCTAACCCAC 
GAGCTCTTTA 
AACTCAGACA 
CCCAAACCCT 
TTAACCTGTG 
CTCCCGGTCA 
GTCACAAGGA 
CACAGCGACC 
TCATACACCT 
CCACCTGCAC 
TTTATCTCCA 
GCCAGTGGCC 
CCCTCCATCT 
TGTGAACCTG 
GTCAGTCCCA 
AGAAATGACG 
GACCCAGTCA 
TCGTCTTACC 
CCGCAGTATT 
GCCAAAATCA 
GGCGGCAATA 
CTCTCAGCTG 
TAGCAGCCCT 
TAAAGCATTT 
AGACTCTGAC 
AAATACAAAA 
TGAGGCAGQA 
ACTGCACTCC 
TCTGACCTGT 
AACTTTAATG 
TAATTAATTT 
TTCCCAGATT 
AAATATACTT 
AGACTTGGGA 
TCAATAAAAA 



11 
I 

AGGGAGGAAG 
CAAGCTCTTC 
CCCCTCCCCA 
CCTTCTGGAA 
CAGAGGGGAA 
GCTGGTACAA 
CTCAACAAGC 
CCCTGCTGAT 
AGTCAGATCT 
AGCCCTCCAT 
CCTGTGAACC 
CGGTCAGTCC 
CAAGAAATGA 
GTGATTCAGT 
ACACATCTTA 
CTGCACAGTA 
TCCCCAACAT 
CTGGCCTCAA 
TCATCACCAG 
AACCTGAGAT 
GTCCCAGGCT 
ATGATGTAGG 
CAGTCATCCT 
ATTACCGTCC 
AGTATTCTTG 
ACATCACTGA 
ACAGCAGGAC 
CCAGCAACAA 
AGGCTCAGAA 
GGCTGCAGCT 
CAAGAGCCTA 
CCCTGGATGT 
TTTCGGGAGC 
CTTGGCGTAT 
CGCCAAATAA 
ATTCCATAGT 
GGGCCACTGT 
GGTGTAGTTT 
GCAACAGCTA 
CAGAGATCGA 
ATGAGCTGGG 
GAATCGCTTG 
AGTCTGGCAA 
ACTCTTGAAT 
AACTAACTGA 
CATGGGACTA 
TCAGGAAACT 
TTGTGAACAA 
AACTATTCAT 
TCTGCTCTTT 



21 
I 

GACAGCAGAC 
TCCACAGAGG 
CAGATGGTGC 
CCCGCCCACC 
GGAGGTGCTT 
AGGTGAAAGA 
TACCCCAGGG 
CCAGAACATC 
TGTGAATGAA 
CTCCAGCAAC 
TGAGACTCAG 
CAGGCTGCAG 
CACAGCAAGC 
CATCCTGAAT 
CAGATCAGGG 
CTCTTGGTTT 
CACTGTGAAT 
TAGGACCACA 
CAACAACTCC 
TCAGAACACA 
GCAGCTGTCC 
ACCCTATGAG 
GAATGTCCTC 
AGGGGTGAAC 
GCTGATTGAT 
GAAGAACAGC 
TACAGTCAAG 
CTCCAAACCC 
CACAACCTAC 
GTCCAATGGC 
TGTATGTGGA 
CCTCTATGGG 
GAACCTCAAC 
CAATGGGATA 
TAACGGGACC 
CAAGAGCATC 
CGGCATCATG 
CTTCATTTCA 
CAGTCTAAAA 
GACCATCCTA 
CTTGGTGGCG 
AACCCGGGAG 
CAGAGCAAGA 
ACAAGTTTCT 
CAGCTTCATG 
AATGAACTAA 
TTTTTTCTTT 
AAATTGAGAC 
GAATATTTAT 
GTATAACAGA 
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I 

CAGACAGTCA 
AGGACAGAGC 
ATCCCCTGGC 
ACTGCCAAGC 
CTACTTGTCC 
GTGGATGGCA 
CCCGCATACA 
ATCCAGAATG 
GAAGCAACTG 
AACTCCAAAC 
GACGCAACCT 
CTGTCCAATG 
TACAAATGTG 
GTCCTCTATG 
GAAAATCTGA 
GTCAATGGGA 
AATAGTGGAT 
GTCACGACGA 
AACCCCGTGG 
ACCTACCTGT 
AATGACAACA 
TGTGGAATCC 
TATGGCCCAG 
CTCAGCCTCT 
GGGAACATCC 
G GACTCT AT A 
ACAATCACAG 
GTGGAGGACA 
CTGTGGTGGG 
AACAGGACCC 
ATCCAGAACT 
CCGGACACCC 



CCGCAGCAAC 
TATGCCTGTT 
ACAGTCTCTG 
ATTGGAGTGC 
GGAAGACTGA 
TTGCTTCTTT 
GCCAACATCG 
CGCACCTGTA 
GTGGAGATTG 
CTCCATCTCA 
GATACCACTG 
AAACTGTCCA 
TGAGGATTGC 
TAAGCTATCC 
ATTTACATTT 
ATTGTATGGT 
AAAA 



41 
I 

CAGCAGCCTT 
AGACAGCAGA 
AGAGGCTCCT 
TCACTATTGA 
ACAATCTGCC 
ACCGTCAAAT 
GTGGTCGAGA 
ACACAGGATT 
GCCAGTTCCG 
CCGTGGAGGA 
ACCTGTGGTG 
GCAACAGGAC 
AAACCCAGAA 
GCCCGGATGC 
ACCTCTCCTG 
CTTTCCAGCA 
CCTATACGTG 
TCACAGTCTA 
AGGATGAGGA 
GGTGGGTAAA 
GGACCCTCAC 
AGAACGAATT 
ACGACCCCAC 
CCTGCCATGC 
AGCAACACAC 
CCTGCCAGGC 
TCTCTGCGGA 
AGGATGCTGT 
TAAATGGTCA 
TCACTCTATT 
CAGTGAGTGC 
CCATCATTTC 
ACTCGGCCTC 
ACACACAAGT 
TTGTCTCTAA 
CATCTGGAAC 
TGGTTGGGGT 
CAGTTGTTTT 
ACCAAGGATA 
TGAAACCCCA 
GTCCCAGTTA 
CAGTGAGCCC 
AAAAGAAAAG 
CACTGTCTGA 
CCAAGATCAA 
TGATTCTTTA 
ACTCTTACAG 
TCTCCCTATG 
AATATAGTTA 



51 
I 

GACAAAACGT 
GACCATGGAG 
GCTCACAGCC 
ATCCACGCCG 
CCAGCATCTT 
TATAGGATAT 
GATAATATAC 
CTACACCCTA 
GGTATACCCG 
CAAGGATGCT 
GGTAAACAAT 
CCTCACTCTA 
CCCAGTGAGT 
CCCCACCATT 
CCACGCAGGC 
ATCCACCCAA 
CCAAGCCCAT 
TGCAGAGCCA 
TGCTGTAGCC 
TAATCAGAGC 
TCTACTCAGT 
AAGTGTTGAC 
CATTTCCCCC 
AGCCTCTAAC 
ACAAGAGCTC 
CAATAACTCA 
GCTGCCCAAG 
GGCCTTCACC 
GAGCCTCCCA 
CAATGTCACA 
AAACCGCAGT 
CCCCCCAGAC 
TAACCCATCC 
TCTCTTTATC 
CTTGGCTACT 
TTCTCCTGGT 
TGCTCTGATA 
GCTTCTTCCT 
TTTACAGAAA 
TCTCTACTAA 
CTCGGGAGGC 
AGATCGCACC 
AAAAGAAGAC 
GAATTTCCAA 
G CAG AG AAAA 
AATGTCTTGT 
CAATTTGATA 
TGGTCGCTCC 
TTGCACAAGT 



Seq ID NO: 40 Protein sequence: 
Protein Accession #: NP_004354.l 



MESPSAPPHR 
HDFGYSWYKG 
TIiHVIKSDLV 
NNQSLPVSPR 
TISPLNTSYR 
AHNSDTGLNR 
QSLPVSPRLQ 
SPSYTYYRPG 
NSASGHSRTT 
LPVSPRLQLS 
PDSSYLSGAN 
ATGRNNSIVK 



11 
I 

WCIPWQRLLIi 
ERVDGNRQII 
NEEATGQFRV 
LQLSNGNRTL 
SGENLNLSCH 
TTVTTITVYA 
LSNDNRTLTL 
VNLSLiSCHAA 
VKTITVSAEL 
NGNRTLTLFN 
LNLSCH3ASN 
SITVSASGTS 



21 

I 

TASLLTFWNP 
GYVIGTQQAT 
YPELPKPSIS 
TLFNVTRNDT 
AASNPPAQYS 
EPPKPFITSN 
LSVTRNDVGP 
SNPPAQYSWL 
PKPSISSNNS 
VTRNDARAYV 
PSPQYSWRIN 
PGLSAGATVG 



31 
I 

PTTAKLTIES 
PGPAYSGREI 
SNNSKPVEDK 
ASYKCETQNP 
WFVNGTFQQS 
NSNPVEDEDA 
YECGIQNELS 
IDGNIQQHTQ 
KPVEDKDAVA 
CGIQNSVSAN 
GIPQQHTQVL 
IMIGVLVGVA 



41 
I 

TPFNVAEGKE 
IYPNASbLIQ 
DAVAFTCEPE 
VSARRSDSVI 
TQELFIPNIT 
VALTCEPEIQ 
VDHSDPVILN 
ELFISNITEK 
FTCEPEAQNT 
RSDPVTLDVL 
FIAKITPNNN 
LI 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 ' 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
19B0 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



51 
I 

VLLLVHNLPQ 
NIIQNDTGFY 
TQDATYLWWV 
LNVLYGPDAP 
VNNSGSYTCQ 
NTTYLWWVNN 
VLYGPDDPTI 
NSGLYTCQAN 
TYLWWVNGQS 
YGPDTPIISP 
GTYACFVSNL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



204 



WO 02/086443 

Seq ID NO: 41 DNA sequence 

Nucleic Acid Accession ft: NM_006952.1 

Coding sequence: 11-793 

1 11 21 31 , 41 51 

I I I I I I 

AATCCCGACA ATGGCOAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 

Seq ID NO: 42 Protein sequence: 
Protein Accession #: NP_008883.1 

1 11 21 31 41 51 

I I I I 1 I 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIPFVSDQ HSLYPLLEAT DNDDIYGAAW 60 
IGIFVGICLP CLSVLGIVGI MKSSRKILIiA YFILMFIVYA PEVASCITAA TQRDFPTPNL 120 
FLiKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180 
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWPGFAI 240 
LCWTFWVLLG TMFYWSRIEY 



Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 83-2605 

1 11 21 31 41 51 

I I I I I I 

GCCGGACAGA TCTGCGCGTA TCCTGGAGCC GGCCCAGTTG TGAACTAGGA GAGCTTTGGG 60 

ACCTCTGTCC CAAGCAAGAG AGATGAATGG AGAGTATAGA GGCAGAGGAT TTGGACGAGG 120 

AAGATTTCAA AGCTGGAAAA GGGGAAGAGG TGGTGGGAAC TTCTCAGGAA AATGGAGAGA 180 

AAGAGAACAC AGACCTGATC TGAGTAAAAC CACAGGAAAA CGTACTTCTG AACAAACCCC 240 

ACAGTTTTTG CTTTCAACAA AGACCCCACA GTCAATGCAG TCAACATTGG ATCGATTCAT 300 

ACCATATAAA GGCTGGAAGC TTTATTTCTC TGAAGTTTAC AGCGATAGCT CTCCTTTGAT 360 

TGAGAAGATT CAAGCATTTG AAAAATTTTT CACAAGGCAT ATTGATTTGT ATGACAAGGA 420 

TGAAATAGAA AGAAAGGGAA GTATTTTGGT AGATTTTAAA GAACTGACAG AAGGTGGTGA 480 

AGTAACTAAC TTGATACCAG ATATAGCAAC TGAACTAAGA GATGCACCTG AGAAAACCTT 540 

GGCTTGCATG GGTTTGGCAA TACATCAGGT GTTAACTAAG GACCTTGAAA GGCATGCAGC 600 

TGAGTTACAA GCCCAGGAAG GATTGTCTAA TGATGGAGAA ACAATGGTAA ATGTGCCACA 660 

TATTCATGCA AGGGTGTACA ACTATGAGCC TTTGACACAG CTCAAGAATG TCAGAGCAAA 720 

TTACTATGGA AAATACATTG CTCTAAGAGG GACAGTGGTT CGTGTCAGTA ATATAAAGCC 780 

TCTTTGCACC AAGATGGCTT TTCTTTGTGC TGCATGTGGA GAAATTCAGA GCTTTCCTCT 840 

TCCAGATGGA AAATACAGTC TTCCCACAAA GTGTCCTGTG CCTGTGTGTC GAGGCAGGTC 900 

ATTTACTGCT CTCCGCAGCT CTCCTCTCAC AGTTACGATG GACTGGCAGT CAATCAAAAT 960 

CCAGGAATTG ATGTCTGATG ATCAGAGAGA AGCAGGTCGG ATTCCACGAA CAATAGAATG 1020 

TGAGCTTGTT CATGAT CTTG TGGATAGCTG TGTCCCGGGA GACACAGTGA CTATTACTGG 1080 

AATTGTCAAA GTCTCAAATG CGGAAGAAGG TTCTCGAAAT AAGAATGACA AGTGTATGTT 1140 

CCTTTTGTAT ATTGAAGCAA ATTCTATTAG TAATAGCAAA GGACAGAAAA CAAAGAGTTC 1200 

TGAGGATGGG TGTAAGCATG GAATGTTGAT GGAGTTCTCA CTTAAAGACC TTTATGCCAT 1260 

CCAAGAGATT CAAGCTGAAG AAAACCTGTT TAAACTCATT GTCAACTCGC TTTGCCCTGT 13 20 

CATTTTTGGT CATGAACTTG TTAAAGCAGG TTTGGCATTA GCACTCTTTG GAGGAAGCCA 1380 

GAAATACGCA GATGACAAAA ACAGAATTCC AATTCGGGGA GACCCCCACA TCCTTGTTGT 1440 

TGGAGATCCA GGCCTAGGAA AAAGTCAAAT GCTACAGGCA GCGTGCAATG TTGCCCCACG 1500 

TGGCGTGTAT GTTTGTGGTA ACACCACGAC CACCTCTGGT CTGACGGTAA CTCTTTCAAA 1560 

AGATAGTTCC TCTGGAGATT TTGCTTTGGA AGCTGGTGCC CTGGTACTTG GTGATCAAGG 1620 

TATTTGTGGA ATCGATGAAT TTGATAAGAT GGGGAATCAA CATCAAGCCT TGTTGGAAGC 1680 

CATGGAGCAG CAAAGTATTA GTCTTGCTAA GGCTGGTGTG GTTTGTAGCC TTCCTGCAAG 1740 

AACTTCCATT ATTGCTGCTG CAAATCCAGT TGGAGGACAT TACAATAAAG CCAAAACAGT 1800 

TTCTGAGAAT TTAAAAATGG GGAGTGCACT ACTATCCAGA TTTGATTTGG TCTTTATCCT 1860 

GTTAGATACT CCAAATGAGC ATCATGATCA CTTACTCTCT GAACATGTGA TTGCAATAAG 1920 

AGCTGGAAAG CAGAGAACCA TTAGCAGTGC CACAGTAGCT CGTATGAATA GTCAAGATTC 1980 

AAATACTTCC GTACTTGAAG TAGTTTCTGA GAAGCCATTA TCAGAAAGAC TAAAGGTGGT 2040 

TCCTGGAGAA ACAATAGATC CCATTCCCCA CCAGCTATTG AGAAAGTACA TTGGCTATGC 2100 

TCGGCAGTAT GTGTACCCAA GGCTATCCAC AGAAGCTGCT CGAGTTCTTC AAGATTTTTA 2160 

CCTTGAGCTC CGGAAACAGA GCCAGAGGTT AAATAGCTCA CCAATCACTA CCAGGCAGCT 2220 

GGAATCTTTG ATTCGT CTG A CAGAGGCACG AGCAAGGTTG GAATTGAGAG AGGAAGCAAC 2280 

CAAAGAAGAC GCTGAGGATA TAGTGGAAAT TATGAAATAT AGCATGCTAG GAACTTACTC 2340 

TGATGAATTT GGGAACCTAG ATTTTGAGCG ATCCCAGCAT GGTTCTGGAA TGAGCAACAG 2400 

GTCAACAGCG AAAAGATTTA TTTCTGCTCT CAACAACGTT GCTGAAAGAA CTTATAATAA 2460 

TATATTTCAA TTTCATCAAC TTCGGCAGAT TGCCAAAGAA CTAAACATTC AGGTTGCTGA 2520 

TTTTGAAAAT TTTATTGGAT CACTAAATGA CCAGGGTTAC CTCTTGAAAA AAGGCCCAAA 2580 

AGTTTACCAG CTTCAAACTA TGTAAAAGGA CTTCACCAAG TTAGGGCCTC CTGGGTTTAT 2640 

TGCAGATTAA AGCCAT CTCA GTGAAGATAT GCGTGCACGC ACAGACAGAC AGACACACAC 2700 

ACACACACAC ACACACACAC ACACACACAC ACACACAGTC AAATACTGTT CTCTGAAAAA 2760 

TGATGTCCCA AAAGTATTAT AATAGGAAAA AAGCATTAAA TATAATAAAC TAATTTAAGA 2820 
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AGTGATAAAG 
GGTGAGAGGA 
CATTTCTTAA 
TAGTCTCAGC 
TACAGTGAGC 
GACTCAAAAA 
CCAAAGGGCT 
TTATATGTAT 
AAACCAATGA 
ATTTGAATTT 
GCTATTTAAT 
AATAGAAACA 
AATTATTAGA 
GCCTTAGAAT 
AGAAAGTGCT 
GAATGCCCCC 
ATTGCTGAAT 
TATGCCCCTT 
AATAAGACCA 
AAACTATCAA 
GTGAACCATT 
AATGTAAATA 
GATGTGGAGA 
GGAATCAAAA 
GAGGCAGGAG 
TATCTCTACA 
GTGGAGGCTG 
TTATACCACT 



TCTCCAGATG 
TTCCTTGAGG 
AAAAAAAAAA 
TACTTGTGAG 
CACAATCACA 
AATAAAAAAA 
AAAAGTAAAT 
GAATATTTCA 
ATATATTACA 
CATAAAATTT 
AATAGGTCTC 
GACTGATTAA 
AGGCAGGT G A 
TGGACTAAGG 
GCTGCCTCCC 
ACCCGCACCG 
TCAAAAAAGA 
TCATAGGCTG 
GAATTTCTCA 
TCATGTATAA 
GTTGGAGAAT 
AAAAGAACTG 
CTATTGCCAT 
GGGGCCAGGT 
GATCACTTGA 
AAAAATAGAT 
AAGTAGGAAA 
GCACTCCAGC 



CAGTAGCTCA 
CCAGGGTTCG 
AAAAAATTTA 
GCTGAGGCAG 
CCAATCACTG 
ATTGTAGTGG 
TACTTATAAA 
TAGTTTTGCA 
TATTCTGTGT 
TCCCATGTCA 
ATTTATTCCA 
GCAGGAGAAG 
ACCAGGAGGG 
AAGAAGCTGC 
TGCCCCACCT 
GAACAGCAAC 
AGTTGCATAC 
CTAGGGAGTT 
TATGTTGTGA 
ATCCAACAAA 
CTACTAAAAT 
GCAGTGTATA 
AGACCACAAT 
GCAGTGGCTC 
AGCCAGTTTT 
TAGCTGGGCA 
TCACTTGAGC 
CTGGGCAAGA 



CACTGTAATC 
AGACCAACCT 
AACTTAGCTG 
GAGGATTCTT 
CACTCCAGCC 
TAGCCATGTG 
TTTTTTATAG 
TATCAGATGT 
TCCAATAAAA 
AGAATACAAA 
CAGGCTGTAG 
TTTTTTGAAA 
TAAGCTTCCA 
TGACACTCCA 
TTGCCACTTC 
AAAAGGATTC 
AAAGACATCT 
TTCCTGGTTC 
GAGGATTCAA 
CACTTTGTAA 
ACGGCTTCCC 
TCAGATGTTT 
GTAAATTTTT 
ACATCTATAA 
GAGACCAGCC 
CGGTGGTGCA 
CCGAGAGTTT 
GAGCAAGACC 



Seq ID NO: 44 Protein sequence: 
Protein Accession ft: CAB55276.2 



MNGEYRGRGF 
TPQSMQSTLD 
ILVDFKELTE 
LSNDGETMVN 
LCAACGEIQS 
QREAGRIPRT 
SISNSKGQKT 
KAGLALALFG 
TTTTSGLTVT 
LAKAGWCSL 
HDHLLSEHVI 
IPHQIiLRKYI 

SALNNVAERT 



11 
I 

GRGRFQSWKR 
RFIPYKGWKL 
GGEVTNLIPD 
VPHIHARVYN 
FPLPDGKYSL 
IECELVHDLV 
KSSEDGCKHG 
GSQKYADDKN 
LSKDSSSGDF 
PARTS I I AAA 
AIRAGKQRTI 
GYARQYVYPR 
EATKEDAEDI 
YNNIFQFHQL 



21 

! 

GRGGGNFSGK 
YFSEVYSDSS 
IATELRDAPE 
YEPLTQLKNV 
PTKCPVPVCR 
DSCVPGDTVT 
MLMEFSLKDL 
RIPIRGDPHZ 
ALEAGALVLG 
NPVGGHYNKA 
SSATVARMNS 
IiSTEAARVLQ 
VEIMKYSMLG 
RQIAKELNIQ 



31 
I 

WREREHRPDL 
PLIEKIQAFE 
KTLACMGLAI 
RANYYGKYIA 
GRS FT AliRS S 
ITGIVKVSNA 
YAIQEIQAEE 
LWGDPGLGK 
DQGICGIDEF 
KTVSENLKMG 
QDSNTSVLEV 
DFYLELRKQS 
TY8DEFGNLD 
VADFENFIGS 



ACAGTGACTC 
TGGGCAACAT 
GGTATGGTGG 
TGAGCCCAGG 
TGGGCAATAA 
TTAATTGTTA 
TTGTATTTTT 
AGGCATACAG 
CTTTATTTAT 
ATACTTGAGT 
TTTGTAGTCT 
GAATTTTGTT 
GCAGCAATTT 
CTGCCACACA 
TGCAGCAGGA 
TGCATGAGAT 
GATTGAAAAA 
TACTTTCAGG 
ATGTTACAGG 
CATACAAGAA 
GCAAACGAAG 
AACTATAGGA 
AAGTGAGGAA 
TCCCAGAGCT 
TATGCAACAC 
TGCCTATTGT 
GAGGTTACAG 
TTGTCTCTT 



41 
I 

SKTTGKRTSE 
KFFTRHIDLY 
HQVLTKDLER 
LRGTWRVSN 
PLTVTMDWQS 



Seq ID NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_005416.1 

Coding sequence: 149.. 658 



1 

I 

ACCAGATCCC 
CTGAAGACCA 
AAAGAGTGTG 
CCCACCACCT 
AATATTTGTT 
AAAGATTCCA 
GCCAGGCTGT 
CAAGGTCCCT 
ACCAGGCAGC 
CAAAGTTCCT 
GCCATGTCCT 
TGGTGCACAG 
TGTTTCTGTG 
AGTCTCTCTC 
CTGAAGAATC 
GGCTGCTCAG 
CTCATTAAAT 



11 
I 

AGAGGCTGAA 
GAAAAGCCAC 
TCCACGATCC 
CAGCTTCAAC 
CCCACAACCA 
GAGCCAGGCT 
ACCAAGGTCC 
GAGCCAGGTT 
ATCAAGGTCC 
GAGCAAGGAT 
TCAACGGTCA 
ACAAGCCCTT 
TCTTAATTGT 
TTATTTGTAT 
CTGTAAGCCC 
GGTTCATCTG 
TGCTTTTAAT 



21 
I 

CACCTCGACC 
TAAGACTTTC 
TTTGAAGCAT 
AGCAGCAGGT 
AGGAGCCATG 
GTACCAAGGT 
CTGAGCCAGG 
GTACCAAGGT 
CTGACCAAGG 
ACACCAAAGT 
CTCCAGGCCC 
GAGAAGCCAA 
CTGTAGACCT 
CCTAAAAATA 
CTGAATTAAG 
AAGATTCGAA 
TCCA 



31 
I 

TTCTCTGCAC 
TGCTTAATTC 
GAGTTCTTAC 
GAAACAACCC 
CCACTCAAAG 
CCCTGAGCCA 
TTGTACCAAG 
CCCTGAGCCA 
CTTCATCAAG 
TCCTGTGCCA 
AGCTCAGCAG 
CCACCAGATG 
TGTAATCAGC 
CGTACTATAA 
CAGAAAGTCT 
TGAAAAGAAA 



NLFKLIVNSL 
SQMLQAACNV 
DKMGNQHQAL 
SALLSRFDLV 
VSEKPLSERL 
QRLNSSPITT 
FERSQHGSGM 
LNDQGYLLKK 



41 
I 

AGCAGATGAT 
AGGAGCTTAG 
CAGCAGAAGC 
AGCCAGCCTC 
GTTCCACAAC 
GGCTGTACCA 
GTCCCTGAGC 
GGCTACACCA 
TTTCCTGAGC 
GGCTACACAA 
AAGACCAAGC 
CTGGACACCC 
ACATTGTCAC 
AGCTTTTGTT 
TCATGGCTTT 
TGCATGTTTC 



AGGAGGCTGA 
AGCAAGACCC 
CACATGCCTA 
AGTTTGAGGT 
AGTAACTCTT 
AATAAATTCT 
GACCTGCCTT 
ACAAATACAT 
GGACACTAAA 
TTTGTTTTTA 
TGCTTGAAAC 
TGGCTCACGG 
GTAAAACCAT 
GGGCACTGGA 
ATAGGTAGAA 
GCCTCCCTAA 
GGGTATGTTA 
TGGTGGGATC 
GTTGCCAGCC 
CTCAGGAAAT 
ATGAATGGAA 
CCAGAACTAA 
GGAAAAATCA 
TTGGGAGTTC 
ATTGAGACCC 
CCTACCTACT 
TGAGCTATGA 



51 
I 

QTPQFLLSTK 
DKDEIERKGS 
HAAELQAQEG 
IKPLCTKMAF 
IKIQELMSDD 
CMFLLYIEAN 
CPVIFGHELV 
APRGVYVCGN 
LEAMEQQSIS 
FILLDTPNEH 
KWPGETIDP 
RQLESLIRLT 
SNRSTAKRFI 
GPKVYQLQTM 



51 
f 

CCCTGAGCAG 
AGGATT CTTC 
AGACCTTTAC 
CACCTCAGGA 
CTGGAAACAC 
AGGTCCCTGA 
CAGGCTGTAC 
AGGTCCCTGA 
CAGGTGCCAT 
AGCTACCAGA 
AGAAGTAATT 
TCTTCCCATC 
CCCAAGCCAT 
CACACACACT 
TCTGGTCTTC 
CTGCTCTTCC 



Seq ID NO: 46 Protein sequence: 
Protein Accession #: NP_005407.1 



11 



21 



31 



41 



51 



MSSYQQKQTF TPPPQLQQQQ VKQPSQPPPQ EIFVPTTKEP CHSKVPQPGN TKIPEPGCTK 
VPEPGCTKVP EPGCTKVPEP GCTKVPEPGC TKVPEPGCTK VPEPGYTKVP EPGSIKVPDQ 
GFIKFPEPGA IKVPEQGYTK VPVPGYTKLP EPCPSTVTPG PAQQKTKQK 

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
24 0 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
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10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

1 11 21 31 

I I I I 

GCGTCGTGTG CAGGCGTCCC CGGGCTGTGQ ATAATTAGAC 

AAGGCTOGTT AGAATTCGCC CTAGAGCTGT ATCATGTATT 

TTGCAATTAA GCTTAGGGAA CCAGCAACAA AAGCAAACTT 

GAAAATGGAT TAGAGAAACT TCTTCCCCGA TTTAAGGGGA 

TTXGGGGAAA GTGCCCCGAC CGCAGAGGCG ACGACAGGGG 

AGTCGGCGTT GGCGG CAGCG GTGGCCTTCC TCATCTGGGC 

TAAGGATAAC ATCCTGGAAA TGACTTCTGT ACGGTTTGAG 

TTGGAGCTGC CCTGTGGAGT TACAGTTTAC CAAACACATT 

CTAAAAACTT TGTGAGAATT TTCTTTTACT AAAATTTTTT 

Seq ID NO: 48 DNA sequence; 

Nucleic Acid Accession #: CAT cluster 



PCT/US02/12476 



41 

1 

ACGTTCTTCC 
TTCTTTCAAA 
GGCCCGAGGT 
AAGATTCCTG 
AGCAGGAAGC 
GATGTGGGCT 
CCCAACTGCA 
CATGAACATA 
CTTATTACAA 



51 
I 

CTCATTGCCC 
TTAACTTTGC 
CGTTCACCGC 
CGGCCAGCGC 
TGCTCACGGT 
CCTAGAAGAG 
CACTCATGAC 
ATCTCATTTA 
A 



TTCCAAATTT 
TTTTAGTAAA 
CTCCAAGTCA 
TCCTTACTCT 
CCGACTACCG 
CCCAAAGCGC 
ATTTTCGCGG 
TTGCAAGCAA 
AGCCTTGGGC 
CGACGCT 



11 
I 

TTTTTTTTGT 
TGAGATTATG 
TGAGTGTGCA 
TCTOGGAGCC 
TGAGCAGCTT 
TGGCCGCAGG 
TGAACGACCT 
AGTTAATTTG 
AATGAGGGAA 



21 
I 

AATAAGAAAA 
TTCATGAATG 
GTTGGGCTCA 
CACATCGCCC 
CCTGCTCCCC 
AATCTTTCCC 
CGGGCCAAGT 
AAAGAAAATA 
GAACGTGTCT 



31 

I 

AATTTTAGTA 
TGTTTGGTAA 
AACCGTACAG 
AGATGAGGAA 
TGTCGTCGCC 
CTTAAATCGG 
TTGCTTTTGT 
CATGATACAG 
AGTTATCCAC 



41 
I 

AAAGAAAATT 
ACTGTAACTC 
AAGTCATTTC 
GGCCACCGCT 
TCTGCGGTCG 
GGAAGAAGTT 
TGCTGGTTCC 
CTCTAGGGCG 
AGCCCGGGGA 



51 
I 

CTCACAAAGT 
CACAGGGCAG 
CAGGATGTTA 
GCCGCCAACG 
GGGCACTTTC 
TCTCTAATCC 
CTAAGCTTAA 
AATTCTAACG 
CGCCTGCACA 



Seq ID NO i 49 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 
I 

TCTTTCTTCT 
CCTGCCGACC 
ACCGTAGACC 
GACACATGCA 
CCCAACCAAA 
TTCATTTAAA 
CTTTCTCTGA 
TCGCTGTAGC 
TAAAGAAACT 
GATTGAACCA 
TGCTGGTATC 
GACGTGGGGA 
CCGGGTCTCT 
GAGACAGCAT 
ACTG 



11 
I 

GCTGCTCGTT 
TCTGTTGTCT 
CGAAACCATT 
GACCAGTTTT 
GTGTTTAAAA 
AAACTCTAAT 
TCTGTGTCTT 
CATGGGAATC 
GACACAGGAG 
GTGCACTCCA 
GTCCTGCAGC 
GAGCTGGTCT 

CATTTATGAG 



21 
1 

TGTCTCTCCT 
CTTCTCTGAT 
GGGTGTCACA 
CCTGGAACNG 
CTTTTTAGGG 
ATTTATATTA 
TTTT CTTTGA 
CGTTTCATTA 
AATCACTTGA 
GCCTTGGCAG 
CCCATCCTCG 
ATATATCCGG 
GGGACCTAGT 
CCTGCAGCAT 



8eq ID NO: 50 DNA sequence 
Nucleic Acid Accession #: L0S1B7 
Coding sequence: 1991.. 2260 



CTGCAGGGAG 
TCAGAAAGGA 
CAGAAGAAGG 
TGAAGGAAAG 
AG AGT CAT AA 
GGAAATGGAT 
ATTTCTAGCT 
CCCCTCCCTT 
ACAACCATCT 
CCAGGGTTAA 
CCCTGCACCT 
CAGCCAGCTA 
GATGAGGATG 
AGCTTCTATT 
TCACACCAAA 
ATTGCAACAA 
ATATGTGTAA 
TATTTTAAGT 
CCTCAGTAGA 
AGTTCATAGC 
TGACAAGATA 
ATTTAAGGCA 
AACATAAAAC 
AGTAATTGGC 
AGGAGACCTC 
AGATGGGAAG 
GAGGCTTAGA 
GAGGAAAGTG 
GAAGCCAGCT 
GAGCCAAGAA 
TTCAAAGGGC 



11 
[ 

GCAGGTAGAA 
GGAAAAGGCC 
ATTAGCCCCT 
CAGGTTTTCC 
GTAAATTATT 
GGAAGGTCTT 
TCCACCTTCA 
TCCCACCTAT 
CAATGACAAG 
CTCATGAAAC 
GGGTCTGAGG 
GTGCCAAAAA 
GTAGTGTGAG 
TCCTTGAGGC 
CCCAAGGGAC 
ACTGGCAATT 
GCAGGTTAAT 
TAAATTACAG 
TAGTCATTGA 
AGAACTAGAA 
TTTATAGAAA 
GTATGCTAGG 
CTAGCAGGAA 
ATGACGGAGA 
TAGGGTGTCA 
AAAAGCATTT 
TGAATATAAA 
GTCTGATGCC 
TTAGTAGGGC 
GAGAACTCCA 
CTGAAAATTA 



21 
I 

AAGGCTTTTG 
AGGGCAGATG 
GAAAGTCCCT 
CAG ATT AG CA 
CTGAATGTGT 
GGACTCTGAG 
CCAAGGCAGA 
TCATGTGTGC 
GACAGCAGGT 
CCTCCATGAA 
ATGAGGGTGG 
ATATCAGGTG 
TCATGTGTGA 
AGGGCTCATT 
CACACAGCCC 
CTAGTGTACT 
CCAGGGTTTC 
TCTGGATTTG 
ACTGGGAGTC 
CTCAGGCCAG 
TTTTAATTTA 
CACTTTGGAC 
GGTAATACAT 
TGGGCAGAGA 
AGTGATGTGA 
GGAAGGGACT 
GCCATCCTAT 
ATTTTCCAAA 
ATTT TTCCAG 
ATAAAATGGA 
TCCAAGCTTA 



31 
I 

GTGCTCTTCT 
GGCGGGGGGC 
AGCCGGTCGC 
CATGACCATG 
CACCCCCAAA 
AATACAAAGA 
CAGCATCTCC 
TTATGGTAGC 
ACTTGGGAGG 
CGGAGCAAGA 
GTTCCATTGC 
GTGAAGCTCA 
ATTTTTGCCA 
CCACCCTACT 



31 
1 

GGTTTTCAGG 
TCTGGGTGGA 
GAAGTAGGAG 
ACCAGTCAGG 
GTAGTTTAAT 
ACAAGGGGTC 
CAAGGAGGGC 
AAGAGTGCCC 
GGCAAGGCTC 
GCCTGCTGCT 
CAGTGAAAAT 
GTGTTCATCA 
CAGGTGAGGA 
CATCTTATAA 
ATTCTGCTCC 
TTTTCATTAT 
AATGGGAGAT 
AAAGGACCTT 
CTGGAGAAGA 
AGCACTCTCA 
TTAGATGGAT 
AAATCAATGC 
ATATATAAAT 
AGGGCTGTGC 
GCTATGATGG 
GTGTAAGCAC 
AAGTCACAGG 
AGACCTAATA 
AACAGATATA 
GCAGAAGAAA 
TTTCATTTTT 



41 
I 

TCTTTCTTTC 
GGGAGAAGCT 
CGGCTTTTTT 
TTATTACTAT 
ATTTTTTTTT 
TACCCAAACC 
ATTTTTTTTC 
AATATGGAGT 
CAGAGTTTGC 
TTCTGTCACA 
GCTGCCAGGC 
GCTGTGGCAC 
CGAGTGTACA 
GCTGTATCCA 



51 
I 

CCTCGCCGCT 
GACCGGTGAG 
GGGAGAACCC 
GGGCCGCCTC 

CTTTATGCTT 
TGCTGCTTCA 
GCTGTATTCC 
AGTGAGCCGA 
GTTCCTGAAG 
AGGGTGCTGG 
ACCTTGGATG 
CCAAACAAAG 
GTTTCCATTG 



41 
I 

TGGGGGGCAG 
GTGAAGGGAA 
AAGGGTAAAG 
GGGAGGAAGG 
GGAATTGGGA 
TATAATCAGT 
CCACCTCAGC 
TGTCCCACAG 
AACAGGACTC 
CACCCCTCCC 
TAGGCCAGTG 
AATAAGCCGA 
ATGAAAACAG 
AAGCCAGCTG 
GTATACCAGG 
TAGAAATTAG 
AGAGAATAGT 
AGAGATGGTT 
TTGTTCAAAT 
GTAACACTGC 
CTCTACTGAG 
CCTAACGTAC 
AAATGAAATG 
ACTTTTGGGA 
AGGGGTATTT 
AGACCAGAAG 
CTTTCTACAT 
TGCGGACCTC 
AGGTGCCTTG 
TTGCCTTTTA 
AAATGTAATG 



51 
I 

TCTAGCCTGA 
AAAGTGATCC 
GTGTGGTTGG 
TGAGAGTGGG 
AAAAGATGGG 
CCATTTCATT 
TCCTCTGCTC 
AACACGGGGA 
AGATGTCCCC 
TCAAGGCAAG 
ACATCATTTT 
GCCAACCGGT 
AGTGCCCGAG 
GCCATTGCCT 
TAAGTCTCTG 
CTAAAGGCAA 
GGAATATCTT 
AGGGCTCCCA 
GCCCATGGGA 
AATTTCCCCC 
CATTTATTCC 
TTACTTAACA 
CAAAGTAGAT 
GACTTGCTCA 
GGACAAGCAG 
CAAAACCATA 
GGTACTAGGA 
ATGTCCCTCA 
GGTAGGAAGG 
GCTCCTCCTC 
GGGGAGCTAA 



60 
120 
160 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



207 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

GGGAGATGAA AGGCTTTCTC TTCTAAAGGG 
TTGTATCCAT CTTTCTTTAA TTGAATCACT 
CATTTGAAGC ATGAATTCTC AGCAGCAGAA 
GCAGCAGCAG GTGAAACAAC CTTGCCAGCC 
CAAGGAGCCC TGCCAACCCA AGGTGCCTGA 
CCAGCCCAAG ATTCCAGAGC CCTGCCAGCC 
CACTCCAGCA CCAGCCCAGC AGAAGACCAA 
TTGAGGAGCT GGCCACTGGA TACTGAACAC 
GCCTATTGAC CCTGCAGTTA GCATGCTGTC 
CTAAAAAGAT GTCCCTTACC CTCATTCTGG 
GTCTCACTGA CTGAGCTAGT CTTCTTGTTG 
AGGTCAAGTG ACCATCCCTA G 

Seq ID NO: 51 Protein sequence: 
Protein Accession #:AAC26B38 



PCT/US02/12476 



TCCTGAAATA 
GTGTCAGCTT 
GCAGCCTTGC 
TCCACCCCAG 
GCCCTGCCAC 
CAAGGTGCCT 
GCAGAAGTAA 
CCTACTCCAT 
ACCCTGAATC 
AGGCTCCTGA 
CTCGGGTGCA 



AAATCTGTTT 
TCTGTCTCTA 
ACCCCACCCC 
GAACCATGCA 
CCCAAAGTGC 
GAGCCCTGCC 
TGTGGTCCAC 
TCTGCTTATG 
ATAATCGCTC 
GCCTCTGCGT 
TTTGAGGATG 



GGCATTGAAT 
GAAAAAAACA 
CTCAGCCTCA 
TCCCCAAAAC 
CTGAGCCCTG 
CTTCAACGGT 
AGCCATGCCC 
AATCCCATTT 
CTTTGCACCT 
AAGGCTGAAC 
GATTTGGGGA 



SI 



1 11 21 31 41 

I I I I I 1 

MNSQQOKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPBPCH PKVPEPCQPK 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 



seq id NO: 52 DNA sequence 

Nucleic Acid Accession #: NM_002638. 

Coding sequence: 120-473 



1 
I 

CAATACAGCT 
GCTGGACTGC 
TGAGGGCCAG 
AGGCAGCTGT 
TCAATGGACA 
CGCAAGAGCC 
TCCGGTGCGC 
TCAAGAAGTG 
CGGTCCTTGC 
TGCTGCCCTT 
GAGCTGCCTC 



11 
I 

AAGGAATTAT 
ATAAAGATTG 
CAGCTTCTTG 
CACGGGAGTT 
AGATCCCGTT 
AGTCAAAGGT 
CATGTTGAAT 
CTGTGAAGGC 
TGCACCTGTG 
CCCCTTCCCA 
TCTCATCCAC 



21 
I 

CCCTTGTAAA 
GTATGGCCTT 
ATCGTGGTGG 
CCTGTTAAAG 
AAAGGACAAG 
CCAGTCTCCA 
CCCCCTAACC 
TCTTGCGGGA 
CCGTCCCCAG 
CACTGTCCAT 
TTTCCAATAA 



31 
I 

TACCACAGAC 
AGCTCTTAGC 
TGTTCCTCAT 
GTCAAGACAC 
TTTCAGTTAA 
CTAAGCCTGG 
GCTGCTTGAA 
TGGCCTGTTT 
AGCTACAGGC 
TCTTCCTCCC 
A 



41 
I 

CCGCCCTGGA 
CAAACACCTT 
CGCTGGGACG 
TGTCAAAGGC 
AGGTCAAGAT 
CTCCTGCCCC 
AGATACTGAC 
CGTTCCCCAG 
CCCATCTGGT 
ATTCAGGATG 



51 
I 

GCCAGGCCAA 
CCTGACACCA 
CTGGTTCTAG 
CGTGTTCCAT 
AAAGTCAAAG 
ATTATCTTGA 
TGCCCAGGAA 
TGAAGGGAGC 
CCTAAGTCCC 
CCCACGGCTG 



Seq ID NO: 53 Protein sequence: 
Protein Accession #: np_002629.i 

11 



1 11 21 31 41 51 

I I I I I I 

MRASSPLIW VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 
AQEPVXGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 54 DNA sequence 
Nucleic Acid Accession #: NM_019618 
Coding Bequence: 75-584 



GGCACGAGCC 
GAGACAACCA 
ATCAATCAAT 
CCCTTCAGGG 
TTGCTGTTAT 
ATTTGGGAAT 
CATTGCAGCT 
CCTTCCTTTT 
CGGACTGGTT 
GGAAGTCATA 
GCAGCTTGGT 
AGTGTCATTT 
TAATGAAGAA 
GGAGAGCTGG 
CTGCATGAGT 
TGAAGATGCT 
CTCTGTTTCT 
CCAATATACC 
TAATTCTTGT 
AATAAACTTT 



11 

I 

ACGATTCAGT 
CACTATGAGA 
GTGTAAACCT 
TCAGAACCTT 
CACATGCAAG 
CCAGAATCCA 
AAAAGAGCAG 
CTACCGTGCC 
CATTGCCTCC 
CAACACTGCC 
CTTTGTCTTA 
TCACGCTGGT 
GAAGCAATTA 
GTGGTATAAG 
GACTTTAAGA 
TCAGAGCTCA 
GTTTTGCTTT 
TCATTGTGTG 
GTTAAGTTAA 
GTGTATTTAT 



21 
I 

CCCCTGGACT 
GGCACTCCAG 
ATTACTGGGA 
GTGGCAGTTC 
TATCCAGAGG 
GAAATGTGTT 
AAGATCATGG 
AAGACTGGTA 
TCCAAGAGAG 
TTTGAATTAA 
AAGTTTCTGG 
GCTGAGACAG 
CTTCATAGCA 
GCTGTCCTCT 
CTCAAAGACC 
TGCGCGTTAC 
ATTCCCTCTT 
TAATAGAACC 
ATCATTTTTG 
ATAATAAAAA 



31 
I 

GTAGATAAAG 
GAGACGCTGA 
CTATTAATGA 
CACGAAGTGA 
CTCTTGAGCA 
TGTATTGTGA 
ATCTGTATGG 
GGACCTCCAC 
ACCAGCCCAT 
ATATAAATGA 
TTCCCAATGT 
GGGCAAGGCT 
ACTGAAGAAC 
CAAGCTGGTG 
AAACACTGAG 
CCACGATGGC 
GGGATGATAT 
TTCTTAGCAT 
TCCTAATTGT 
AAAAAAAAAA 



41 
I 

ACCCTTTCTT 
TGGTGGAGGA 
TTTGAATCAG 
CAGTGTGACC 
AGGCAGAGGG 
GAAGGTTGGA 
CCAACCCGAG 
CCTTGAGTCT 
CATTCTGACT 
CTGAACTCAG 
GTTTTCGTCT 
GCTGTTATCA 
AGGATGTGGC 
CTGTGTAGGC 
CTTTCTTCTA 
ATGACTAGCA 
CATCCAGTCT 
TAAGACCTTG 
AATGTGTAAT 
AAA 



51 
I 

GCCAGGTGCT 
AGGGCCGTCT 
CAAGTGTGGA 
CCAGTCACTG 
GATCCCATTT 
GAACAGCCCA 
CCCGTGAAAC 
GTGGCCTTCC 
TCAGAACTTG 
CCTAGAGGTG 
ACATTTTCTT 
TCTCATTTTA 
CTCAGAAGCA 
CACAAGGCAT 
GGGGTGGGTA 
CAGAGCTGAT 
TTATATGTTG 
TAAACAAAAA 
CTTAAAGTTA 



Seq ID NO: 55 Protein sequence: 
Protein Accession #: NP_062564 

1 .11 21 31 41 51 

I I I 1 I I 

MRGTPGDADG GGRAVYQSMC KPITGTINDL NQQVWTLQGQ NLVAVPRSDS VTPVTVAVIT 
CKYPEALEQG RGDPIYLGIQ NPEMCLYCEK VGEQPTLQLK EQKTMDLYGQ PEPVKPFLFY 
RAKTGRTSTL ESVAFPDWFI ASSKRDQPII LTSELGKSYN TAFELNIND 

Seq ID NO: 56 DNA sequence 
Nucleic Acid Accession #: NM_003125 
Coding sequence: 65-334 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 



208 



WO 02/086443 PCT7US02/12476 

31 41 51 

I I I 

CTCTCTTCAC ACCAGGACCA GCCACTGTTG 60 

CTGCATCCCA CCCCCTCAGC TTCAGCAGCA 120 

TCAGGAACCA TGCATCCCCA AAACCAAGQA 180 

CCACCCCAAA GTGCCTGAGC CCTGCCAGCC ■ 240 

GCCTGAGCCC TGCCCTTCAA TAGTCACTCC 300 

GTAATGTGGT CCACAGCCAT GCCCTTGAGG 360 

CCATTCTGTG TATGAGTCCC ATTTGCCTTG 420 

ATGTGCTATG AAGCTTTCTT TCCTACACAC 480 

AGTACCAGAG CTAGTTTTCA GCTGCTCAGA 540 

CAAATGATTC AGCTCCCTTA TACCCCCATT 600 



31 41 51 

1 I 1 

EPCIPKTKEP CHPKVPEPCH PKVPEPCQPK 60 



Seq ID NO: 5B DNA sequence 

Nucleic Acid Accession #: NM_001793.2 

Coding sequence: 71-2560 

30 1 11 21 31 41 51 

I I I 1 I I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 

35 CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

40 AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CA GTGG AGGA 660 

CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATG CAGGT 7 BO 

45 GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

50 GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG CCACCTACCT TAT CATGGGC GGTGACGACG GGGACCATTT 1200 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380 

55 ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA T CAGCT ACCG 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

60 ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCAT CTGCAA 1740 

CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA I860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

65 GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 

70 CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 

75 GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTT CC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 

80 TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTAT TTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 

85 TAAAGAAACT TTTCCCAGAA AAAAA 



1 11 21 

I I I 

AGCAGTTCTA AGGGACCATA CAGAGTATTC 
5 CAGCATGAGT TCCCAGCAGC AGAAGCAGCC 

GCAGGTGAAA CAGCCTTGCC AGCCTCCACC 
GCCCTGCCAC CCCAAGGTGC CTGAGCCCTG 
CAAGCTTCCA GAGCCATGCC ACCCCAAGGT 
AGCACCAGCC CAGCAGAAGA CCAAGCAGAA 
10 AGCCGGCCAC CAGATGCTGA ATCCCCTATC 
CAATTAGCAT TCTGTCTCCC CCAAAAAAGA 
TCTGAGTCTC TGAATGAAGC TGAAGGTCTT 
ATTCATCTGA AGAGAGACTT AAGATGAAAG 
AAATTCACTT TCAATTCCA 



15 
20 



Seq ID NO: 57 Protein sequence: 
Protein Accession #: NP_003116 

1 11 21 

I I i 

MSSQQQKQPC IPPPQLQQQQ VKQPCQPPPQ 
LPEPCHPKVP EPCPSIVTPA PAQQKTKQK 



Seq ID NO: 59 Protein sequence: 



209 



WO 02/086443 

Protein Accession NP_001784.2 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



MGLPRGPIiAS 
QEPALF9TDN 
KGPFPQRLNQ 
YELFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATIWH 
DPAGWLAMDP 
VNDHGPVPEP 
TWLSLKKFIi 
GAVLALLFLL 
GLEARPEWL 
DYEGSGSDAA 



11 

I 

LLLLQVCWLQ 
DDFTVRNGET 
LKSNKDRDTK 
NGASVEDPMN 
GWAYSIHSQ 
TAVAWEI LD 
GDDGDH FT I T 
VEDVNEAPVF 
DSGQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLLLLVRKK 
RNDVAPTIIP 
SLSSLTSSAS 



21 

I 

CAASEPCRAV 
VQERRSLKER 
ZFYSITGPGA 
ISIIVTDQND 
EPKDPHDLMF 
ANDNAPMFDP 
THPESNQGIL 
VPPSKWEVQ 
LDREDEQFVR 
VRQVLNITDK 
LSDHGNKEQL 
RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYLN 



31 
I 

FREAEVTLBA 
NPUCIFPSKR 
DSPPEGVFAV 
HKPKFTQDTF 
TIHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
EGIPTGEPVC 
NNIYEVMVLA 
DIiSPHTSPFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNFIIE 
EWGSRFKKIiA 



Seq ZD NO: 60 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 162-428 



GCGTTCCGTT 
CATACGGACC 
AGCACCTCGG 
TCT CCCAGAG 
AGCGAAAGAA 
GTTTACTGTT 
GTAGAGTCAT 
GAGGTTAGAA 
AGATCATAAA 



11 

I 

GGCGGCGGAT 
GGATTGTTTT 
AAGCTGAGGC 
GAAGCAGATA 
GCCTCAACTT 
TGTTCATCGA 
TAACAAGGAG 
GTCAAAGAAC 
GACATTTTTT 



21 

I 

TCGAACGTTC 
CGCTGGCCCA 
AGCTGGTACT 
AAGCGGAAGG 
CGTCTGGAGA 
TTAGCAGAAG 
CATGTACTGG 
ATATTCTTGA 
ACACATCAGT 



31 
I 

GGACTGAGGT 
GTGTCCCCGG 
TGACAGAGAG 
CTCCCCGTGG 
AAAGTGGTGA 
AGTCCAGGAC 
CCGCAGCAAA 
AAGTTATGAT 
TAATATGGGA 



41 
I 

GGAEQEPGQA 
ILRRHKRDWV 
EKETGWLLLN 
RGSVLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNQHTLYVEV 
VYTAEDPDKE 
MDNGSPPTTG 
AQLTDDSDIY 
HGHVETCPGP 
YGEEGGGEED 
NLKAANTDPT 
DMYGGGEDD 



41 
I 

TTTTCTGCCT 
AGCTTGTGTG 
GATGGCGCTG 
CTTTCTAAAG 
CTTATTGGTC 
AAACGCTTGT 
GGTAATTCTA 
GCATTCTTTT 
TTATTAAATA 



51 
I 

IiGKVFMGCPG 
VAPISVPENG 
KPLDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
VTDLDAPNSP 
TNEAPFVLKL 
NQKISYRILR 
TGTLLLTLID 
WTAEVNEEGD 
WKGGFILPVL 
QDYDITQLHR . 
APPYDTLIiVF 



51 
I 

GAAGAAGCGT 
CGATACAGAG 
TCGACCATAG 
CGAGTCTTCA 
CATCTGAACT 
GCGAGTAAAT 
AAGAAGAGCA 
GGGTGGTAAC 
TTGG 



Seq ID NO t 61 Protein sequence i 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MALSTIVSQR KQIKRKAPRG FLKRVFKRKK PQLRLEKSGD LLVHLNCLLF VHRLAEESRT 

NACASKCRVI NKEHVLAAAK VILKKSRG 

Seq ZD NO: 62 DNA sequence 

Nucleic Acid Accession tf: NM_J)00094.2 

Coding sequence: 99-8933 



GGG CTGGAGG 
GAGGCGGGGG 
CCGCGCTCTG 
GAGTGACCTG 
CCATTGGCCG 
TCTCTGGAGC 
CACGGACAGA 
GTGAGCTTAG 
ACCATGTCTT 
CAGACGGGAA 
TCAAGCTATT 
CACAGCCAAC 
TGCCCCTCGT 
CGGATGACTC 
TGAGAGTACA 
CTCTGACGGG 
GTGAGACCAG 
TTGCCCTCTA 
TAGAAGGGCC 
GGAGTGTGCC 
CACAGCAGCA 
GCACGGACTA 
CCCTGATGGC 
CCACATCCAT 
GGCGGCGTGA 
GCTACCAGTT 
TGGAGGGCCA 
TGAGCCCTGT 
GGAGCCCAGT 
AGCGGACCCT 
GGCTTAGCTA 
TCCTCACTGT 
TGTCAGATGC 
GGATTAGCTG 
CTGCCACAGA 
TGCGAGGCAG 
CAGTGAGGAC 
GGGTTCCTGG 



11 
I 

GGCGCTGGGC 
TCCTAGCTGA 
CGCCGGGATC 
CACGCGCCTT 
CAGCAATTTC 
AGCCAGTGCA 



CTACAAGGGG 
CCTGCCCCAG 
GTCCCAGGAC 
TGCTGTGGGG 
CTCCGACTTC 
TTCCCGGAGA 
GACCTCTGCT 
GTGGACAGCG 
GCTGGGACAG 
TGTGCGGCTG 
CGCCAACAGC 
GGAACTGACC 
AGGTGCCACT 
GGAGCTGGGC 
TGAGGTGACC 
TCGCACTGAC 
CCTCCTTTCC 
GACTGGCTTG 
GGATGGGCTG 
CGAGGTGGCC 
AACAGACCTG 
CCCTGGTGCC 
GGTGCTTCCT 
CACTGTGCGG 
CCGCCGGGAG 
AACGCGAGTG 
GAGCACAGGC 
CATCACAGGG 
AGAGGAGGGC 
GGTCCATGTG 
CGCCACAGGA 



21 
I 

TCGGACCTGC 
CGGCTTTTAC 
CTGGCAGAGG 
TACG CCGCTG 
CGCGAGGTCC 
CAGGGTGTGC 
GATGCACTTG 
GGCAACACTC 
CTGGCCCGAC 
CTGGTGGACA 
ATCAAGAATG 
TTCTTCTTCG 
GTGTGCACGA 
CCACGAGACC 
GCCAGTGGCC 
CCACTGCCGA 
OGGGGTCTCC 
ATCGGGGAGG 
ATCCAGAATA 
GGCTACCGTG 
CCTGGGCAGG 
GTGAGCACCC 
GCTT CTGTTG 
TGGAACTTGG 
GAGCCACCGC 
CAGCCGGGCA 
ACCCCTGCAA 
CAAGCCACCG 
ACCCAGTACC 
GGGAGTCAGA 
GTGTCTGCTC 
CCGGAAACTC 
AGGGTGGCCT 
AGTGGTCCGG 
CTGCAGCCTG 
CCTGCTGCAG 
ACTCAGGCCA 
TACAGGGTTT 



31 
1 

CAAGGCCACC 
TGCCTAGGAT 
CGCCCCGAGT 
ACATTGTGTT 
GCAGCTTTCT 
GCTTTGCCAC 
GCTCTGGGGG 
GCACAGGGGC 
CTGGTGTCCC 
CAGCTGCCCA 
CTGACCCTGA 
TCAATGACTT 
CTGCTGGTGG 
TGGTGCTGTC 
CTGTGACTGG 
GTGAGCGGCA 
GGCCACTGAC 
CTGTGAGCGG 
CCACAGCCCA 
TGACATGGCG 
GTTCAGTGTT 
TATTTGGCCG 
AGCAGACCCT 
TGCCTGAGGC 
AGAAGGTGGT 
CTGAGTACCG 
CCGTGGTTCC 
AGCTGCCCGG 
GCATCATTGT 
CAGCATTCGA 
GAGTGGGTCC 
CACTTGCTGT 
GGGGACCCGT 
AGTCCAGCCA 
GAACCACCTA 
TCATCGTGGC 
GCAGCTCATC 
CCTGGCACTC 



41 

I 

GCAGGGGGGA 
GACGCTGCGG 
GCGAGCCCAG 
CTTACTGGAT 
CGAAGGGCTG 
AGTGCAGTAC 
TGATGTGATC 
TGCAATTCTC 
CAAGGTCTGC 
AAGGCTGAAG 
GGAGCTGAAG 
CAGCATCTTG 
CGTGCCTGTG 
TGAGCCAAGC 
CTACAAGGTC 
GGAGGTGAAC 
CGAGTACCAA 
GACAGCTCGG 
CAGCCTCCTG 
GGTCCTCAGT 
GCTGCGTGAC 
CAGTGTGGGG 
GCGCCCGGTC 
CCGTGGCTAC 
ACTGCCCTCT 
CCTCACACTC 
CACTGGACCA 
GCAGCGGGTG 
GCGCAGCACC 
CTTGGATGAC 
CCGTGAGGGC 
TCCAGGGCTG 
CCCTGGAGCC 
GACACTGCCC 
CCAGGTGGCT 
TCGAACGGAC 
TGTCACCATT 
AGCCCACGGC 



51 

I 

GCAAGGGACA 
CTTCTGGTGG 
CACAGGGAGA 
GGCTCCTCAT 
GTGCTGCCTT 
AGCGATGACC 
CGCGCCATCC 
CATGTGGCTG 
ATCCTGATCA 
GGGCAGGGGG 
CGAGTTGCCT 
AGGACACTAC 
ACCCGACCTC 
AGCCAATCCT 
CAGTACACTC 
GTCCCAGCTG 
GTGACTGTGA 
ACCACTGCCC 
GTGGCCTGGC 
GGTGGGCCCA 
TTGGAGCCTG 
CCCGCCACTT 
ATCCTGGGCC 
CGGTTGGAAT 
GATGTGACCC 
TACACTCTGC 
GAGCTGCCTG 
CGAGTGTCCT 
CAGGGGGTTG 
GTTCAGGCTG 
AGTGCCAGTG 
CGGGTTGTGG 
AGTGGATTTC 
CCAGACTCTA 
GTGTCGGTAC 
CCACTGGGCC 
ACCTGGACCA 
CCAGAGAAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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CCCAGTTGGT TTCTGGGGAG GCCACGGTGG CTGAGCTGGA TGGACTGGAG CCAGATACTG 2340 

AGTATACGGT GCATGTGAGG GCCCATGTGG CTGGCGTGGA TGGGCCCCCT GCCTCTGTGG 2400 

TTGTGAGGAC TGCCCCTGAG CCTGTGGGTC GTGTGTCGAG GCTGCAGATC CTCAATGCTT 2460 

CCAGCGACGT TCTACGGATC ACCTGGGTAG GGGTCACTGG AGCCACAGCT TACAGACTGG 2520 

CCTGGGGCCG GAGTGAAGGC GGCCCCATGA GGCACCAGAT ACTCCCAGGA AACACAGACT 2580 

CTGCAGAGAT CCGGGGTCTC GAAGGTGGAG TCAGCTACTC AGTGCGAGTG ACTGCACTTG 2640 

TCGGGGACCG CGAGGGCACA CCTGTCTCCA TTGTTGTCAC TACGCGGCCT GAGGCTCCGC 2700 

CAG CCCTGGG GACGCTTCAC GTGGTGCAGC GCGGGGAGCA CTCGCTGAGG CTGCGCTGGG 2760 

AGCCGGTGCC CAGAGCGCAG GGCTTCCTTC TGCACTGGCA ACCTGAGGGT GGCCAGGAAC 2820 

AGTCCCGGGT CCTGGGGCCC GAGCTCAGCA GCTATCACCT GGAOGGGCTG GAGCCAGCGA 2880 

CACAGTACCG CGTGAGGCTG AGTGTCCTAG GGCCGGCTGG AGAAGGGCCC TCTGCAGAGG 2940 

TGACTGCGCG CACTGAGTCA CCTCGTGTTC CAAGCATTGA ACTACGTGTG GTGGACACCT 3000 

OGATCGACTC GGTGACTTTG GCCTGGACTC CAGTGTCCAG GGCATCCAGC TACATCCTAT 3060 

CCTGGCGGCC ACTCAGAGGC CCTGGCCAGG AAGTGCCTGG GTCCCCGCAG ACACTTCCAG 3120 

GGATCTCAAG CTCCCAGCGG GTGACAGGGC TAGAGCCTGG CGTCTCTTAC ATCTTCTCCC 3180 

TGACGCCTGT CCTGGATGGT GTGCGGGGTC CTGAGGCATC TGTCACACAG ACGCCAGTGT 3240 

GCCCCCGTGG CCTGGCGGAT GTGGTGTTCC TACCACATGC CACTCAAGAC AATGCTCACC 3300 

GTGCGGAGGC TACGAGGAGG GTCCTGGAGC GTCTGGTGTT GGCACTTGGG CCTCTTGGGC 3360 

CACAGGCAGT TCAGGTTGGC CTGCTGTCTT ACAGTCATCG GCCCTCCCCA CTGTTCCCAC 3420 

TGAATGGCTC CCATGACCTT GGCATTATCT TGCAAAGGAT CCGTGACATG CCCTACATGG 3480 

ACCCAAGTGG GAACAACCTG GGCACAGCCG TGGTCACAGC TCACAGATAC ATGTTGGCAC 3540 

CAGATGCTCC TGGGCGCCGC CAGCACGTAC CAGGGGTGAT GGTTCTGCTA GTGGATGAAC 3600 

CCTTGAGAGG TGACATATTC AGCCCCATCC GTGAGGCCCA GGCTTCTGGG CTTAATGTGG 3660 

TGATGTTGGG AATGGCTGGA GCGGACCCAG AGCAGCTGCG TCGCTTGGCG CCGGGTATGG 3720 

ACTCTGTCCA GACCTTCTTC GCCGTGGATG ATGGGCCAAG CCTGGACCAG GCAGTCAGTG 3780 

GTCTGGCCAC AGCCCTGTGT CAGGCATCCT TCACTACTCA GCCCCGGCCA GAGCCCTGCC 3840 

CAGTGTATTG TCCAAAGGGC CAGAAGGGGG AACCTGGAGA GATGGGCCTG AGAGGACAAG 3900 

TTGGGCCT CC TGGCGACCCT GGCCTCCCGG GCAGGACCGG TGCTCCCGGC CCCCAGGGGC 3 960 

CCCCTGGAAG TGCCACTGCC AAGGGCGAGA GGGGCTTCCC TGGAGCAGAT GGGCGTCCAG 4020 

GCAGCCCTGG CCGCGCCGGG AATCCTGGGA CCCCTGGAGC CCCTGGCCTA AAGGGCTCTC 4080 

CAGGGTTGCC TGGCCCTCGT GGGGACCCGQ GAGAGCGAGG ACCTCGAGGC CCAAAGGGGG 4140 

AGCCGGGGGC TCCCGGACAA GTCATCGGAG GTGAAGGACC TGGGCTTCCT GGGCGGAAAG 4200 

GGGACCCTGG ACCATCGGGC CCCCCTGGAC CTCGTGGACC ACTGGGGGAC CCAGGACCCC 4260 

GTGGCCCCCC AGGGCTTCCT GGAACAGCCA TGAAGGGTGA CAAAGGCGAT CGTGGGGAGC 4320 

GGGGTCCCCC TGGACCAGGT GAAGGTGGCA TTGCTCCTGG GGAGCCTGGG CTGCCGGGTC 4380 

TTCCCGGAAG CCCTGGACCC CAAGGCCCCQ TTGGCCCCCC TGGAAAGAAA GGAGAAAAAG 4440 

GTGACTCTGA GGATGGAGCT CCAGGCCTCC CAGGACAACC TGGGTCTCOG GGTGAGCAGG 4500 

GCCCACGGGG ACCTC CTGG A GCTATTGGCC CCAAAGGTGA CCGGGGCTTT CCAGGGCCCC 4560 

TGGGTGAGGC TGGAGAGAAG GGCGAACGTG GACCCCCAGG CCCAGCGGGA TCCCGGGGGC 4620 

TGCCAGGGGT TGCTGGACGT CCTGGAGCCA AGGGTCCTGA AGGGCCACCA GGACCCACTG 4680 

GCCGCCAAGG AGAGAAGGGG GAGCCTGGTC GCCCTGGGGA CCCTGCAGTG GTGGGACCTG 4740 

CTGTTGCTGG ACCCAAAGGA GAAAAGGGAG ATGTGGGGCC CGCTGGGCCC AGAGGAGCTA 4800 

CCGGAGTCCA AGGGGAACGG GGCCCACCCG GCTTGGTTCT TCCTGGAGAC CCTGGCCCCA 4860 

AGGGAGACCC TGGAGACCGG GGTCCCATTG GCCTTACTGG CAGAGCAGGA CCCCCAGGTG 4 920 

ACTCAGGGCC TCCTGGAGAG AAGGGAGACC CTGGGCGGCC TGGCCCCCCA GGACCTGTTG 4980 

GCCCCCGAGG ACGAGATGGT GAAGTTGGAG AGAAAGGTGA CGAGGGTCCT CCGGGTGACC 5040 

CGGGTTTGCC TGGAAAAGCA GGCGAGCGTG GCCTTCGGGG GGCACCTGGA GTTCGGGGGC 5100 

CTGTGGGTGA AAAGGGAGAC CAGGGAGATC CTGGAGAGGA TGGACGAAAT GGCAGCCCTG 5160 

GATCATCTGG ACCCAAGGGT GACCGTGGGG AGCCGGGTCC CCCAGGACCC CCGGGACGGC 5220 

TGGTAGACAC AGGACCTGGA GCCAGAGAGA AGGGAGACCC TGGGGACCGC GGACAAGAGG 5280 

GTCCTCGAGG GCCCAAGGGT GATCCTGGCC TCCCTGGAGC CCCTGGGGAA AGGGGCATTG 5340 

AAGGGTTTCG GGGACCCCCA GGCCCACAGG GGGACCCAGG TGTCCGAGGC CCAGCAGGAG 5400 

AAAAGGGTGA CCGGGGTCCC CCTGGGCTGG ATGGCCGGAG CGGACTGGAT GGGAAACCAG 5460 

GAGCCGCTGG GCCCTCTGGG CCGAATGGTG CTGCAGGCAA AGCTGGGGAC CCAGGGAGAG 5520 

ACGGGCTTCC AGGCCTCCGT GGAGAACAAG GCCTCCCTGG CCCCTCTGGT CCCCCTGGAT 5580 

TACCGGGAAA GCCAGGCGAG GATGGGAAAC CTGGCCTGAA TGGAAAAAAC GGAGAACCTG 5640 

GGGACCCTGG AGAAGACGGG AGGAAGGGAG AGAAAGGAGA TTCAGGCGCC T CTGG GAG AG 5700 

AAGGTCGTGA TGGCCCCAAG GGTGAGCGTG GAGCTCCTGG TATCCTTGGA CCCCAGGGGC 5760 

CTCCAGGCCT CCCAGGGCCA GTGGGCCCTC CTGGCCAGGG TTTTCCTGGT GTCCCAGGAG 5820 

GCACGGGCCC CAAGGGTGAC CGTGGGGAGA CTGGAT CCAA AGGGGAGCAG GGCCTCCCTG 5880 

GAG AG CGTGG CCTGCGAGGA GAGCCTGGAA GTGTGCCGAA TGTGGATCGG TTGCTGGAAA 5940 

CTGCTGGCAT CAAGGCATCT GCCCTGCGGG AGATCGTGGA GACCTGGGAT GAGAGCTCTG 6000 

GTAGCTTCCT GCCTGTGCCC GAACGGCGTC GAGGCCCCAA GGGGGACTCA GGCGAACAGG 6060 

GCCCCCCAGG CAAGGAGGGC CCCATCGGCT TTCCTGGAGA ACGCGGGCTG AAGGGCGACC 6120 

GTGGAGACCC TGGCCCTCAG GGGCCACCTG GTCTGGCCCT TGGGGAGAGG GGCCCCCCCG 6180 

GGCCTTCCGG CCTTGCCGGG GAGCCTGGAA AGCCTGGTAT TCCCGGGCTC CCAGGCAGGG 6240 

CTGGGGGTGT GGGAGAGGCA GGAAGGCCAG GAGAGAGGGG AGAACGGGGA GAGAAAGGAG 6300 

AACGTGGAGA ACAGGGCAGA GATGGCCCTC CTGGACTCCC TGGAACCCCT GGGCCCCCCG 6360 

GACCCCCTGG CCCCAAGGTG TCTGTGGATG AGCCAGGTCC TGGACTCTCT GGAGAACAGG 6420 

GACCCCCTGG ACTCAAGGGT GCTAAGGGGG AGCCGGGCAG CAATGGTGAC CAAGGTCCCA 6480 

AAGGAGACAG GGGTGTGCCA GGCATCAAAG GAGACCGGGG AGAGCCTGGA CCGAGGGGTC 6540 

AGGACGGCAA CCCGGGTCTA CCAGGAGAGC GTGGTATGGC TGGGCCTGAA GGGAAGCCGG 6600 

GTCTGCAGGG TCCAAGAGGC CCCCCTGGCC CAGTGGGTGG TCATGGAGAC CCTGGACCAC 6660 

CTGGTGCCCC GGGTCTTGCT GGCCCTGCAG GACCCCAAGG ACCTTCTGGC CTGAAGGGGG 6720 

AGCCTGGAGA GACAGGACCT CCAGGACGGG GCCTGACTGG ACCTACTGGA GCTGTGGGAC 6780 

TTCCTGGACC CCCCGGCCCT TCAGGCCTTG TGGGTCCACA GGGGTCTCCA GGTTTGCCTG 6840 

GACAAGTGGG GGAGACAGGG AAGCCGGGAG CCCCAGGTCG AGATGGTGCC AGTGGAAAAG 6900 

ATGGAGACAG AGGGAGCCCT GGTGTGCCAG GGTCAC CAGG TCTGCCTGGC CCTGTCGGAC 6960 

CTAAAGGAGA ACCTGGCCCC ACGGGGGCCC CTGGACAGGC TGTGGTCGGG CTCCCTGGAG 7020 

CAAAGGGAGA GAAGGGAGCC CCTGGAGGCC TTG CTGG AG A CCTGGTGGGT GAGCCGGGAG 7060 

CCAAAGGTGA CCGAGGACTG CCAGGGCCGC GAGGCGAGAA GGGTGAAGCT GGCCGTGCAG 7140 

GGGAGCCCGG AGACCCTGGG GAAGATGGTC AGAAAGGGGC TCCAGGACCC AAAGGTTTCA 7200 

AGGGTGACCC AGGAGTCGGG GTCCCGGGCT CCCCTGGGCC TCCTGGCCCT CCAGGTGTGA 7260 

AGGGAGATCT GGGCCTCCCT GGCCTGCCCG GTGCTCCTGG TGTTGTTGGG TTCCCGGGTC 7320 

AGACAGGCCC TCGAGGAGAG ATGGGTCAGC CAGGCCCTAG TGGAGAGCGG GGTCTGGCAG 7380 

GCCCCCCAGG GAGAGAAGGA ATCCCAGGAC CCCTGGGGCC ACCTGGACCA CCGGGGTCAG 7440 

TGGGACCACC TGGGGCCTCT GGACTCAAAG GAGACAAGGG AGACCCTGGA GTAGGGCTGC 7500 
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CTGGGCCCCQ AGGCGAGCGT GGGGAGCCAG GCATCCGGGG TGAAGATGGC CGCCCCGGCC 7560 

AGGAGGGACC CCGAGGACTC ACGGGGCCCC CTGGCAGCAG GGGAGAGCGT GGGGAGAAGG 7620 

GTGATGTTGG GAGTGCAGGA CTAAAGGGTG ACAAGGGAGA CTCAGCTGTG ATCCTGGGGC 7680 

CTCCAGGCCC ACGGGGTGCC AAGGGGGACA TGGGTGAACG AGGQCCTCGQ GGCTTGGATG 7740 

GTGACAAAGG ACCTCGGGGA GACAATGGGG ACCCTGGTGA CAAGGGCAGC AAGGGAGAGC 7800 

CTGGTGACAA GGGCTCAGCC GGGTTGCCAG GACTGCGTGG ACTCCTGGGA CCCCAGGGTC 7860 

AACCTGGTGC AGCAGGGATC CCTGGTGACC CGGGATCCCC AGGAAAGGAT GGAGTGCCTG 7920 

GTATCCGAGG AGAAAAAGGA GATGTTGGCT TCATGGGTCC CCGGGGCCTC AAGGGTGAAC 7980 

GGGGAGTGAA GGGAGCCTGT GGCCTTGATG GAGAGAAGGG AGACAAGGGA GAAGCTGGTC 8040 

CCCCAGGCCG CCCCGGGCTG GCAGGACACA AAGGAGAGAT GGGGGAGCCT GGTGTGCCGG 8100 

GCCAGTCGGG GGCCCCTGGC AAGGAGGGCC TGATCGGTCC CAAGGGTGAC CGAGGCTTTG 8160 

ACGGGCAGCC AGGCCCCAAG GGTGACCAGG GCGAGAAAGG GGAGCGGGGA ACCCCAGGAA 8220 

TTGGGGGCTT CCCAGGCCCC AGTGGAAATG ATGGCTCTGC TGGTCCCCCA GGGCCACCTG 8280 

GCAGTGTTGG TCCCAGAGGC CCCGAAGGAC TTCAGGGCCA GAAGGGTGAG CGAGGTCCCC 8340 

CCGGAGAGAG AGTGGTGGGG GCTCCTGGGG TCCCTGGAGC TCCTGGCGAG AGAGGGGAGC 8400 

AGGGGGGGCC AGGGCCTGCC GGTCCTCGAG GCGAGAAGGG AGAAGCTGCA CTGACGGAGG 8460 

ATGACATCCG GGGCTTTGTG CGCCAAGAGA TGAGTCAGCA CTGTGCCTGC CAGGGCCAGT 8520 

T CAT CG CATC TGGATCACGA CCCCTCCCTA GTTATGCTGC AGACACTGCC GGCTCCCAGC 8580 

TCCATGCTGT GCCTGTGCTC CGCGTCTCTC ATGCAGAGGA GGAAGAGCGG GTACCCCCTG 8640 

AGGATGATGA GTACTCTGAA TACTCOGAGT ATTCTGTGGA GGAGTACCAG GACCCTGAAG 8700 

CTCCTTGGGA TAGTGATGAC CCCTGTTCCC TGCCACTGGA TGAGGGCTCC TGCACTGCCT 8760 

ACACCCTGCG CTGGTACCAT CGGGCTGTGA CAGGCAGCAC AGAGGCCTGT CACCCTTTTG 8820 

TCTATGGTGG CTGTGGAGGG AATGCCAACC GTTTTGGGAC CCGTGAGGCC TGCGAGCGCC 8880 

GCTGCCCACC CCGGGTGGTC CAGAGCCAGG GGACAGGTAC TGCCCAGGAC TGAGGCCCAG 8940 

ATAATGAGCT GAGATTCAGC ATCCCCTGGA GGAGTCGGGG TCTCAGCAGA ACCCCACTGT 9000 

CCCTCCCCTT GGTGCTAGAG GCTTGTGTGC ACGTGAGCGT GCGAGTGCAC GTCCGTTATT 9060 

TCAGTGACTT GGTCCCGTGG GTCTAGCCTT CCCCCCTGTG GACAAACCCC CATTGTGGCT 9120 

CCTGCCACCC TGGCAGATGA CTCACTGTGG GGGGGTGGCT GTGGGCAGTG AGCGGATGTG 9180 

ACTGGCGTCT GACCCGCCCC TTGACCCAAG CCTGTGATGA CATGGTGCTG ATTCTGGGGG 9240 
GCATTAAAGC TGCTGTTTTA AAAGGCAAAA AA 

Seq ID NO t 63 Protein sequence t 
Protein Accession ft: NP_ 000085.1 

1 11 21 31 41 51 

I 1 I I I I 

MTLRLLVAAL CAGILAEAPR VRAQHRERVT CTRLYAADIV FLLDGSSSIG RSNFREVRSF 60 

LEGLVLPPSG AASAQGVRFA TVQYSDDPRT BFGLOALGSG GDVTRAIREL SYKGGNTRTG 120 

AAILHVADHV PLPQLARPGV PKVCILITDG KSQDLVDTAA QRLKGQGVKL PAVGI KNADP 180 

EELKRVASQP TSDFFFFVND FS I LRTLLPL VSRRVCTTAG GVPVTRPPDO STSAPRDLVL 240 

SEPSSQSLRV QWTAASGPVT GYKVQYTPLT GLGQPLPSER QEVNVPAGET SVRLRGLRPL 300 

TEYQVTVIAL YAMSIGEAVS GTARTTALEG PELTIQNTTA HSLLVAWRSV PGATGYRVTW 360 

RVLSGGPTQQ QELGPGQGSV LLRDLEPGTD' YEVTVSTLFG RSVGPATSLM ARTDASVEQT 420 

LRPVILGPTS 1LLSWNLVPE ARGYRLEWRR ETGLEPPQKV VLPSDVTRYQ LDGLQPGTEY 480 

RLTLYTLLEG HEVATPATW PTGPELPVSP VTDLQATELP GQRVRVSWSP VPGATQYRII 540 

VRSTQGVERT LVLPGSQTAF DLDDVQAGLS YTVRVSARVG PREGSASVLT VRREPETPLA 600 

VPGLRVWSD ATRVRVAWGP VPQASGFRIS WSTGSGPE8S QTLPPDSTAT DITGLQPGTT 660 

YQVAVSVLRG REEGPAAVIV ARTDPLGPVR TVHVTQASSS SVTITWTRVP GATGYRVSWH 720 

SAHGPEKSQI* VSGEATVAEL DGLEPDTEYT VHVRAHVAGV DGPPASWVR TAPEPVGRVS 780 

RLQILNASSD VLRITNVGVT GATAYRLAWG RSEGGPMRHQ ILPGNTDSAE IRGLEGGVSY 840 

SVRVTALVGD REGTPVSIW TTPPEAPPAL GTLHWQRGE HSLRLRWEPV PRAQGFLLHW 900 

QPEGGQEQSR VLGPELSSYH LDGLEPATQY RVRLSVLGPA GEGPSAEVTA RTESPRVPSI 960 

ELRWDTSID SVTLAWTPVS RASSYILSWR PLRGPGQEVP GSPQTLPGIS SSQRVTGLEP 1020 

GVSYIFSLTP VLDGVRGPEA SVTQTPVCPR GLADWFLPH ATQDNAHRAE ATRRVLERLV 1080 

LAIiGPLGPQA VQVGLLSYSH RPSPLFPLNG SHDLGIILQR IRDMPYMDPS GNNLGTAWT 1140 

AHRYMLAPDA PGRRQHVPGV MVLLVDEPLR GDIFSPIREA QASGLNWML GMAGADPEQL 1200 

RRLAPGMDSV QTFFAVDDGP SLDQAVSGLA TALCQASFTT QPRPEPCPVY CPKGQKGEPG 1260 

EMGLRGQVGP PGDPGLPGRT GAPGPQGPPG SATAKGERGF PGADGRPGSP GRAGNPGTPG 1320 

APGLKGSPGL PGPRGDPGER GPRGPKGEPG APGQVIGGEG PGLPGRKGDP GPSGPPGPRG 1380 

PLGDPGPRGP PGLPGTAMKG OKGDRGERGP PGPGEGGIAP GEPGLPGLPG SPGPQGPVGP 1440 

PGKKGEKGDS EDGAPGLPGQ PGSPGEQGPR GPPGAIGPKG DRGFPGPLGE AGEKGERGPP 1500 

GPAGSRGLPG VAGRPGAXGP EGPPGPTGRQ GEKGEPGRPG DPAWGPAVA GPKGEXGDVG 1560 

PAGPRGATGV QGERGPPGLV LFGDPGPKGD PGDRGPIGLT GRAGPPGDSG PPGEKGDPGR 1620 

PGPPGPVGPR GRDGEVGEKG DEGPPGDPGL PGKAGERGLR GAPGVRGPVG EKGEQGDPGE 1680 

DGRNGSPGSS GPKGDRGEPG PPGPPGRLVD TGPGAREKGE PGDRGQEGPR GPKGDPGLPG 1740 

APGERGIEGF RGPPGPQGDP GVRGPAGEKG DRGPPGLDGR SGLDGKPGAA GPSGPNGAAG 1800 

KAGDPGRDGL PGLRGEQGLP GPSGPPGLPG KPGEDGKPGL NGKNGEPGDP GEDGRKGEKG 1860 

DSGASGREGR DGPKGERGAP GILGPQGPPG LPGPVGPPGQ GFPGVPGGTG PKGDRGBTGS 1920 

KGEQGLPGER GLRGEPGSVP NVDRLLETAG IKASALREIV ETWDESSGSF LP VPERRRGP 1980 

KGDSGEQGPP GKEGPIGFPG ERGLKGDRGD PGPQGPPGLA LGERGPPGPS GLAGEPGKPG 2040 

IPGLPGRAGG VGEAGRPGER GERGEKGERG EQGRDGPPGL PGTPGPPGPP GPKVSVDEPG 2100 

PGLSGEQGPP GLKGAKGEPG SNGDQGPKGD RGVPGIKGDR GEPGPRGQDG NPGLPGERGM 2160 

AGPEGKPGLQ GPRGPPGPVG GHGDPGPPGA PGLAGPAGPQ GPSGLKGEPG ETGPPGRGLT 2220 

GPTGAVGLPG PPGPSGLVGP QGSPGLPGQV GETGKPGAPG RDGASGKDGD RGSPGVPGSP 2280 

GLPGPVGPKG EPGPTGAPGQ AWGLPGAKG EKGAPGGLAG DLVGEPGAKG DRGLPGPRGE 2340 

KGBAGRAGEP GDPGEDGQKG APGPKGPKGD PGVGVPGSPG PPGPPGVKGD LGLPGLPGAP 2400 

GWGPPGQTG PRGEMGQPGP SGERGLAGPP GREG I PGP LG PPGPPGSVGP PGASGLKGDK 2460 

GDPGVGLPGP RGERGEPGZR GSDGRPGQEG PRGLTGPPGS RGERGBKGOV GSAGLKGDKG 2520 

DSAVILGPPG PRGAKGDMGE RGPRGLDGDK GPRGDNGD PG DKGSRGEPGD KG S AGLPGLR 2580 

GIjLGPQGOPG AAGIPGDPGS PGKDGVPGIR GEKGDVGFMG PRGLKGERGV KGACGLDGEK 2640 

GDKGEAGPPG RPGLAGHKGE MGEPGVPGQS GAPGKEGL I G PKGDRGFDGQ PGPKGDQGEK 2700 

GERGTPGIGG FPGPSGNDGS AGPPGPPGSV GPRGPEGLQG QKGERGPPGE RWGAPGVPG 2760 

APGERGEQGR PGPAGPRGEK GEAALTEDDI RGFVRQEMSQ HCACQGQPIA SGSRPLPSYA 2820 

ADTAGS Q LHA VPVLRVSHAE EEERVPPEDD EYSEYSEYSV EEYQDPEAPW DSDDPCSLPL 2880 

DEGSCTAYTL RWYHRAVTGS TEACHPFVYG GCGGNANRFG TREACERRCP PRWQSQGTG 2940 
TAQD 
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Seq ID NO: 64 DNA sequence 

Nucleic Acid Accession #> NM_006945 

Coding sequence* 1-219 

1 11 21 31 41 51 

i i i i i i 

ATGTCTTATC AACAGCAGCA GTGCAAGCAG CCCTGCCAGC CACCTCCTGT GTGCCCCACG 60 

CCAAAGTGCC CAGAGCCATG TCCACCCCCG AAGTGCCCTG AGCCCTGCCC ACCACCAAAG 120 

TGTCCACAGC CCTGCCCACC TCAGCAGTGC CAGCAGAAAT ATCCTCCTGT GACACCTTCC 180 
CCACCCTQCC AGCCAAAGTA TCCACCGAAG AGCAAGTAA 

Seq ID NO: 65 Protein sequence; 
Protein Accession #; NP_008876 

1 11 21 31 41 51 

I I I I I I 

MSYQQQQCKQ PCQPPPVCPT PKCPEPCPPP KCPEPCPPPK CPQPCPPQQC QQKYPPVTPS 60 
PPCQPKYPPK SK 

Seq ID NO: 66 DNA sequence 

Nucleic Acid Accession ft* NM_005629.1 

Coding sequence: 639-2546 

1 11 21 31 41 51 

I I I S I I 

TAGTCGGAGC GAGGTGGCGA GTCGCTGAGC CCGCCGCGGC CCCGAGAGCG GCTGCAGCCG 60 

CCGCCGCCGG GAAGGAGAGG GCGAGGCGCG CCCGAGCCGC CGCCGCCGCC GCCACCGCCG 120 

CCGCCGCCAC CACCGCCACC GGAGTCGCGG GCCAGCCGGG CAGCCTCCGC GGGCCCCGGC 180 

CGGGGCGGGG GGCGCGGGCC ACAGGCCCCT GCTCCGGCCG TCGTTTGCAG ACCGCGGGCG 240 

CCGATGTCGC CCGCGCCCCG TTAGGATGAG TCTCGGGTCG GGCGAGGAGC CGCCGCAGCC 300 

GCCGCCGCCC GAGCCGCGGG CAGGAGCCTC GGGAGCCGCC GCCGCCGCCG CCGCCGCCCG 360 

GCCGGGCCCC GACGCCGCCC GCGCGCCCCC GGGCCCCCGA CACACATGAG ATTCTTCAGG 420 

CTCACTTTCA AGTGCTTCGT GGACTGCTTC TGACTGCGCC GCCCGCGCCC CGCACCCCGC 480 

CGTCCGCCCG CCGCCCCGTC CCCCGGCCCG GCCGCCCCCC GGCCCCCGGC CGGCCCGCGC 540 

CCTCGGGGCC CTCCCCGGTG CCGCCGGTGC CCCCCGCCTG ACCGCCGCCC CCCGTGAGGC 600 

GCCGCGACCC CGGCCCGGCC GTGCGGCCCG CCGGGGCCAT GGCGAAGAAG AGCGCCGAGA 660 

ACGGCATCTA TAGCGTGTCC GGCGACGAGA AGAAGGGCCC CCTCATCGCG CCCGGGCCCG 720 

ACGGGGCCCC GQCCAAGGQC GACGGCCCCG TGGGCCTGGG GACACCCGGC GGCCGCCTGG 780 

CCGTGCCGCC GCGCGAGACC TGGACGCGCC AGATGGACTT CATCATGTCG TGCGTGGGCT 840 

TCGCCGTGGG CTTGGGCAAC GTGTGGCGCT TCCCCTACCT GTGCTACAAG AACGGCGGAG 900 

GTGTGTTCCT TATTCCCTAC GTCCTGATCG CCCTGGTTGG AGGAATCCCC ATTTTCTTCT 960 

TAGAGATCTC GCTGGGCCAG TTCATGAAGG CCGGCAGCAT CAATGTCTGG AACATCTGTC 1020 

CCCTGTTCAA AGGCCTGGGC TACGCCTCCA TGGTGATCGT CTTCTACTGC AACACCTACT 1080 

ACATCATGGT GCTGGCCTGG GGCTTCTATT ACCTGGTCAA GTCCTTTACC ACCACGCTGC 1140 

CCTGGGCCAC ATGTGGCCAC ACCTGGAACA CTCCCGACTG CGTGGAGATC TTCCGCCATG 1200 

AAGACTGTGC CAATGCCAGC CTGGCCAACC TCACCTGTGA CCAGCTTGCT GACCGCCGGT 1260 

CCCCTGTCAT CGAGTTCTGG GAGAACAAAG TCTTGAGGCT GTCTGGGGGA CTGGAGGTGC 1320 

CAGGGGCCCT CAACTGGGAG GTGACCCTTT GTCTGCTGGC CTGCTGGGTG CTGGTCTACT 1380 

TCTGTGTCTG GAAGGGGGTC AAATCCACGG GAAAGATCGT GTACTTCACT GCTACATTCC 1440 

CCTACGTGGT CCTGGTCGTG CTGCTGGTGC GTGGAGTGCT GCTGCCTGGC GCCCTGGATG 1500 

GCATCATTTA CTATCTCAAG CCTGACTGGT CAAAGCTGGG GTCCCCTCAG GTGTGGATAG 1560 

ATGCGGGGAC CCAGATTTTC TTTTCTTACG CCATTGGCCT GGGGGCCCTC ACAGCCCTGG 1620 

GCAGCTACAA CCGCTTCAAC AACAACTGCT ACAAGGACGC CATCATCCTG GCTCTCATCA 1680 

ACAGTGGGAC CAGCTTCTTT GCTGGCTTCG TGGTCTTCTC CATCCTGGGC TTCATGGCTG 1740 

CAGAGCAGGG CGTGCACATC TCCAAGGTGG CAGAGTCAGG GCCGGGCCTG GCCTTCATCG 1800 

CCTACCCGCG GGCTGTCACG CTGATGCCAG TGGCCCCACT CTGGGCTGCC CTGTTCTTCT 1860 

TCATGCTGTT GCTGCTTGGT CTCGACAGCC AGTTTGTAGG TGTGGAGGGC TTCATCACCG 1920 

GCCTCCTCGA CCTCCTCCCG GCCTCCTACT ACTTCCGTTT CCAAAGGGAG ATCTCTGTGG 1980 

CCCTCTGTTG TGCCCTCTGC TTTGTCATCG ATCTCTCCAT GGTGACTGAT GGCGGGATGT 2040 

ACGTCTTCCA GCTGTTTGAC TACTACTCGG CCAGCGGCAC CACCCTGCTC TGGCAGGCCT 2100 

TTTGGGAGTG CGTGGTGGTG GCCTGGGTGT ACGGAGCTGA CCGCTTCATG GACGACATTG 2160 

CCTGTATGAT CGGGTACCGA CCTTGCCCCT GGATGAAATG GTGCTGGTCC TTCTTCACCC 2220 

CGCTGGTCTG CATGGGCATC TTCATCTTCA ACGTTGTGTA CTACGAGCCG CTGGTCTACA 2280 

ACAACACCTA CGTGTACCCG TGGTGGGGTG AGGCCATGGG CTGGGCCTTC GCCCTGTCCT 2340 

CCATGCTGTG CGTGCCGCTG CACCTCCTGG GCTGCCTCCT CAGGGCCAAG GGCACCATGG 2400 

CTGAGCGCTG GCAGCACCTG ACCCAGCCCA TCTGGGGCCT CCACCACTTG GAGTACCGAG 2460 

CTCAGGACGC AGATGTCAGG GGCCTGACCA CCCTGACCCC AGTGTCCGAG AGCAGCAAGG 2520 

TCGTCGTGGT GGAGAGTGTC ATGTGACAAC TCAGCTCACA TCACCAGCTC ACCTCTGGTA 2580 

GCCATAGCAG CCCCTGCTTC AGCCCCACCG CACCCCTCCA GGGGGCCTGC CTTTCCCTGA 2640 

CACTTTTGGG GTCTGCCTGG GGGAGGAGGG GAGAAAGCAC CATGAGTGCT CACTAAAACA 2700 

ACTTTTTCCA TTTTTAATAA AACGCCAAAA ATATCACAAC CCACCAAAAA TAGATGCCTC 2760 

TCCCCCTCCA GCCCTAGCCG AGCTGGTCCT AGGCCCCGCC TAGTGCCCCA CCCCCACCCA 2820 

CAGTGCTGCA CTCCTCCTGC CCCTGCCACG CCCACCCCCT GCCCACCTCT CCAGGCTCTG 2880 

CTCTGCAGCA CACCCGTGGG TGACCCCTCA CCCCAGAAGC AGCAGTGGCA GCTTGGGAAA 2940 

TGTGAGGAAG GGAAGGAGGG AGAGACGGGA GGGAGGAGAG AGAGGAGAAG GGAGGCAGGG 3000 

GAGGGGCAGC AGAACCAAGG CAAATATTTC AGCTGGGCTA TACCCCTCTC CCCATCCCTG 3060 

TTATAGAAGC TTAGAGAGCC AGCCAGCAAT GGAACCTTCT GGTTCCTGCG CCAATCGCCA 3120 

CCAGTATCAA TTGTGTGAGC TTGGGTGCGA GTGCACGCGT GCGTGAGTAC GGAGAGTATA 3180 

TATAGATCTC TATCTCTTAG CAAAGGTGAA TGCCAGATGT AAATGGCGCC TCTGGGCAAA 3240 

GGAGGCTTGT ATTTTGCACA TTTTATAAAA ACTTGAGAGA ATGAGATTTC TGCTTGTATA 3300 

TTTCTAAAAA GAGGAAGGAG CCCAAACCAT CCTCTCCTTA CCACTCCCAT CCCTGTGAGC 3360 

CCTACCTTAC CCCTCTGCCC CTAGCCAAGG AGTGTGAATT TATAGATCTA ACTTTCATAG 3420 

GCAAAACAAA AGCTTCGAGC TGTTGCGTGT GTGAGTCTGT TGTGTGGATG TGCGTGTGTG 3480 

GTCCCCAGCC CCAGACTGGA TTGGAAAAGT GCATGGTGGG GGCCTCGGGG CTGTCCCCAC 3540 

GCTGTCCCTT TGCCACAAGT CTGTGGGGCA AGAGGCTGCA ATATTCCGTC CTGGGTGTCT 3600 

GGGCTGCTAA CCTGGCCTGC TCAGGCTTCC CACCCTGTGC GGGGCACACC CCCAGGAAGG 3660 

GACCCTGGAC ACGGCTCCCA CGTCCAGGCT TAAGGTGGAT GCACTTCCCG CACCTCCAGT 3720 
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CTTCTGTGTA GCAGCTTTAA CCCACGTTTG TCTGTCACGT CCAGTCCCGA GACGGCTGAG 3780 

TGACCCCAAG AAAGGCTTCC CCGACACCCA GACAGAGGCT GCRGGGCTGQ GGCTGGGTGA 3840 

GGGTGGCGGG CCTGCGGGGA CATTCTACTG TGCTAAAAAG CCACTGCAGA CATAGCAATA 3900 
AAAACATGTC ATTTTCC 

Seg ID NO i 67 Protein sequences 
Protein Accession ft: NP_005620.1 

1 n 21 31 41 51 

I I I I I I 

MARKS AENGI YSVSGDEKKG PLIAPGPDGA PAKGDGPVGL GTPGGRLAVP PRETWTRQMD 60 

PIMSCVGFAV GLGNVWRFPY LCYKNGGGVP LIPYVLIALV GGIPIPFLEI SLGQFMKAGS 120 

INVWNICPLP KGIiGYASMVI VPYGNTYYIM VLAWGFYYLV KSFTTTLPWA TCGHTWNTPD 180 

CVEIFRHBDC ANASLANLTC DQLADRRSPV IEFWENKVLR LSGGLEVPGA LNWEVTLCLL 240 

ACWVLVYFCV WKGVKSTGKI VYFTATFPYV VLWLLVRGV LLPGALDGI I YYLKPDWSKL 300 

GSPQVWIDAG TQIFFSYAIG LGALTALGSY NRFNNNCYKD AIILALINSG TSFFAGPWF 360 

SILGFMAAEQ GVHISKVAES GPGIAFIAYP RAVTLMPVAP LWAALFFFML LLLGLDSQFV 42 0 

GVEGFlTGLh DLLPASYYFR FQREISVALC CALCFVIDLS MVTDGGMYVF QLFDYYSASG 480 

TTLLWQAFWE CVWAWVYGA DRFMDDIACM IGYRPCPWMK WCWSFFTPLV CMGIFIFNW 540 

YYEPLVYNNT YVYPWWGEAM GWAFALSSML CVPLHLLGCL LRAKGTMAER WQHLTQPIWG 600 

LHHLEYRAQD ADVRGLTTLT PVSESSKWV VESVM 



Seq ID NO: 68 DNA sequence 

Nucleic Acid Accession #: NM_021953.1 

Coding sequence: 178-2469 

1 11 21 31 41 51 

1 1 1 I I I 

GGCACGAGGG GGACCOGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 
CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 
CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 
AAAGCTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 
AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 
AATCAAGCAG AGGCCTCCAA GGAAGTGGCG GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 
ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 
GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 
GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 
CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 
GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 
CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 
TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 
CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 
CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 
ATGGCCATGA TACAATTCGC CAT CAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCCACTGGA CCCAGGGTCT CCACAATTGC CCGAGCACTT GGAATCACAG 1200 

CAGAAACGAC CGAATCCAGA GCTCCGCCGG AACATGACCA TCAAAACCGA ACTCCCCCTG 1260 

GGCGCACGGC GGAAGATGAA GCCACTGCTA CCACGGGTCA GCTCATACCT GGTACCTATC 1320 

CAGTTCCCGG TGAACCAGTC ACTGGTGTTG CAGCCCTCGG TGAAGGTGCC ATTGCCCCTG 1380 

GCGGCTTCCC TCATGAGCTC AGAGCTTGCC CGCCATAGCA AGCGAGTCCG CATTGCCCCC 1440 

AAGGTGCTGC TAGCTGAGGA GGGGATAGCT CCTCTTTCTT CTGCAGGACC AGGGAAAGAG 1500 

GAGAAACTCC TGTTTGGAGA AGGGTTTTCT CCTTTGCTTC CAGTTCAGAC TATCAAGGAG 1560 

GAAGAAATCC AGCCTGGGGA GGAAATGCCA CACTTAGCGA GACCCATCAA AGTGGAGAGC 162 0 

CCTCCCTTGG AAGAGTGGCC CTCCCCGGCC CCATCTTTCA AAGAGGAATC ATCTCACTCC 1680 

TGGGAGGATT CGTCCCAATC TCCCACCCCA AGACCCAAGA AGTCCTACAG TGGGCTTAGG 1740 

TCCCCAACCC GGTGTGTCTC GGAAATGCTT GTGATTCAAC ACAGGGAGAG GAGGGAGAGG 180 0 

AGCCGGTCTC GGAGGAAACA GCATCTACTG CCTCCCTGTG TGGATGAGCC GGAGCTGCTC 1860 

TTCTCAGAGG GGCCCAGTAC TTCCCGCTGG GCCGCAGAGC TCCCGTTCCC AGCAGACTCC 1920 

TCTGACCCTG CCTCCCAGCT CAGCTACTCC CAGGAAGTGG GAGGACCTTT TAAGACACCC 1980 

ATTAAGGAAA CGCTGCCCAT CTCCTCCACC CCGAGCAAAT CTGTCCTCCC CAGAACCCCT 2040 

GAATCCTGGA GGCTCACGCC CCCAGCCAAA GTAGGGGGAC TGGATTTCAG CCCAGTACAA 2100 

ACCTCCCAGG GTGCCTCTGA CCCCTTGCCT GACCCCCTGG GGCTGATGGA TCTCAGCACC 2160 

ACTCCCTTGC AAAGTGCTCC CCCCCTTGAA TCACCGCAAA GGCTCCTCAG TTCAGAACCC 2220 

TTAGACCTCA TCTCCGTCCC CTTTGGCAAC TCTTCTCCCT CAGATATAGA CGTCCCCAAG 2280 

CCAGGCTCCC CGGAGCCACA GGTTTCTGGC CTTGCAGCCA ATCGTTCTCT GACAGAAGGC 2340 

CTGGTCCTGG ACACAATGAA TGACAGCCTC AGCAAGATCC TGCTGGACAT CAGCTTTCCT 2400 

GGCCTGGACG AGGACCCACT GGGCCCTGAC AACATCAACT GGTCCCAGTT TATTCCTGAG 2460 

CTACAGTAGA GCCCTGCCCT TGCCCCTGTG CTCAAGCTGT CCACCATCCC GGGCACTCCA 2520 

AGGCTCAGTG CACCCCAAGC CTCTGAGTGA GGACAGCAGG CAGGG ACTGT TCTGCTCCTC 2580 

ATAGCTCCCT GCTGCCTGAT T ATG CAAAAG TAGCAGTCAC ACCCTAGCCA CTGCTGGGAC 2640 

CTTGTGTTCC CCAAGAGTAT CTGATTCCTC TGCTGTCCCT GCCAGGAGCT GAAGGGTGGG 2700 

AACAACAAAG GCAATGGTGA AAAGAGATTA GGAACCCCCC AGCCTGTTTC CATTCTCTGC 2760 

CCAGCAGTCT CTTACCTTCC CTGATCTTTG CAGGGTGGTC CGTGTAAATA GTATAAATTC 2820 

TCCAAATTAT CCTCTAATTA TAAATGTAAG CTTATTTCCT TAGATCATTA TCCAGAGACT 2880 

GCCAGAAGGT GGGTAGGATG ACCTGGGGTT TCAATTGACT TCTGTTCCTT GCTTTTAGTT 2940 

TTGATAGAAG GGAAGACCTG CAGTGCACGG TTTCTTCCAG GCTGAGGTAC CTGGATCTTG 3000 

GGTTCTTCAC TGCAGGGACC CAGACAAGTG GATCTGCTTG CCAGAGTCCT TTTTGCCCCT 3 060 

CCCTGCCACC TCCCCGTGTT TCCAAGTCAG CTTTCCTGCA AGAAGAAATC CTGGTTAAAA 3120 

AAGTCTTTTG TATTGGGTCA GGAGTTGAAT TTGGGGTGGG AGGATGGATG CAACTGAAGC 3180 

AGAGTGTGGG TGCCCAGATG TGCGCTATTA GATGTTTCTC TGATAATGTC CCCAATCATA 3240 

CCAGGGAGAC TGGCATTGAC GAGAACTCAG GTGGAGGCTT GAGAAGGCCG AAAGGGCCCC 33 00 

TGACCTGCCT GGCTTCCTTA GCTTGCCCCT CAGCTTTGCA AAGAGCCACC CTAGGCCCCA 3360 

GCTGACCGCA TGGGTGTGAG CCAGCTTGAG AACACTAACT ACTCAATAAA AGCGAAGGTG 3420 
GACCNAAAAA AAAAAAAAAA AAAA 
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Seq ID NO: 69 Protein sequence: 
Protein Accession ft: NP_068772«1 

1 11 21 

I I I 

MKASPRRPLI LKRRRLPLFV QNAPSETSEE 
GIKIINHPTM PNTQWAIPN NANIHSIITA 
LRPQTQTSYD AKRTEVTLET LGPKPAARDV 
LSNIQWLRKM SSDGLGSRSI KQEMEEKENC 
YMAMIQFAIN STERKRMTLK DIYTWIEDHP 
ANGKVSFWTI HPSANRYLTL DQVFKPLDPG 
LGARRKMKPL LPRVSSYLVP IQFPVNQSLV 
PKVLLAEEGI APLSSAGPGK EEKLLFGEGP 
SPPLEEWPSP APSFKEESSH SWEDSSQSPT 
RSRSRRKQHL LPPCVDEPEL LFSEGPSTSR 
PIKETLPISS TPSKSVLPRT PESWRLTPPA 
TTPLQSAPPL ESPQRLLSSE PLDLISVPFG 
GLVI/DTMNDS LSKIMJ5ISF PGLDEDPLGP 



31 41 51 

I i I 

EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

LTAKGKE8GS SGPNKFILIS CGGAPTQPPG 12 0 

NLPRPPGALC EQKRETCADG EAAGCTINNS 180 

HLEQRQVKVE BPSRPSASWQ NSVSERPPYS 240 

PYFKHIAKPG WKNSIRHNLS LHEMFVRETS 300 

SPQLPEHLES QQKRPNPELR RNMTIKTELP 360 

LQPSVKVPLP LAASLMSSBL ARHSKRVRIA 420 

SPLLPVQTIK EEEIQPGEEM PHLARPIKVE 480 

PRPKKSYSGL RSPTRCVSEM LVIQHRERRE 540 

WAAELPFPAD SSDPASQLSY SQEVGGPFKT 600 

KVGGLDFSPV QTSQGASDPL PDPLGLMDLS 660 

NSSPSDIDVP KPGSPEPQVS GLAANRSLTE 720 
DNINWSQFIP BLQ 



Seq ID NO i 70 DNA sequence 

Nucleic Acid Accession #: BC006529.1 

Coding sequence: 178-2424 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 

AAAACTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 

AATCAAGCAG AGGCCTCCAA GGAAGTGGCA GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CT ACT CTT AC 900 

ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCAGCAGAA ACGACCGAAT CCAGAGCTCC GCCGGAACAT GACCATCAAA 1200 

ACCGAACTCC CCCTGGGCGC ACGGCGGAAG ATGAAGCCAC TGCTACCACG GGTCAGCTCA 1260 

TACCTGGTAC CTATCCAGTT CCCGGTGAAC CAGTCACTGG TGTTGCAGCC CTCGGTGAAG 1320 

GTGCCATTGC CCCTGGCGGC TTCCCTCATG AGCTCAGAGC TTGCCCGCCA TAG CAAGCGA 1380 

GTCCGCATTG CCCCCAAGGT GCTGCTAGCT GAGGAGGGGA TAGCTCCTCT TTCTTCTGCA 1440 

GGACCAGGGA AAGAGGAGAA ACTCCTGTTT GGAGAAGGGT TTTCTCCTTT GCTTCCAGTT 1500 

CAGACTATCA AGGAGGAAGA AATCCAGCCT GGGGAGGAAA TGCCACACTT AGCGAGACCC 1560 

ATCAAAGTGG AGAGCCCTCC CTTGGAAGAG TGGCCCTCCC CGGCCCCATC TTTCAAAGAG 1620 

GAATCATCTC ACTCCTGGGA GGATTCGT CC CAATCTCCCA CCCCAAGACC CAAGAAGTCC 1680 

TACAGTGGGC TTAGGTCCCC AACCCGGTGT GTCTCGGAAA TGCTTGTGAT TCAACACAGG 1740 

GAGAGGAGGG AGAGGAGCCG GTCTCGGAGG AAA CAG CATC TACTGCCTCC CTGTGTGGAT 1800 

GAGCCGGAGC TGCTCTTCTC AGAGGGGCCC AGTACTTCCC GCTGGGCCGC AGAGCTCCCG 1860 

TTCCCAGCAG ACTCCTCTGA CCCTGCCTCC CAGCTCAGCT ACTCCCAGGA AGTGGGAGGA 1920 

CCTTTTAAGA CACCCATTAA GGAAACGCTG CCCATCTCCT CCACCCCGAG CAAATCTGTC 1980 

CTCCCCAGAA CCCCTGAATC CTGGAGGCTC ACGCCCCCAG CCAAAGTAGG GGGACTGGAT 2040 

TTCAGCCCAG TACAAACCCC CCAGGGTGCC TCTGACCCCT TGCCTGACCC CCTGGGGCTG 2100 

ATGGATCTCA GCACCACTCC CTTGCAAAGT GCTCCCCCCC TTGAATCACC GCAAAGGCTC 2160 

CTCAGTTCAG AACCCTTAGA CCTCATCTCC GTCCCCTTTG GCAACTCTTC TCCCTCAGAT 2220 

ATAGACGTCC CCAAGCCAGG CTCCCCGGAG CCACAGGTTT CTGGCCTTGC AGCCAATCGT 2280 

TCTCTGACAG AAGGCCTGGT CCTGGACACA ATGAATGACA GCCTCAGCAA GATCCTGCTG 2340 

GACATCAGCT TTCCTGGCCT GGACGAGGAC CCACTGGGCC CTGACAACAT CAACTGGTCC 2400 

CAGTTTATTC CTGAGCTACA GTAGAGCCCT GCCCTTGCCC CTGTGCTCAA GCTGTCCACC 2460 

ATCCCGGGCA CTCCAAGGCT CAGTGCACCC CAAGCCTCTG AGTGAGGACA GCAGGCAGGG 2520 

ACTGTTCTGC TCCTCATAGC TCCCTGCTGC CTGATTATGC AAAAGTAGCA GTCACACCCT 2580 

AGCCACTGCT GGGACCTTGT GTTCCCCAAG AGTATCTGAT TCCTCTGCTG TCCCTGCCAG 2640 

GAGCTGAAGG GTGGGAACAA CAAAGGCAAT GGTGAAAAGA GATTAGGAAC CCCCCAGCCT 2700 

GTTTCCATTC TCTGCCCAGC AGTCTCTTAC CTTCCCTGAT CTTTGCAGGG TGGTCCGTGT 2760 

AAATAGTATA AATTCTCCAA ATTATCCTCT AATTATAAAT GTAAGCTTAT TTCCTTAGAT 2820 

CATTAT CCAG AGACTGCCAG AAGGTGGGTA GGATGACCTG GGGTTTCAAT TGACTTCTGT 2880 

TCCTTGCTTT TAGTTTTGAT AGAAGGGAAG ACCTGCAGTG CACGGTTTCT TCCAGGCTGA 2940 

GGTACCTGGA TCTTGGGTTC TTCACTGCAG GGACCCAGAC AAGTGGATCT GCTTGCCAGA 3000 

GTCCTTTTTG CCCCTCCCTG CCACCTCCCC GTGTTTCCAA GTCAGCTTTC CTGCAAGAAG 3060 

AAATCCTGGT TAAAAAAGTC TTTTGTATTG GGTCAGGAGT TGAATTTGGG GTGGGAGGAT 3120 

GGATGCAACT GAAGCAGAGT GTGGGTGCCC AGATGTGCGC TATTAGATGT TTCTCTGATA 3180 

ATGTCCCCAA TCATACCAGG GAGACTGGCA TTGACGAGAA CTCAGGTGGA GGCTTGAGAA 3240 

GGCCGAAAGG GCCCCTGACC TGCCTGGCTT CCTTAGCTTG CCCCTCAGCT TTGCAAAGAG 3300 

CCACCCTAGG CCCCAGCTGA CCGCATGGGT GTGAGCCAGC TTGAGAACAC TAACTACTCA 3360 
ATAAAAGCGA AGGTGGAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 71 Protein sequence: 
Protein Accession #: AAH06529.1 
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1 11 21 

I 1 I 

MKTSPRRPLI LKRRRLPLPV QNAPSETSEE 
GIKIINHPTM PNTQWAIPN NANIHSIITA 
LRPQTQTSYD AKRTEVTLET LQPKPAARDV 
LSNIQWLRKM SSDGLGSRSI KQEMEEKENC 
YMAMIQPAIN STEFUCRMTLK DIYTWIEDHF 
ANGKVSFWTI HPSANRYLTL DQVFKQQKRP 
SYLVPIQFPV NQSLVLQPSV KVPLPLAASL 
AGPGKEEKLL FGEGFSPLLP VQTIKEEEIQ 
EESSHSWEDS SQSPTPRPKK SYSGLRSPTR 
DEPELLFSEG PSTSRWAAEL PFPADSSDPA 
VLPRTPESWR LTPPAKVGGL DFSPVQTPQG 
LLSSEPLDLI SVPFGNSSPS DIDVPKPGSP 
LDISFPGLDE DPLGPDNINW SQFIPELQ 



31 41 51 

I ! I 

EPKRSPAQQE SKQAEASKEV AESNSCKFPA 60 

LTAKGKESGS SGPNKFIIilS CGGAPTQPPG 120 

NLPRPPGALC EQKRETCADG EAAGCTINNS 1B0 

HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 300 

NPELRRNMTI KTELPLGARR KMKPLLPRVS 360 

MSSELARHSK RVRIAPKVLL AEEGIAPLSS 420 

PGEEMPHLAR PIKVESPPLE EWPSPAPSFK 480 

CVSEMLVIQH RERRERSRSR RKQHLLPPCV 540 

SQLSYSQEVG GPFKTPIKET IiPISSTPSKS 600 

ASDPLPDPLG LMDLSTTPLQ SAPPLESPQR 660 

EPQVSGIAAN RSLTEGLVLD TMNDSLSKIL 720 



Seg ID NO: 72 DNA sequence 
Nucleic Acid Accession #: U74612.1 
Coding sequence t 178-2583 

1 11 21 31 41 51 

I I I I 1 I 

GGCACGAGGQ GGACCCGGCC GGTCCGQCGC QAQCCCCCGT CCGGGGCCCT GGCTCQGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 

AAAACTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA ' 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 

AATCAAGCAG AGGCCTCCAA GGAAGTGGCA GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCCACTGGA CCCAGGGTCT CCACAATTGC CCGAGCACTT GGAATCACAG 1200 

CAGAAACGAC CGAATCCAGA GCTCCGCCGG AACATGACCA TCAAAACCGA ACTCCCCCTG 1260 

GGCGCACGGC GGAAGATGAA GCCACTGCTA CCACGGGTCA GCTCATACCT GGTACCTATC 1320 

CAGTTCCCGG TGAACCAGTC ACTGGTGTTG CAGCCCTCGG TGAAGGTGCC ATTGCCCCTG 1380 

GCGGCTTCCC TCATGAGCTC AGAGCTTGCC CGCCATAGCA AGCGAGTCCG CATTGCCCCC 1440 

AAGGTTTTTG GGGAACAGGT GGTGTTTGGT TACATGAGTA AGTTCTTTAG TGGCGATCTG 1500 

CGAGATTTTG GTACACCCAT CACCAGCTTG TTTAATTTTA TCTTTCTTTG TTTATCAGTG 1560 

CTGCTAGCTG AGGAGGGGAT AGCTCCTCTT TCTTCTGCAG GACCAGGGAA AGAGGAGAAA 1620 

CTCCTGTTTG GAGAAGGGTT TTCTCCTTTG CTTCCAGTTC AGACTATCAA GGAGGAAGAA 1680 

ATCCAGCCTG GGGAGGAAAT GCCACACTTA GCGAGACCCA TCAAAGTGGA GAQCCCTCCC 1740 

TTGGAAGAGT GGCCCTCCCC GGCCCCATCT TTCAAAGAGG AATCATCTCA CTCCTGGGAG 1800 

GATTCGTCCC AATCTCCCAC CCCAAGACCC AAGAAGTCCT ACAGTGGGCT TAGGTCCCCA 1860 

ACCCGGTGTG TCTCGGAAAT GCTTGTGATT CAACACAGGG AGAGGAGGGA GAGGAGCCGG 1920 

TCTCGGAGGA AACAGCATCT ACTGCCTCCC TGTGTGGATG AGCCGGAGCT GCTCTTCTCA 1980 

GAGGGGCCCA GTACTTCCCG CTGGGCCGCA GAGCTCCCGT TCCCAGCAGA CTCCTCTGAC 2040 

CCTGCCTCCC AGCTCAGCTA CTCCCAGGAA GTGGGAGGAC CTTTTAAGAC ACCCATTAAG 2100 

GAAACGCTGC CCATCTCCTC CACCCCGAGC AAATCTGTCC TCCCCAGAAC CCCTGAATCC 2160 

TGGAGGCTCA CGCCCCCAGC CAAAGTAGGG GGACTGGATT TCAGCCCAGT ACAAACCTCC 2220 

CAGGGTGCCT CTGACCCCTT GCCTGACCCC CTGGGGCTGA TGGATCTCAG CACCACTCCC 2280 

TTGCAAAGTG CTCCCCCCCT TGAATCACCG CAAAGGCTCC TCAGTTCAGA ACCCTTAGAC 2340 

CTCATCTCCG TCCCCTTTGG CAACTCTTCT CCCTCAGATA TAGACGTCCC CAAGCCAGGC 2400 

TCCCCGGAGC CACAGGTTTC TGGCCTTGCA GCCAATCGTT CTCTGACAGA AGGCCTGGTC 2460 

CTGGACACAA TGAATGACAG CCTCAGCAAG ATCCTGCTGG ACATCAGCTT TCCTGGCCTG 2520 

GACGAGGACC CACTGGGCCC TGACAACATC AACTGGTCCC AGTTTATTCC TGAGCTACAG 2580 

TAGAGCCCTG CCCTTGCCCC TGTGCTCAAG CTGTCCACCA TCCCGGGCAC TCCAAGGCTC 2640 

AGTGCACCCC AAGCCTCTGA GTGAGGACAG CAGGCAGGGA CTGTTCTGCT CCTCATAGCT 2700 

CCCTGCTGCC TGATTATGCA AAAGTAGCAG TCACACCCTA GCCACTGCTG GGACCTTGTG 2760 

TTCCCCAAGA GTATCTGATT CCTCTGCTGT CCCTGCCAGG AGCTGAAGGG TGGGAACAAC 2B20 

AAAGGCAATG GTGAAAAGAG ATTAGGAACC CCCCAGCCTG TTTCCATTCT CTGCCCAGCA 2880 

GTCTCTTACC TTCCCTGATC TTTG CAGGGT GGTCCGTGTA AATAGTATAA ATTCTCCAAA 2 940 

TTATCCTCTA ATTATAAATG TAAGCTTATT TCCTTAGATC ATTATCCAGA GACTGCCAGA 3000 

AGGTGGGTAG GATGACCTGG GGTTTCAATT GACTTCTGTT CCTTGCTTTT AGTTTTGATA 3060 

GAAGGGAAGA CCTGCAGTGC ACGGTTTCTT CCAGGCTGAG GTACCTGGAT CTTGGGTTCT 3120 

TCACTGCAGG GACCCAGACA AGTGGATCTG CTTGCCAGAG TCCTTTTTGC CCCTCCCTGC 3180 

CACCTCCCCG TGTTTCCAAG TCAGCTTTCC TGCAAGAAGA AATCCTGGTT AAAAAAGTCT 3240 

TTTGTATTGG GTCAGGAGTT GAATTTGGGG TGGGAGGATG GATGCAACTG AAGCAGAGTG 3300 

TGGGTGCCCA GATGTGCGCT ATTAGATGTT TCTCTGATAA TGTCCCCAAT CATACCAGGG 3360 

AGACTGGCAr TGACGAGAAC TCAGGTGGAG GCTTGAGAAG GCCGAAAGGG CCCCTGACCT 3420 

GCCTGGCTTC CTTAGCTTGC CCCTCAGCTT TGCAAAGAGC CACCCTAGGC CCCAGCTGAC 3480 

CGCATGGGTG TGAGCCAGCT TGAGAACACT AACTACTCAA TAAAAGCGAA GGTGGACAAA 3540 
AAAAAAAAAA AAAAA 

Seq ID NO i 73 Protein sequence: 
Protein Accession #: AAC51128.1 
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1 11 21 31 41 51 

I I I I I I 

MKTSPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

5 GIKIINHPTM PNTQWAIPN NANIHSIITA LTAKGKESGS SGPNKPILIS CGGAPTQPPG 120 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRETCADG EAAGCT INNS 180 

LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQPAIN STERKRMTLK DIYTWIEDHF PYFKKIAKPG WKNSIRHNLS LHDMFVRETS 300 

ANGKVSFWTI HPSANRYLTL DQVFKPLDPG SPQLPEHLES QQKRPNPELR RNMTIKTELP 360 

10 LGARRKMKPL LPRVSSYLVP IQFPVNQSLV LQPSVKVPLP LAASLMSSEL ARHSKRVRIA 420 

PKVPGEQWP GYMSKFFSGD LRDFGTPITS LFNFIFLCLS VLLAEEGIAP LSSAGPGKEE 480 

KLLFGEGFSP LLPVQTIKEE EIQPGEEMPH LARPIKVESP PLEEWPSPAP SFKEESSHSW 540 

EDSSQSPTPR PKKSYSGLRS PTRCVSEMLV IQKRERRERS RSRRKQHLLP PCVDEPELLF 600 

SEGPSTSRWA AELPFPADSS DPASQLSYSQ EVGGPFKTPI KETLPISSTP SKSVLPRTPE 660 

15 SWRLTPPAKV GGLDFSPVQT SQGASDPLPD PLGLMDLSTT PLQSAPPLES PQRLLSSEPL 720 

DLISVPFGNS SPSDIDVPKP GSPEPQVSGL AANRSLTEGL VLDTMNDSLS KILLDISFPG 780 
LDEDPLGPDN INVJSQFIPEL Q 

Seq ID NO i 74 DNA sequence 
20 Nucleic Acid Accession tti Eos sequence 
Coding sequence: 111-416 

1 11 21 31 41 51 

oc I 1 1 I I ' 

Zj GGGAAGAGCC AGGCTGAGCC TTATAAAGGA CTGCTCTTTG TCCAAACACA CACATCTCAC 60 

TCATCCTTCT ACTCGTGACG CTTCCCAGCT CTGGCTTTTT GAAAGCAAAG ATGAGCAACA 120 

CTCAAGCTGA GAGGTCCATA ATAGGCATGA TCGACATGTT TCACAAATAC ACCAGACGTG 180 

ATGACAAGAT TGAGAAGCCA AGCCTGCTGA CGATGATGAA GGAGAACTTC CCCAACTTCC 240 

TTAGTGCCTG TGACAAAAAG GGCACAAATT ACCTCGCCGA TGTCTTTGAG AAAAAGGACA 300 

30 AGAATGAGGA TAAGAAGATT GATTTTTCTG AGTTTCTGTC CTTGCTGGGA GACATAGCCA 360 

CAGACTACCA CAAGCAGAGC CATGGAGCAG CGCCCTGTTC CGGGGGCAGC CAGTGACCCA 420 
GCCCCACCAA TGGGCCTCCA GAGACCCCAG GAACAATAAA ATGTCTTCTC CCACCAGA 

Seq ID NO: 75 Protein sequence: 
35 Protein Accession #t Eos sequence 

1 11 21 31 41 51 

I | I I I I 

MSNTQAERSI IGMIDMFHKY TRRDDKIEKP SLLTMMKENF PNFLSACDKK GTNYIiADVFE 60 
40 KKDKNEDKKI DPSEFLSLLG DIATDYHKQS HGAAPCSGGS Q 

Seq ID NO: 76 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
45 Coding sequence: 111-416 

1 11 21 31 41 51 

I I II I I 

GGGAAGAGCC AGGCTGAGCC TTATAAAGGA CTGCTCTTTG TCCAAACACA CACATCTCAC 60 

50 TCATCCTTCT ACTCGTGACA CTTCCCAGTT CTGGCTTTTT GAAAGCAAAG ATGAGCAACA 120 

CTCAAGCTGA GAGGTCCATA ATAGGCATGA TCGACATGTT TCACAAATAC ACCGGACGTG 180 

ATGGCAAGAT TGAGAAGCCA AGCCTGCTGA CGATGATGAA GGAGAACTTC CCCAATTTCC 240 

TCAGTGCCTG TGACAAAAAG GGCATACATT ACCTCGCCAC TGTCTTTGAG AAAAAGGACA 300 

AGAATGAGGA TAAGAAGATT GATTTTTCTG AGTTTCTGTC CTTGCTGGGA GACATAGCCG 3 60 

55 CAGACTACCA CAAGCAGAGC CATGGAGCGG CGCCCTGTTC TGGGGGAAGC CAGTGATCCA 420 
GCCCCACCAA GGGGCCTCCA GAGACCCCAG GAACAATAAG TGTCTCCTCC CACCAGA 



60 
65 
70 



Seq ID NO: 77 Protein sequence: 
Protein Accession #j XP_048124.l 

1 11 21 31 41 51 

I I I I 1 I 

MSNTQAERSI IGMIDMFHKY TGRDGKIEKP SLLTMMKENF PNFLSACDKK GIHYLATVFE 60 
KKDKNEDKKI DFSEFLSLLG DIAADYHKQS HGAAPCSGGS Q 

Seq ID NO: 78 DNA sequence 
Nucleic Acid Accession #: Z73678.1 
Coding sequence: 253-2433 



1 11 21 31 41 51 

I I I I I I 

GGGGTGGTGC AGGGCAGGGG TGGTATATCC TGTCTGACGG AGGGCGGGCC TCGCCAGTGC 60 

CAGAGAGGGA CGAACCAGGG TGGAAGCGCC AGGAGCAGCT GCAGGGAGCC CTCACGCGGA 120 

75 CCTCGCACTC TATGGCCGTA GGGAGCCGCT GAGAGCGAGA AGAGCACGCT CCTGCCCGCC 180 

CGCTGCACCG CACCTCGCCT CGCCTCTCTG CTCTCCTAGG CCCCGGCCGC GCGCCACCCG 240 

CCTCCCGCCA CCATGAACCA CTCGCCGCTC AAGACCGCCT TGGCGTACGA ATGCTTCCAG 300 

GACCAGGACA ACTCCACGTT GGCTTTGCCG TCGGACCAAA AGATGAAAAC AGGCACGTCT 360 

GGCAGGCAGC GCGTGCAGGA GCAGGTGATG ATGACCGTCA AGCGGCAGAA GTCCAAGTCT 420 

80 TCCCAGTCGT CCACCCTGAG CCACTCCAAT CGAGGTTCCA TGTATGATGG CTTGGCTGAC 480 

AATTACAACT ATGGGACCAC CAGCAGGAGC AGCTACTACT CCAAGTTCCA GGCAGGGAAT 540 

GGCTCATGGG GATATCCGAT CTACAATGGA ACCCTCAAGC GGGAGCCTGA CAACAGGCGC 600 

TTCAGCTCCT ACAGCCAGAT GGAGAACTGG AGCCGGCACT ACCCCCGGGG CAGCTGTAAC 660 

ACCACCGGCG CAGGCAGCGA CATCTGCTTC ATGCAGAAAA TCAAGGCGAG CCGCAGTGAG 720 

85 CCCGACCTCT ACTGTGACCC ACGGGGCACC CTGCGCAAGG GCACGCTGGG CAGCAAGGGC 780 

CAGAAGACCA CCCAGAACCG CTACAGCTTT TACAGCACCT GCAGTGGTCA GAAGGCCATA 840 

AAGAAGTGCC CTGTGCGCCC GCCCTCTTGT GCCTCCAAGC AGGACCCTGT GTATATCCCG 900 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



CCCATCTCCT 
AGTGAGGACA 
CAGGATGAGA 
GAATCTGCCA 
CGCAGCCCCA 
AGGAGCACCA 
CTCCTGAGGA 
CTGTCTTCCA 
GACCGCGTCA 
GTGGTGGACC 
GATGCAGGCC 
GTCCAGAACT 
GTTCTGCACA 
GAGTATAACG 
AGCGACAAGA 
AAGGGCAGCG 
AAGAGCAAGA 
AGCAAGGGGC 
CCACAAATTG 
CTCCTGAGCA 
CCGGAGGTGA 
TTGTCCTCGG 
CAGTACTTCT 
CCCAAGGCCG 
CAGGGTGTCC 
AACAGCCTCA 
TGCAGGAAGA 
CATCCTGTGC 
ATAGTGGAAA 
AGGGAGGTTG 
TTTCTTCCTC 
GCGTGCATGT 
TAAGCTATTT 
TAGGCAAGGA 
CAGGGCAAGG 
AGTGGTCTCC 
GCTTTCCACG 
ACCCCTCCAG 
AGGACACTCT 
GGTGCCTTCC 
TGCATTCAGA 
TTAAGAAAAG 
TCATATTCCC 
GTGGGATGGA 
AGTCAATTGA 
AGGAAAGGAA 
CCATGCCCCA 
ATGTGATAAA 
TCAGACCAGC 
CCTTGGTGTA 
TCTACCTCCA 
TCTCTAGAAG 
TCCTCTTCTG 
AACACTGATG 
TGTCCTCCGA 
ATACCCTCCC 
AGGGACCCAC 
CTTTGTTGTC 
TCTGTGCGGG 
CCCGTCCTCA 
GGGCTCCCCC 
CATGTGAACC 
GCATCCCCTC 
CCCTGCCCTG 
AGGGCTGACT 
CTGACCCAGG 
AGAACAAGGA 
ATTGGCCTCA 
TGAGGAAGGA 
GTCATTGGCC 
CCTCCTGGGG 
ACTGGCTGGG 
TGTGAGGGAT 
AACCTTGTGC 
GGAAATAGAG 



GCAACAAGGA 
TCGAGTGCAG 
AGTACCAGGC 
AGCAACAGGT 
ACCAGAACGT 
CCAACAAGCT 
GAACCGGGAA 
CTGACGAGCT 
TCATTCCCTT 
CTGAGGTCTT 
GCCAGACCAT 
GTGTAGCGGC 
ACCTCTCCTA 
CCCGCAACGC 
TGATGAACAA 
GCTGGTTGTA 
AAGATGCTAC 
TGATGTCCAG 
CCOGCCTCCT 
ACATGTCCCG 
CCAGGCTCCT 
CCTGCTACAC 
CCAGCAGCAT 
CAGAAGCTGC 
TCAGACAGCA 
GGAACTTCAC 
TATGACCCAG 
AGTATTTGGG 
GATTTTTAGA 
GGGGGGGGGG 
TGAGAAATGG 
GTGTGTGTGT 
TGTTGCAGCT 
CAACATGTGC 
GGQCCATCAC 
AAGGGTGCAG 
AAGGGAGGTC 
CAGCGCCACA 
CGCATACTTT 
ACCCTCT TGG 
GGTCTTGTAA 
CTATTTTATT 
AGTGGAGAGG 
GTTGGGGTAT 
GGGAGTGTTT 
AGTGCCATAT 
CTTCCCCTGA 
CAGGGCTATT 
AGCAGCAGGA 
CACTAAGACA 
TGCTAGCATT 
AATATAAGAT 
CACCACCCCC 
TGGACTCAGT 
GAGGTGTCCC 
TTGCTTCTGT 
GTGGGAGCCT 
ATCAGAACCC 
GCAGTGTCCT 
TGCGGCTTAT 
GGCTGGGGGC 
TTACCAGCCC 
GCCTCATGTT 
GGAACCAGGC 
TTGGTGACAC 
CCTGTCACTT 
GCTTGCCAGG 
CACTAGCCTA 
CACAGACTCT 
ACCAGCCACC 
ACCCAAGAGG 
AGGACCAAGA 
CTTGQAGCTT 
OGGCCAGGCC 
AAAATCAATA 



CCTGTCCTTT 
TGGGCTGACC 
CATTGGGGCC 
CTATCAGCTG 
CCAGCAGGCC 
GGAGACCCGG 
CGCCGAGATC 
GAAGGAGGAA 
CTCTGGCTGG 
CTTCAATGCC 
GCGTAACTAC 
CAGCCGCTGT 
CGGCCTGGAC 
CTACACCGAG 
CAACTATGAC 
CCATT CAGAT 
CCTGGAGGCC 
TGG CAT GAG C 
GCAATCTGGC 
CCACCCTCTG 
CACCAGCCAC 
TGTGAGGAAC 
GCTCAACAAC 
CCGGCTTCTC 
AGGTTTCGAT 
CTCCCGATTC 
CTGAGAAGCC 
AAAGTTCACA 
TTTTTTTTTT 
GGCTTTCTTG 
TATATATATG 
GAGTGTCTTA 
CATAAGGTGG 
TTTTTGGTGA 
TGCAGTCAGG 
GAGTAACTGG 
TGGTGTATCT 
AGGACTGAGG 
GCCAAATGAG 
GCTGTGTGCC 
TCTACTTGTT 
GCAGCTCTTT 
GGAACAAGTG 
AGAAATTAAC 
GGGTCCCAGG 
CAGGGTACCG 
CCCCAGCTGT 
AGGGGTATCA 
GGGCTCCCGA 
GAGCAGAGCC 
GCTGGTGTTA 
GCTCCTCCTC 
GAGTCCAAAC 
ATGACAACTG 
CGGCTGTTAG 
TCACACTGGG 
GGATCCCTGG 
AGAGGAATTC 
AAGCACTTAG 
GTTTTCTGGA 
TGTGCAGTCC 
AGGCGGATGC 
CCTCCCACCT 
AGCCTTCCAG 
TGCCCATTCC 
TGAGAGGGGC 
CTTCATGTAG 
GGCCAATGCC 
GCCCTGGGAT 
TCTGCAGTGG 
CAGTGTTGCT 
CTGCGGCTGG 
CCCTGTAGCC 
CAGTTTOCTT 
AATTGCTAGT 



GGCCACTCTA 
ATCCCCAAGG 
TATTACATCC 
GGAGGCATCT 
GCGGCAGGGG 
AGGCAGAATG 
CAGAAGCAGC 
CTCATTGCCG 
TGCGATGGCA 
ACAGGCTGCT 
TCAGGGCTCA 
GACGACAAGT 
GCCGAGGTGC 
AAGTCCTCCA 
TGCCCCCTGC 
GCCATCCGCA 
TGTGCTGGTG 
CAGTTGATTG 
AACTCTGATG 
CTGCACAGAG 
ACTGGCAATA 
CTGATGGCCT 
ATCATCAACC 
CTGTCTGACA 
AGGAACATGC 
TAAGAAGAGA 
CTCAGGCCTC 
AGAAACTGAG 
CCTTGGGGAA 
AGTTAAAGGG 
TGTCTAATGT 
AAGCATAACC 
TGAAAAGGAC 
GCTGCTCATA 
CCCTCAGAGG 
GGCTGGGCCA 
CATGGGAGAA 
TTGGGTAGGT 
GCCTGCTCAG 
CATCAGAGCA 
G CAGG AG AAA 
CCCAAGAGCT 
GGGCTGGGCA 
CAGGAAGATG 
AGACTTGGAC 
GTACCGGCAA 
CTTGTCTCCA 
GCCACGTCGA 
GGGCCTTATG 
CAGCGCTCCC 
GAGAGGAATT 
CTCACCCCTT 
CCACCTCTTG 
AGATGGGGGA 
CCAGCTGTGC 
AGGCCCACTC 
ACTGTCCTGG 
TTCTCCTAAA 
ACTACATCAG 
GGAAAGTGGA 
GGTCAGGGCG 

TCAAAGAATG 
ACCTCAGGGG 
CTCTCAGGCC 
AAAACTGAGA 
CCGACACACG 
CAAAATAAGG 
CTCCTGTGCT 
GGACCACACT 
GTCTGCGTGT 
GGTGGGCAGG 
CACCTTCCCC 
GTGTGATACA 
GTTTCTTTGA 



GGGCCAGCTC 
CTGTGCAGTA 
AGCATACCTG 
GCAAGCTGGT 
CCCTGCGCAA 
GGATCCGCGA 
TGACTGGGCT 
ACGCCCTGCC 
ATAGCAACAT 
TGAGGAACCT 
TTGATTCCCT 
CTGTGGAAAA 
CCACCCGCTA 
CTGGCTGCTT 
CTGAGGAAGA 
CCTACCTGAA 
CCCTGCAGAA 
GGCTGAAGGA 
TGGTGCGGTC 
TGATGGGGAA 
CCAGCAACTC 
CGCAGCCACA 
TGTGCCGAAG 
TGTGGTCCAG 
TGGGAACCTT 
CTGTCCAAGC 
GCTGGATGGG 
AAGAAACCTA 
ACTGGCAGGC 
GCTTATATGT 
AAGTGTGTGC 
ACAAACTGCA 

ATTCCTGAAA 
AGTCCTGCAG 
GCCTCCCCCC 
TCTGGGGTGT 
GTGAGGTTCC 
AGGAGTAGGA 
GGCTCAGCCT 
GAAGGTAAAA 
GTTCTGGGAA 
TATACCTATT 
TTTCCACCAA 
GGGGGGAGTT 
GCTCACATCT 
CTCTGTGAAA 
GCCCCCAGAC 
AGAAAACCTG 
AAGCCTTCCT 
AACTT CCTGG 
CTCAGCCTCC 
CCCCAGCATT 
AGCCAGACAT 
TGTGGTGCTG 
CTGGCTCACC 
GCATAGGTTT 
AAATACGTAT 
GGAAGAACAC 
GACACAAGTC 
GGAGGGGAAA 
TAGCACTACC 
AAGAGCCCCA 
CTGAGGCAGA 
AGCTCAGGTC 
GGGGCTTTTC 
TCTCAGGATT 
AGTTCCAATT 
AGCGGCCAAT 
AGCAGCCCTG 
CCACCTTGGA 
GAAGGGAAGC 
TTGCTTCATG 
CTAATGTATT 
AAAAAAAAA 



CAAGATCTGC 
CCTGAGCTCC 
CTTCCAGGAT 
GGACCTCCTC 
CCTGGTGTTC 
GGCAGTCAGC 
GCTCTGGAAC 
TGTTCTGGCC 
GTCCCGGGAA 
GAGCTOGGCC 
CATGGCCTAT 
CTGCATGTGT 
CCGCCAGCTG 
CAGCAACAAG 
GACCAACCCC 
CCTCATGGGC 
CCTGACAGCC 
AAAGGGCCTG 
CGGAGCCTCC 
CCAGGTGTTC 
CGAAGACATC 
ACTGGCCAAG 
CAGTGCCTCA 
CAAGGAACTG 
AGCTGGGGCC 
AAGTTAGGCT 
GTTTTCTGTC 
AAAACTGTGG 
AATGGGGGTT 
GATGTCAATA 
ATGCATGTGC 
AAAAGCTAGG 
TTCTTACTCA 
TGTGTGGTGC 
GCTTCCTACC 
TTACAAGGCT 
CTGTAGTGTC 
AGAGGACAGC 
GCTGAAAGAT 
GCAAAGGCCC 
AATGATTTTT 
TGGCTGGTCT 
CCGGCTTCTA 
GCCTGCTGTG 
TGGGTAGACT 
CAGCCAGGGG 
CCCACAGGGG 
TCTGTGCACT 
TGTGGACATC 
CCTTCCAGCT 
TCTGTGCCCT 
TCCCAAGTCT 
CAGGCTGGAA 
GTGAGGACGC 
TGGGTCTGTC 



CAGGGGCCTC 
GGCATACCAA 
AGACCACATC 
CTTGGCTTTA 
TGCACCGCTG 
CTGGCCTCCT 
TGGGCCCAGC 
CTATTAGGGC 
ACCCGGGCCT 
CTAGAGAAAG 
TTAAGT CCAC 
TGGGGCCAAA 
GACAAATCCA 
ACTCCACACT 
ATCTGGCTGA 
CGGGGGCTGC 
TTTGTAGAGG 
TGCTTTTTTT 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
43 80 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 



Geq ID NO: 79 Protein sequence: 
Protein Accession #: CAA98022.1 



1 
I 

MNHSPLKTAL 
TLSHStTRGSM 
SQMENWSRHY 
QNRYS FYSTC 
ECSGLTIPKA 
QNVQQAAAGA 



11 

1 

AYECFQDQDN 
YDGLADNYNY 
PRGSCNTTGA 
SGQKAIKKCP 
VQYLSSQDEK 
LRNLVFRSTT 



21 
I 

STLALPSDQK 
GTTSRSSYYS 
GSDICFMQKI 
VRPPSCASKQ 
YQAIGAYYIQ 
NKLETRRQNG 



31 
I 

MKTGTSGRQR 
KFQAGNGSWG 
KASRSEPDLY 
DPVY1PPISC 
HTCFQDESAK 
IREAVSLLRR 



41 
I 

VQEQVMMTVK 
YPIYNGTLKR 
CDPRGTLRKG 
NKDLSFGHSR 
QQVYQLGGIC 
TGNAEIQKJQL 



51 

I 

RQKSKSSQSS 
EPDNRRFSSY 
TLGSKGQKTT 
ASSKICSEDI 
KLVDLLRSPN 
TGLLWNLSST 



60 
120 
180 
240 
300 
360 
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DELKEELIAD ALPVIADRVI IPFSGWCDGN SNMSREWDP EVFFNATGCL RNLSSADAGR 420 

QTMRNYSGLI DSLMAYVQNC VAASRCDDKS VENCMCVLHN L.SYRLDAEVP TRYRQLEYNA 480 

RNAYTEKSST GCFSNK9DKM MNNKYDCPLP EEETNPKGSG WLYHSDAIRT YLNLMGKSKK 540 

DATLEACAGA LQNLTASKGL MSSGMSQLIG LKEKGLPQIA RI*LQSGNSDV VRSGASLLSN 600 

MS RH PLLHRV MGNQVFPEVT RLLTSHTGNT SNSEDILSSA CYTVRNLMAS QPQLAKQYFS 660 

SSMLNNIINL CRSSASPKAA EAARLLLSDM WSSKELQGVL RQQGFDRNML GTLAGANSLR 720 
NFTSRP 

Seq ID NOt 60 DNA sequence 

Nucleic Acid Accession #: NM_006S16.1 

Coding sequence: 180-1658 

1 11 21 31 41 51 

I 1 I I I 1 

TAGTCGCGGG TCCCCGAGTG AGCACGCCAG GGAGCAGGAG ACCAAACGAC GGGGGTCGGA 60 

GTCAGAGTCG CAGTGGGAGT CCCCGGACCG GAGCACGAGC CTGAGCGGGA GAGCGCCGCT 120 

CGCACGCCCG TCGCCACCCG CGTACCCGGC GCAGCCAGAG CCACCAGCGC AGCGCTGCCA 1B0 

TGGAGCCCAG CAGCAAGAAG CTGACGGGTC GCCTCATGCT GGCTGTGGGA GGAGCAGTGC 240 

TTGGCTCCCT GCAGTTTGGC TACAACACTG GAGTCATCAA TGCCCCCCAG AAGGTGATCG 300 

AGGAGTTCTA CAACCAGACA TGGGTCCACC GCTATGGGGA GAGCATCCTG CCCACCACGC 360 

TCACCACGCT CTGGTCCCTC TCAGTGGCCA TCTTTTCTGT TGGGGGCATG ATTGGCTCCT 420 

TCTCTGTGGG CCTTTTCGTT AACCGCTTTG GCCGGCGGAA TTCAATGCTG ATGATGAACC 480 

TGCTGGCCTT CGTGTCCGCC GTGCTCATGG GCTTCTCGAA ACTGGGCAAG TCCTTTGAGA 540 

TGCTGATCCT GGGCCGCTTC ATCATCGQTG TGTACTGCGG CCTGACCACA GGCTTCGTGC 600 

CCATGTATGT GGGTGAAGTG TCACCCACAG CCTTTCGTGG GGCCCTGGGC ACCCTGCACC 660 

AGCTGGGCAT CGTCGTOGGC ATCCTCATCG CCCAGGTGTT CGGCCTGGAC TCCATCATGG 720 

GCAACAAGGA CCTGTGGCCC CTGCTGCTGA GCATCATCTT CATCCCGGCC CTGCTGCAGT 780 

GCATCGTGCT GCCCTTCTGC CCCGAGAGTC CCCGCTTCCT GCTCATCAAC CGCAACGAGG 840 

AGAACCGGGC CAAGAGTGTG CTAAAGAAGC TGCGCGGGAC AGCTGACGTG ACCCATGACC 900 

TGCAGGAGAT GAAGGAAGAG AGTCGGCAGA TGATGCGGGA GAAGAAGGTC ACCATCCTGG 960 

AGCTGTTCCG CTCCCCOGCC TACCGCCAGC CCATCCTCAT CGCTGTGGTG CTGCAGCTGT 1020 

CCCAGCAGCT GTCTGGCATC AACGCTGTCT TCTATTACTC CACGAGCATC TTCGAGAAGG 1080 

CGGGGGTGCA GCAGCCTGTG TATGCCACCA TTGGCTCCGG TATCGTCAAC ACGGCCTTCA 1140 

CTGTCGTGTC GCTGTTTGTG GTGGAGCGAG CAGGCCGGCG GACCCTGCAC CTCATAGGCC 1200 

TCGCTGGCAT GGCGGGTTGT GCCATACTCA TGACCAT CGC GCTAGCACTG CTGGAGCAGC 1260 

TACCCTGGAT GTCCTATCTG AGCATCGTGG CCATCTTTGG CTTTGTGGCC TTCTTTGAAG 1320 

TGGGTCCTGG CCCCATCCCA TGGTTCATCG TGGCTGAACT CTTCAGCCAG GGTCCACGTC 1380 

CAGCTGCCAT TGCCGTTGCA GGCTTCTCCA ACTGGACCTC AAATTTCATT GTGGGCATGT 1440 

GCTTCCAGTA TGTGGAGCAA CTGTGTGGTC CCTAOGTCTT CATCATCTTC ACTGTGCTCC 1500 

TGGTTCTGTT CTTCATCTTC ACCTACTTCA AAGTTCCTGA GACTAAAGGC CGGACCTTCG 1560 

ATGAGATCGC TTCCGGCTTC CGGCAGGGGG GAGCCAGCCA AAGTGATAAG ACACCCGAGG 1620 

AGCTGTTCCA TCCCCTGGGG GCTGATTCCC AAGTGTGAGT CGCCCCAGAT CACCAGCCCG 1680 

GCCTGCTCCC AGCAGCC CT A AGGATCTCTC AGGAGCACAG GCAGCTGGAT GAGACTTCCA 1740 

AACCTGACAG ATGTCAGCCG AGCCGGGCCT GGGGCTCCTT TCTCCAGCCA GCAATGATGT 1800 

CCAGAAGAAT' ATTCAGGACT TAACGGCTCC AGGATTTTAA CAAAAGCAAG ACTGTTGCTC 18 60 

AAATCTATTC AGACAAGCAA CAGGTTTTAT AATTTTTTTA TTACTGATTT TGTTATTTTT 1920 

ATATCAGCCT GAGTCTCCTG TGCCCACATC CCAGGCTTCA CCCTGAATGG TTCCATGCCT 1980 

GAGGGTGGAG ACTAAGCCCT GTCGAGACAC TTGCCTTCTT CACCCAGCTA ATCTGTAGGG 2040 

CTGGACCTAT GTCCTAAGGA CACACTAATC GAACTATGAA CTACAAAGCT TCTATCCCAG 2100 

GAGGTGGCTA TGGCCACCCG TTCTGCTGGC CTGGATCTCC CCACTCTAGG GGTCAGGCTC 2160 

CATTAGGATT TGCCCCTTCC CATCTCTTCC TACCCAACCA CTCAAATTAA TCTTTCTTTA 2220 

CCTGAGACCA GTTGGGAGCA CTGGAGTGCA GGGAGGAGAG GGGAAGGGCC AGTCTGGGCT 2280 

GCCGGGTTCT AGTCTCCTTT GCACTGAGGG CCACACTATT ACCATGAGAA GAGGGCCTGT 2340 

GGGAGCCTGC AAACTCACTG CTCAAGAAGA CATGGAGACT CCTGCCCTGT TGTGTATAGA 2400 

TGCAAGATAT TTATATATAT TTTTGGTTGT CAATATTAAA TACAGACACT AAGTTATAGT 2460 

ATATCTGGAC AAGCCAACTT GTAAATACAC CACCTCACTC CTGTTACTTA CCTAAACAGA 2520 

TATAAATGGC TGGTTTTTAG AAACATGGTT TTGAAATGCT TGTGGATTGA GGGTAGGAGG 2580 

TTTGGATGGG AGTGAGACAG AAGTAAGTGG GGTTGCAACC ACTGCAACGG CTTAGACTTC 2640 

GACTCAGGAT CCAGTCCCTT ACACGTACCT CTCATCAGTG TCCTCTTGCT CAAAAATCTG 2700 

TTTGATCCCT GTTACCCAGA GAATATATAC ATT CTTTATC TTGACATTCA AGGCATTTCT 2760 

ATCACATATT TGATAGTTGG TGTTCAAAAA AACACTAGTT TTGTGCCAGC CGTGATGCTC 2820 
AGGCTTGAAA TCGCATTATT TTGAATGTGA AGGGAA 

Seq ID NO: 81 Protein sequence t 
Protein Accession NP_006507.1 

1 11 21 31 41 51 

llllll 

MEPSSKKLTG RLMLAVGGAV LGSLQFGYNT GVINAPQKVI EEFYNQTWVH RYGESILPTT 60 

LTTLWSLSVA IFSVGGMIGS FSVGLPWRP GRRNSMLMMN LLAFVSAVLM GPSKLGKSFE 120 

MLILGRFIIG VYCGLTTGFV PMYVGEVSPT AFRGALGTLH QLGIWGIIiI AQVFGLDS1M 180 

GNKDLWPLLL SIIPIPALLQ CIVLPFCPES PRFLL1NRNE ENRAKSVLKK LRGTADVTHD 240 

LQEMKEESRQ MMREKKVTIL ELFRSPAYRQ PILIAWLQIi BQQLSGINAV FYYSTSIFEK 300 

AGVQQPVYAT IGSGIVNTAF TWSLFWER AGRRTLHLIG LAGMAGCAIL MTIALALLEQ 360 

I/PWMSYLSIV AIFGFVAFFE VGPGPIPWFI VAELFSQGPR PAAIAVAGFS NWTSNFIVGM 420 

CFQYVEQLCG PYVPIIFTVL IiVLFFIFTYF KVPETKGRTF DEIASGFRQG GASQSDKTPE 480 
ELPHPLGADS QV 

Seq ID NO> 82 DNA sequence 
Nucleic Acid Accession tit BC001291 
Coding sequence: 44-541 

1 11 21 31 41 51 

llllll 

GGGGGCGCCG CGCGCTGACC CTCCCTGGGC ACCOCTGGGO ACQATGGCGC TGCTCGCCTT 60 

GCTGCTGGTC GTGGCCCTAC CGCGGGTGTG GACAGACGCC AACCTGACTG CGAGACAACG 120 



219 



WO 02/086443 



PCT/US02/12476 



5 
10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



AGATCCAGAG 
TGAGAGAGAA 
CTGCGTTATA 
CGCTGGTTGT 
GCCCATGCCC 
ACCTATCAAC 
GCTGTGGCTG 
AGCCACGGGA 
ACCTGTTGCA 
GGGATGGGAG 
ACATTCAGAG 
AAATCAAACC 
CCTCTGAGGG 
TGCTGAGATG 
GGGTGAAGAC 
AGGQCTGCCC 
CTACCAGATT 
ACCAGCTGGC 
ACTTAGGCCA 
CATCCATGGG 
TTCAAAAGTT 



GACTCCCAGC 
AACACTTTCG 
GCGGCCGTGA 
GCAGCGATGG 
TTCTTTTACC 
TCATCAGTGT 
GCCATCCTCC 
CTGCCACAGA 
TTAAACTTGT 
AGTGGGGATC 
GAAGTCCAGA 
TTGTAACTCA 
CTTCAGTATT 
CTTCCGACCT 
ATCCCTGGAG 
CCATTCCAGT 
CCAGGAGGCA 
ACAGGTGCAC 
AGTAGAGAGC 
GAGCTGAGAA 
CACGAAAAAA 



GAACGGACGA 
AGTGCCAGAA 
AAATATTTCC 
AGAGACCCAA 
TCAAGTGTTG 
TCAAAGAATA 
TGCTGCTGGC 
CTGAGCCTTC 
TTTCTGTTGA 
AGGTGCAGTT 
TCTCCTGAGT 
TTTATTGCTG 
GATGGGGAGG 
TTCAGGTGAC 
TGAAGGACTC 
GGTGGAGGCG 
GAAGATAACT 
AGATT CAT AA 
ATCAGGGTAA 
ATCAGACTCA 
AAAAAAAAAA 



GGGTGACAAT 
CCCAAGGAGG 
ACGTTTTTTC 
GCCAGAGGAG 
TAAAATTCGC 
TGCTGGGAGC 
CTCCATTGCA 
CGGAGCATGG 
TTACCTCTTG 
GGCTCTTAAC 
AGTGATTTTG 
ATGGCCACTC 
GAGGCCTAAG 
GCAGGAACAC 
CTCAGCATGG 
CTOTGGATGQ 
AATTGTGTTG 
ATTCCCACAC 
ATGGCGTTCA 
AAGTTCCACC 
AAAAAAAAAA 



AGAGTGTGGT 
TGCAAATGGA 
ATGGTTGCGA 
AAGCGGTTTC 
TACTGCAATT 
ATGGGTGAGA 
GCCGGCCTCA 
ACTCGCTCCA 
GTTTGACTTC 
CCTCAAGGGT 
GTGACAAGTT 
TTTTCCTTGA 
TACCACTCAT 
TGGGGGAGTC 
GGGGCAGTGG 
CTGCTTTTCC 
AAGAAACTTA 
GTGTGTGTTC 
TTTCTCTGTT 
AAAAACAAAT 
AAAAAAAAAA 



GTCATGTTTG 
CAGAGCCATA 
AGCAGTGCTC 
TCCTGGAAGA 
TAGAGGGGCC 
GCTGTGGTGG 
GCCTGTCTTG 
GACCGTTGTC 
CCAGGGTCTT 
TCTTTAACTC 
TTTCTCTTTG 
CTCCCCTCTG 
GGAGAGTATG 
TGAATGATTG 
GGCACACGTT 
TCAACCTTTC 
GACTTCACCC 
AACATCTGAA 
AAGATGCAGC 
ACAAGGGGAC 
AAA 



Seq ID NO i 83 Protein sequence: 
Protein Accession ft: AAH01291 



41 



51 



1 11 21 31 

I.I I I ! 1 

MAIiIALLLW ALPRVWTDAN LTARQRDPED SQRTDEGDNR VWCHVCEREN TPECQNPRRC 
KWTEPYCVIA AVKIFPRPPM VAKQCSAGCA AMERPKPEEK RPLLEEPMPF FYLKCCKIRY 
CNLEGPPINS SVPKEYAGSM GESCGGLWLA ILLLLASIAA GLSLS 

Seq ID NO: 84 DNA sequence 

Nucleic Acid Accession #: NM_022893.1 

Coding sequence: 229-2726 



1 
I 

TTTTTTTTTT 
TGCGCCATCT 
TTTTCTCTGG 
CGCCCGCCGC 
AAGCAAGGCA 
ATTCTTACAG 
CTCCTCACCT 
GAGCACAAAC 
CCTTCCCCTT 
GTCACGCCAG 
GAACACATAG 
GGAGCTCTAA 
GATGAGCCCA 
CTCTTGCAAC 
AGTCCCCTGA 
CCACCTCTCC 
GGATCAGTAT 
CTGTTTAGTC 
GAAGAAATGG 
CCAATGGCTA 
AACACGTCTA 
CCATTCCAGC 
TCCGCCCCTC 
ACGTTCAAAT 
TACAAGTGCA 
AAGACGCACA 
GCCAGCTCCC 
TCCGTGGTGG 
GAGGAGGAAG 
CTGACGGAGA 
CACGAGAACA 
GACGTCATGC 
GTCCTGGGCG 
TGCGAOGAAG 
CGCGGCTGCT 
AGCCCCAGCT 
CCCCCGGCCA 
GCCTCCAGGC 
GCCTCCTCGT 
GAG CTGGACG 
ATTAGTGGTC 
GAGTACTGTG 
ACGGGCGAAA 
CTCACCAGGC 
TGTAAGATGC 
GATOGAGTGT 
CTCCCACCTG 
CCTGTAGGAT 
ACGAAGCTAA 
TTCTTTTTTC 



11 
I 

TTTTTTGCTT 
TTGTATTATT 
AGTCTCCTTC 
CGCCGCCGCC 
AACCCCAGCA 
ATGATGAACC 
GTGGGCAGTG 
GGAAACAATG 
CACCAATCGA 
AAGATGACGA 
CAGATAAACT 
TCCCCACGCC 
GCAGCTACAC 
ACGCACAGAA 
CCCCGCGGGT 
ATGGGATI CA 
CGAGAGAGGC 
CACCACCGAG 
CCCTGGCCAC 
TGGAGCCTCC 
GCCCACCGCT 
CAGGTAGCAA 
CTCCCTCCCA 
TTCAGAGCAA 
ACCTGTGCGA 
TGCACAAATC 
CGGAACCCGG 
CCAAGTTCAA 
AGGAGGACGA 
GCGAGAGGGT 
GCTCGCGGGG 
AGGGCATGGT 
AGAAGCATAA 
ACTCGGTGGC 
CCCCGGGCGA 
CGCTGAGCCC 
CGATGCCCAA 
AGCTCAAAGA 
CGGAGCACTC 
GAGGGATCTC 
CGGGCACGGG 
GGAAAGTCTT 
GGCCTTATAA 
ACATGAAAAC 
CTTTTAGCGT 
TGAATAATGA 
ACACCCCCTT 
TTTTTTCTAG 
GAATATGAGA 
TTTTTCCTTT 



21 
I 

AAAAAAAAGC 
TCTAATTTAT 
TTTCTAACCC 
GCCGCCGCCG 
CTTAAGCAAA 
AGACCACGGC 
CCAGATGAAC 
CAATGGCAGC 
GATGAAAAAA 
TTGTTT AT CA 
TCTGCACTGG 
TGGGATGAGT 
ATGTACAACT 
CACT CAT GG A 
TGGTATCCCT 
TATTGCAGAC 
TTCCGGCCTG 
ACATCACTTG 
CCATCACCCG 
CGCCATGGAT 
GTCCCCAGGC 
GCCGCCCTTC 
GCCCCCGGTC 
CCTGGTGGTG 
CCACGCGTGC 
GTCCCCCATG 
CACCAGCGAC 
GAGCGAGAAC 
CGAGGAAGAG 
GGACTACGGC 
CGCGGTCGTG 
GCTCAGCTCC 
GCGCGGCCAC 
CGGCGAGTCG 
GTCGGCCTCG 
CTTCTCTAAG 
CACGGAGAAC 
TCCCTTCCTT 
CTCGGAGAAC 
GGGGCGCAGC 
CAGGCCCAGC 
CAAGAACTGT 
ATGCGAGCTG 
GCATGGCCAG 
GTACAGTACC 
TATAAAAACT 
TTTCACCACT 
TCCCATGTGA 
GTGCTTGTCA 
TTTTTTTTTT 



31 

I 

CATGACGGCT 
TTTGGATGTC 
GGCTCTCCCG 
CCCGCCCCGC 
CGGGAATTCT 
CCGTTGGGAG 
TTCCCATTGG 
CTCTGCTTAG 
GCATCCAATC 
ACGTCATCTA 
AGGGGCCTCT 
GCAGAATATG 
TGCAAACAGC 
TTAAGAATCT 
TCAGGACTAG 
AATAACCCCT 
GCAGAAGGGC 
GACCCCCACC 
AGTGCCTTTG 
TTCTCTAGGA 
CGGCCCAGCC 
CTGGCGACGC 
AAGTCCAAGT 
CACCGGCGCA 
ACCCAGGCCA 
ACGGTCAAGT 
TTGGTGGGCA 
GACCCCAACC 
GAAGAAGAGG 
TTCGGGCTGA 
GGCGTGGGCG 
ATGCAGCACT 
CTGGCCGAGG 
GACCGCATAG 
GGGGGCCTGT 
CGCATCAAGC 
GTGTACTCGC 
AGCTTCGGAG 
GGGAGCTTGC 
GGCACGGGAA 
TCAAAAGAGG 
AGCAATCTCA 
TGCAACTATG 
GTGGGGAAGG 
CTGGAGAAAC 
GAATAGAGGT 
CCCTTTCCCC 
TTTAAACAAA 
CCAGCACACC 
TCCTTTATGT 



41 
I 

CTCCCACAAT 
AAAAGGCACT 
ATGTGAACCG 
AGCCCACCAT 
CGCCCGAGCC 
CTCCAGAAGG 
GGGACATTCT 
AAAAAG CTGT 
CCGTGGAGGT 
GAAGAATTTG 
CCTCCCCTCG 
CCCCGCAGGG 
CATTCACCAG 
ACTTAGAAAG 
GTGCAGAATG 
TTAACCTGCT 
GCTTTCCACC 
GCATAGAGCG 
ACAGGGTGCT 
GACTTAGAGA 
CTATGCAAAG 
CCCCCCTCCC 
CATGCGAGTT 
GCCACACGGG 
GCAAGCTGAA 
CCGACGACGG 
GCGCCAGCAG 
TGATCCCGGA 
AGGAAGAGGA 
GCCTGGAGGC 
ACGAGAGCCG 
TCAGCGAGGC 
CCGAGGGCCA 
ACGATGGCAC 
CCAAAAAGCT 
TCGAGAAGGA 
AGTGGCTCGC 
ACTCCAGACA 
GCTTCTCCAC 
GTGGAGGGAG 
GCAGACGCAG 
CTGTCCACAG 
CCTGTGCCCA 
ACGTTTACAA 
ACATGAAAAA 
ATATTAATAC 
ATCGCCCTCC 
CAAACAAACA 
TGTTTTTTTT 
TCTCACCGTT 




GCGGTTGAAT 
GCTGGCAGGG 
GTTACTGCAA 
TCCTCTGCAA 
CTGCGGCAAG 
CGAGAAGCCC 
GCGCCACATG 
TCTCTCCACC 
CGCGCTCAAG 
GAACGGGGAC 
GGAGGAGGAG 
GGCGCGCCAC 
CGCCCTGCCC 
CTTCCACCAG 
CAGGGACACT 
TGTTAATGGC 
GCTGCTGGGC 
GTTCGACCTG 
CGGCTACGCG 
ATCGCCTTTT 
ACCGCCCGGG 
CACGCCCCAT 
CGACACTTGT 
GAGAAGCCAC 
GAGTAGCAAG 
ATGTGAAATT 
ATGGCACAGT 
CCCTCCCTCA 
AGCCCCACTC 
AACAGAAGTA 
CTTTTTCTTT 
TGAATGCATG 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



220 



WO 02/086443 

ATCTGTATGG GGCAATACTA TTGCATTTTA CGCAAACTTT GAGCCTTTCT CTTGTGCAAT 3060 

AATTTACATG TTGTGTATGT TTTTTTTTAA ACTTAGACAG CATGTATGGT ATGTTATGGC 3120 

TATTTTAAAT TGTCCCTAAT TCGTTGCTGA GCAAACATGT TGCTGTTTCC AGTTCCGTTC 31 BO 

TGAGAGAAAA AGAGAGAGAG AGAGAAAAAG ACCATGCTGC ATACATTCTG TAATACATAT 3240 

CATGTACAGT TTTATTTTAT AACGTGAGGA GGAAAAACAG TCTTTGGATT AACCCTCTAT 3300 

AGACAGAATA GATAGCACTG AAAAAAAATC TCTATGAGCT AAATGTCTGT CTCTAAAGGG 3360 

TTAAATGTAT CAATTGGAAA GGAAGAAAAA AGGCCTTGAA TTGACAAATT AACAGAAAAA 3420 

CAGAACAAGT TTATTCTATC ATTTGGTTTT AAAATATGAG TGCCTTGGAT CTATTAAAAC 3480 

CACATCGATG GTTCTTTCTA CTTGTTATAA ACTTGTAGCT TAATTCAGCA TTGGGTGAGG 3S40 

TAATAAACCT TAGGAACTAG CATATAATTC TATATTGTAT TTCTCACAAC AATGGCTACC 3600 

TAAAAAGATG ACCCATTATG TCCTAGTTAA TCATCATTTT TCCTTTAGTT TAATTTTATA 3660 

AACAAAACTG ATTATACCAG TATAAAAGCT ACTTTGCTCC TGGTGAGAGC TTAAAAGAAA 3720 

TGGGCTGTTT TGCCCAAAGT TTTATTTTTT TTAAACAATG ATTAAATTGA ATGTGTAATG 3780 

TGCAAAAGCC CTGGAACGCA ATTAAATACA CTAGTAAGGA GTTCATTTTA TGAAGATATT 3840 

TGCTTTAATA ATGTCTTTTT AAAAATACTG GCACCAAAAG AAATAGATCC AGATCTACTT 3900 

GGTTGTCAAG TGGACAATCA AATGATAAAC TTTAAGACCT TGTATACCAT ATTGAAAGGA 3960 

AGAGGCTGAC AATAAGGTTT GACAGAGGGG AACAGAAGAA AATAATATGA TTTATTAGCA 4020 

CAACGTGGTA CTATTTGCCA TTTAAAACTA GAACAGGTAT ATAAGCTAAT ATTGATACAA 4080 

TGATGATTAA CTATGAATTC TTAAGACTTG CATTTAAATG TGACATTCTT AAAAAAAGAA 4140 

GAGAAAGAAT TTTAAGAGTA GCAGTATATA TGTCTGTGCT CCCTAAAAGT TGTACTTCAT 4200 

TTCTTTTCCA TACACTGTGT GCTATTTGTG TTAACATGGA AGAGGATTCA TTGTTTTTAT 4260 

TTTTATTTTT TTAATTTTTT CTTTTTTATT AAGCTAGCAT CTGCCCCAGT TGGTGTTCAA 4320 

ATAGCACTTG ACTCTGCCTG TGATATCTGT ATCTTTTCTC TAATCAGAGA TACAGAGGTT 4380 

GAGTATAAAA TAAACCTGCT CAGATAGGAC AATTAAGTGC ACTGTACAAT TTTCCCAGTT 4440 

TACAGGTCTA TACTTAAGGG AAAAGTTGCA AGAATGCTGA AAAAAAATTG AACACAATCT 4500 

CATTGAGGAG CATTTTTTAA AAACTAAAAA AAAAAAAACT TTGCCAGCCA TT TACT TG AC 4560 

TATTGAGCTT ACTTACTTGG ACGCAA CATT GCAAGCGCTG TGAATGGAAA CAGAATACAC 4620 

TTAACATAGA AATGAATGAT TGCTTTCGCT TCTACAGTGC AAGGATTTTT TTGTACAAAA 4680 

CTTTTTTAAA TATAAATGTT AAGAAAAATT TTTTTTAAAA AACACTTCAT TATGTTTAGG 4740 

GGGGAACTGC ATTTTAGGGT TCCATTGTCT TGGTGGTGTT ACAAGACTTG TTATCCATTT 4800 

AAAAATGGTA GTGGAAATTC TATGCCTTGG ATACACACCG CTCTTCAGGT TGTAAAAAAA 4860 

AAAAACATAC ATTGGGGAAA GGTTTAAGAT TATATAGTAC TTAAATATAG GAAAATGCAC 4920 

ACTCATGTTG ATTCCTATGC TAAAATACAT TTATGGTCTT TTTTCTGTAT TTCTAGAATG 4980 

GTATTTGAAT TAAATGTTCA TCTAGTGTTA GGCACTATAG TATTTATATT GAAGCTTGTA 5040 

TTTTTAACTG TTGCTTGTTC TCTTAAAAGG TATCAATGTA CCTTTTTTGG TAGTGGAAAA 5100 

AAAAAAGACA GGCTGCCACA GTATATTTTT TTAATTTGGC AGGATAATAT AGTGCAAATT 5160 

ATTTGTATGC TTCAAAAAAA AAAAAAAGAG AGAAACAAAA AAGTGTGACA TTACAGATGA 5220 

GAAGCCATAT AATGGCGGTT TGGGGGAGCC TGCTAGAATG TCACATGGAT GGCTGTCATA 5280 

GGGGTTGTAC ATATCCTTTT TTGTTCCTTT TTCCTGCTGC CATACTGTAT GCAGTACTGC 5340 

AAGCTAATAA CGTTGGTTTG TTATGTAGTG TGCTTTTTGT CCCTTTCCTT CTATCACCCT 54 00 

ACATTCCAGC ATCTTACCTT CATATGCAGT AAAAGAAAGA AAGAAAAAAA AAGGAAAAAA 5460 

AAAAAAAAAC CAATGTTTTG CAGTTTTTTT CATTGCCAAA AACTAAATGG TGCTTTATAT 5520 

TTAGATTGGA AAGAATTTCA TATGCAAAGC ATATTAAAGA GAAAGCCCGC TTTAGTCAAT 5580 

ACTTTTTTGT AAATGGCAAT GCAGAATATT TTGTT AT TGG CCTTTTCTAT TCCTGTAATG 5640 

AAAGCTGTTT GTCGTAACTT GAAATTTTAT CTTTTACTAT GGGAGTCACT ATTTATTATT 5700 

GCTTATGTGC CCTGTTCAAA ACAGAGGCAC TTAATTTGAT CTTTTATTTT TCTTTGTTTT 5760 

TATTTTTTTT TTTATTTAGA TGACCAAAGG TCATTACAAC CTGGCTTTTT ATTGTATTTG 5820 

TTTCTGGTCT TTGTTAAGTT CTATTGGAAA AACCACTGTC TGTGTTTTTT TGGCAGTTGT 5880 

CTGCATTAAC CTGTTCATAC ACCCATTTTG TCCCTTTATT GAAAAAATAA AAAAAATTAA 5940 
A 

Seq ID NO i 85 Protein sequence: 
Protein Accession #x np_075044.i 

1 ll 21 31 41 51 

I I I I I I 

M8RRKQGKPQ HLSKREFSPE PLEAILTDDE PDHGPLGAPE GDHDLLTCGQ CQMNFPLGDI 60 

IiIFXEHKRXQ CNGSLCLEKA VDKPPSPSPI EMKKASNPVE VGIQVTPEDD DCIjSTSSRRI 120 

CPKQEHIADK LLHWRGLSSP HSAHGAIilPT PGMSAEYAPQ GICKDEPSSY TCTTCKQPFT 180 

SAWPLLQHAQ NTHGLRIYLE SEHGSPLTPR VGIPSGLGAE CPSQPPLHGI HIADNNPFNL 240 

LRIPGSVSRE ASGLAEGRFP PTPPLFSPPP RHHLDPHRIE RLGAEEMALA THHPSAFDRV 300 

LRLNPMAMEP PAMDFSRRLR ELAGNTSSPP L8PGRPSPMQ RLLQPFQPGS KPPFLATPPL 360 

PPLQSAPPPS QPPVKSKSCE FCGKTFKFQS NLWHRRSHT GEKPYKCNLC DHACTQASKL 420 

KRHMKTHMHK SSPMTVKSDD GLSTASSPEP GTSDLVGSAS SALKSWAKF KSENDPNLIP 480 

ENGDEEEEED DEEEEEEEEE EEEELTESER VDYGFGLSLE AARHHENSSR GAWGVGD ES 540 

RALPDVMQGM VLSSMQHFSE AFHQVLGEKH KRGHLAEAEG HRDTCDEDSV AGESDRIDDG 600 

TVNGRGCSFG ESASGGLSKK LLLGSPSSLS PFSKRIKLEK EFDLPPATMP NTENVYSQWL 660 

AGYAASRQLK DPFLSFGDSR QSPFASSSEH SSENGSLRFS TPPGELDGGI SGRSGTGSGG 720 

STPHISGPGT GRPSSKEGRR SDTCEYCGKV FKNCSNLTVH RRSHTGERPY KCELCNYACA 780 
QSSKLTRHMK THGQVGKDVY KCEICKMPFS VYSTLEKHMK KWHSDRVLNN DIKTE 

Seq ID NO: 86 DNA sequence 

Nucleic Acid Accession XM_035292.2 

Coding sequence: 53-1576 

1 11 21 31 41 51 

] I I I I I 

GCTCGCTGGG CCGCGGCTCC CGGGTGTCCC AGGCCCGGCC GGTGCGCAGA GCATGGCGGG 60 

TGCGGGCCCG AAGCGGCGOG CGCTAGCGGC GCCGGCGGCC GAGGAGAAGG AAGAGGCGCG 120 

GGAGAAGATG CTGGCCGCCA AGAGCGCGGA CGGCTCGGCG CCGGCAGGCG AGGGCGAGGG 180 

CGTGACCCTG CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG TGGGGACCAT 240 

TATCGGCTCG GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG GCTCGCCGGG 300 

GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA 360 

CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 420 

CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 4 80 

ATCGCAGTAC AT CGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC 540 
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CTGCCCGGTG CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC 
GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC 
CAAGCTCCTG GCCCTGGCCC TGATCATCCT GCTGGGCTTC 
TGTGTCCAAT CTAGATCCCA ACTTCTCATT TGAAGGCACC 
TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA 
CACAGAGGAA ATGATCAACC CCTACAGAAA CCTGCCCCTG 
CATCGTGACG CTGGTGTACG TGCTGACCAA CCTGGCCTAC 
GCAGATGCTG TCGTCCGAGG CCGTGGCCGT GGACTTCGGG 
GTCCTGGATC ATCCCCGTCT TCGTGGGCCT GTCCTGCTTC 
GTTCACATCC TCCAGGCT CT TCTTCGTGGG GTCCCGGGAA 
CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC 
GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC 
CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC 
TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT 
CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG 
CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTCGGGGTC 
GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT 
CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA 

Seq ID NO: 87 Protein sequence: 
Protein Accession #: XP_0352 92.2 



PCT/US02/12476 



MAGAGPKRRA 
GTIIGSGIFV 
LEVYGSLPAF 
LLTAVNCYSV 
GN IVLALYSG 
STEQMLSSEA 
SILSMIHPQL 
RKPELERPIK 
KPKWLLQGIF 



11 
I 

LAAPAAEEKE 
TPTGVLKEAG 
LKLWIELLII 
KAATRVQDAF 
LFAYGGWNYL 
VAVDFGNYHL 
LTPVPSLVFT 
VNIiALPVFFI 
STTVLCQKLM 



21 

I 

EAREKMXiAAK 
SPGIiALWWA 
RPSSQYIVAIi 
AAAKLLALAL 
NFVTEEM INP 
GVMSWIIPVF 
CVMTLLYAFS 
LACLFLIAVS 
QWPQET 



31 

! 

SADGSAPAGE 
ACGVFSIVGA 
VFATYLLKPL 
IILLGFVQIG 
YRNLPLAIII 
VGLSCFGSVN 
KDIFSVINFF 
FWKTPVECGI 



CTCTGCGTGC 
CAGGATGCCT 
GTCCAGATCG 
AAACTGGATG 
TGGAATTACT 
GCCATCATCA 
TTCACCACCC 
AACTATCACC 
GGCTCCGTCA 
GGCCACCTGC 
CTCGTGTTCA 
ATCAACTTCT 
TGGCTGCGCC 
GTGTTCTTCA 
GAGTGTGGCA 
TGGTGGAAAA 
CAGAAGCTCA 
GGAGCATGC 



41 
I 

GEGVTLQRNI 
LCYAELGTTI 
FPTCPVPEEA 
KGDVSNLDPN 
SLPIVTLVYV 
GSLFTSSRLF 
SFFNWLCVAL 
GFTIILSGLP 



TGCTGCTCAC 
TTGCCGCCGC 
GGAAGGGTGA 
TGGGGAACAT 
TGAATTTCGT 
TCTCCCTGCC 
TGTCCACCGA 
TGGGCGTCAT 
ATGGGTCCCT 
CCTCCATCCT 
CGTGTGTGAT 
TCAGCTTCTT 
ACAGAAAGCC 
TCCTGGCCTG 
TCGGCTTCAC 
ACAAGCCCAA 
TGCAGGTGGT 



51 
! 

TIiLNGVAIIV 
SKSGGDYAYM 
AKLVACLCVL 
FSFEGTKLDV 
LTNLAYFTTL 
FVGSREGHLP 
AIIGMIWLRH 
VYFFGVWWKN 



Seq ID NO i 88 DNA sequence 

Nucleic Acid Accession #: NM_005268. 

Coding sequence: 168-989 



TAAAAAGCAA 
TCTGGATATG 
AGCCCTGAGG 
TCTTTGAGGG 
TGTCTCTGGT 
GTGATGACCA 
TTGATGAGTT 
CATGCCCCTC 
ACCGAGAAGC 
. GTGGGCTCTG 
TTCTCTATGT 
ACGCAGATCC 
TTTTCACCCT 
TCATCTACCT 
TGTGCACAGG 
CGGGTGACCT 
GAGACCATGT 
CCTGGATGGG 
CATGAGGTAG 
TCAACTCCAG 
GCTCGGTTTC 



11 
I 

AAGAATTCGC 
AAATTCAAGC 
AGTAGTCACT 
ACTCCTGAGT 
CTTCATCTTC 
CAAGGACTTC 
CTTCCCTGTG 
ACTGCTCGTG 
CCATGGGGAG 
GTGGACATAT 
GTTCCACTCA 
ATGTCCCAAT 
CTTCATGGTG 
GGTGAGCAAG 
TCATCACCCC 
CATCTTTCTG 
GAAGAAAACC 
GAGGCTCTAG 
GGGCAGGCAA 
CCACCTGCCC 
CTTTTCTAGA 



21 
I 

GGCCGCGTCG 
TGCTTGCTGA 
CAGTAGCAGC 
GGGGTCAACA 
CGCGTGCTGG 
GACTGCAATA 
TCCCATGTGC 
GTCATGCACG 
AACAGTGGGC 
GTCTGCAGCC 
TTCTACCCCA 
ATAGTGGACT 
GCCACAGCTG 
AGATGCCACG 
CACGGTACCA 
GGCTCAGACA 
ATCTTGTGAG 
CATCTCTCAT 
GAGAGAGGAT 
CAGCTCGACG 
ATGGAAATAG 



31 
I 

ACACGGGCTT 
GTCCTATTGC 
TGACGCGTGG 
AGTACTCCAC 
TGTACCTGGT 
CTCGCCAGCC 
GCCTCTGGGC 
TGGCCTACCG 
GCCTCTACCT 
TAGTGTTCAA 
AATATATCCT 
GCTTCATCTC 
CCATCTGCAT 
AGTGCCTGGC 

GTCATCCTCC 
GGGCTGCCTG 
AGGTGCAACC 
TCAGACGCTC 
GCACTGGGCC 
TGAGGGCCAA 



41 
I 

CCCCGAAAAC 
CGGCTGCTGG 
GTCCACCATG 
AGCCTTTGGG 
GACGGCCGAG 
CGGCTGCTCC 
CCTGCAGCTT 
GGAGGTTCAG 
GAACCCCGGC 
GGCGAGCGTG 
CCCTCCTGTG 
CAAGCCCTCA 
CCTGCTCAAC 
AGCAAGGAAA 
CAAACAAGAC 
TCTCTTACCA 
GACTGGTCTG 
TGAGAGTGGG 
TGGGAGCCAG 
AGTTCCCCCT 
TGC 



Seq ID NO: 89 Protein sequence: 
Protein Accession #i NP_005259.l 

1 11 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1S60 



51 
1 

CTTCCCCGCT 
GAGCCAGGAG 
AACTGGAGTA 
CGCATCTGGC 
CGTGTGTGGA 
AACGTCTGCT 
ATCCTGGTGA 
GAGAAGAGGC 
AAGAAGCGGG 
GACATCGCCT 
GTCAAGTGCC 
GAGAAGAACA 
CTCGTGGAGC 
GCTCAAGCCA 
GACCTCCTTT 
GACCGCCCCC 
GCAGGTTGGG 
GGAGCTAAGC 
TTCCTAGTCC 
CTGCTCTGCA 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



21 31 41 51 

I I I I 

MNWSIFEGIiL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 
SNVCFDEFFP VSHVRLWALQ LILVTCPSLIi WMHVAYREV QEKRHREAHG ENSGRLYLNP 
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 



Seq ID NO: 90 DNA sequence 

Nucleic Acid Accession #: NM_002391.1 

Coding sequence: 26-457 

1 11 21 31 41 51 

I I I 1 I I 

CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 
CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 
CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 
CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 
GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 
TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 



60 
120 
160 
240 



60 
120 
180 
240 
300 
360 



222 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

CAATGCTCAQ TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 
AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 
GCCCCTGGTQ TCACATGOGG CCTGGCCACO CCCTCCCTCT CCCAGGCCCG AGATGTGACC 
CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 
ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 
TGAGCCTCCC CCAAAGCAAT GTGAOTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 
ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 
TAATAT 

seq ID NOt 91 Protein sequence: 
Protein Accession S: NP_002302.1 

11 



PCT/US02/12476 



31 



41 



51 



1 11 21 

I I I I I I 

MQHRGPLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 
CGAQTQRIRC RVPCNWKKEP QADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 
RVTKPCTPKT KAKAKAKKGK GKD 



Seq ID MO: 92 DNA sequence 

Nucleic Acid Accession #: NM_005130.1 

Coding sequence.- 98-802 



CTCTACCTGA 
CGTGTGCTCA 
GCTCTCCTTC 
GAATGGACTT 
TAAGCAGAAA 
CAGATGGGCT 
GGACCATGAA 
TGAGAGAGTC 
ATATTCCAAG 
TAAGCTAGTC 
GTCCCCCAGG 
GACCATGGCC 
GACTGCCCTG 
AGTGCAGGAC 
TGT CGTAAGT 
TGTGCTTAGT 
TGGAATTTGC 
TTCCATGGCC 
GAGTGATAAT 
TTTTTCAAAA 



11 
I 

CACAGCTGCA 
GAACAAGGTG 
CTCCTACTGG 
CACAGCAAAG 
AGCAGGCCCG 
GCTACTGAGC 
TTTTCCTGTG 
TATTGGAAAC 
ACAGCTGTGA 
AGCTCCACTC 
GAGCACATCA 
ACCAAAGCTC 
GAGTTCTGTG 
ACGTCATGCT 
CCCTCTGTAT 
GAGTGCAACG 
CTTATTTTTC 
CACACAGCTA 
TTCAGTGCAA 
AAAAAAAAAA 



21 
1 

GCCTGCAATT 
AACGCCCAGC 
CTGCTCAGGT 
TGGTCTCAGA 
GGAACAAAGG 
AGGAGGAGGG 
TCTTTGCTGG 
AAGTTGCCCG 
AAACCAGAGT 
TATTTGGGAA 
AGGGCAAAGA 
CCGAGTGTGT 
GAGAGACTTG 
AATGAGGTCA 
ACTTTAAAGC 
AAATATTTAA 
TTGGATGCGA 
TGTGTTTGAG 
CGAACTTTCT 
AAA 



31 
I 

CACTCCCACT 
TGCAGCCATG 
GCTCCTGGTG 
ACAAAAGGAC 
CAAGTTTGTC 
CATCTCTCTC 
CAATCCAACC 
GAATCTGCGC 
GTGCAGAAAG 
CACAAAGCCC 
GACCACCCCC 
GGAGGACCCA 
GAGCTCTCTC 
AAAGAGAACG 
TCTCTACAGT 
ACAAGTTTTG 
TGTTCAGAGG 
CAGCGAAGAG 
GCTGAATTAA 



41 

I 

GCCTGGGATT 
AAGATCTGTA 
GAGGGGAAAA 
ACTCTGGGCA 
ACCAAAGACC 
AAGGTTGAGT 
TCATGCCTAA 
TCACAGAAAG 
GATTTTCCAG 
AGGAAGGAGA 
TCTAGCCTAG 
GATATGGCAA 
TGCACATTCT 
GGTTCCTTTA 
CCCCCCAAAA 
TATTTTTTGC 
CTGTTTCCTG 
TCTTTGAGCT 
TGGTAATAAA 



51 
I 

GCACTGGATC 
GCCTCACCCT 
AAAAAGTGAA 
ACACCCAGAT 
AAGCCAACTG 
GCACTCAATT 
AGCTCAAGGA 
ACATCTGTAG 
AATCCAGTCT 
AAACAGAGAT 
CAGTGACCCA 
ACCAGAGGAA 
TCCTCAGCAT 
AGAGATGTCA 
TATGAACTTT 
TTTTGTGTTT 
CAGCATGTAT 
GAATGAGCCA 
ACTCTGGGTG 



Seq ID NO i 93 Protein sequence: 
Protein Accession #t NP 005121.1 



1 11 21 31 41 51 

I I I 1 I I 

MKICSLTLLS FLLIiAAQVLL VEGKKKVKNG LHSKWSEQK DTLGNTQIKQ KSRPGNKGKP 
VTKDQANCRW AATEQEEGIS LKVECTQLDH EFSCVPAGNP TSCLKLKDER VYWKQVARNL 
RSQKDICRYS KTAVKTRVCR KDFPESSLKL VSSTLFGNTK PRKEKTEMSP REHIKGKETT 
PSSIAVTQTM ATKAPECVED PDMANQRKTA LEFCGETWSS LCTFPLSIVQ DTSC 

Seq ID NO: 94 DNA sequence 

Nucleic Acid Accession #: NM_012101 

Coding sequences 125-1891 



CTCCTCACAG 
TGCCAGAAAG 
TGCGATGGAA 
CCGGAGCCCG 
TGCCAAGACC 
CCTGAAGCCA 
CATCCAGTTT 
GGA AGGC AAG 
TACCTTTGCC 
GGTGTCCATC 
CCTTTTTTCA 
CAAGCAGAAG 
CAAGCCCCAC 
CTTTGAGGCC 
CCAGACCTGC 
AGTGGAGGAG 
GCTCAAGATC 
CAAGAGCTTC 
GGACCTGGAG 
TGTGGACCAA 
GGACAAGCAG 
ATTTGGTGCA 
GCTGGAGGGG 
ATGCATGCGC 
GAACCACATG 



11 
I 

GTGTGTCTCT 
GTCACCTATC 
GCTG CAGATG 
TCGGGCCCCA 
ACCAACGGGC 
GGGGAAGGTA 
GTCGAGTCCG 
AGGTCGCCGT 
GAAAAGGGCG 
ATGGAGCCCG 
CGGTCCAAGT 
GCGGTCAAGT 
CTGGAGGGCG 
CGCAAGTGT C 
ATCTGCTACC 
GCCAAGGCCG 
ATTGAGATTG 
ACCACCAATG 
AAGCAAAAGG 
GTGAAGGTGA 
ACCCGGGAGC 
TTGATGAGCA 
GAGGGCCTGG 
CACGTTGAGA 
GAGAACGGTG 



21 
I 

AGTCCTCGTG 
CTGAACCCCA 
CCTCCAGGAG 
GTGGCAGCCT 
ACGGCGGGGA 
GGAGCGCCCT 
GGGACGACAA 
ACGCAGGGCT 
ACGTGCGCAA 
GGGAGACCCG 
CCGGCTCCGA 
CCTGCCTGGT 
CCGCCTTCCG 
CCGTGCATGG 
TTTG CATGTT 
AGAAGGAGAC 
AGGATGAAGC 
'AGAAGGCCAT 
AGGAAGTGAG 
TCATGGATGC 
AGCTGCATAG 
ATTACTCTCT 
GACAGTCACT 
AGATGTGCAA 
GTGACCATCG 



31 

1 

GTTGCCTGCC 
GCAAGCCTGA 
CAACGGGTCG 
GGAGAATGGC 
GGCAGCTGAG 
GTTCGCGGGC 
GAACTCCAAC 
CCAGCTGGGG 
GTCCATTTTC 
GCGGAACAGC 
GGAGGTGCTG 
GTGCCAGGCC 
AGACCACCAG 
CAAGACGATG 
CCAGGAGCAC 
GGAGCTGTCA 
TGAGAAGTGG 
CCTGGAGCAG 
GGCTGCGCTG 
TCTGGATGAG 
CATCAGCGAC 
CCCCCCACCC 
AGGCAACTTC 
GGCGGACCTG 
CTATGTGAAC 




CTGCCCACCT 
AAGGACGACC 
AGCCGTAACT 
AACTACACGA 



51 
I 

CCGAGACGCC 
CCAAGCACCC 
CCAGGGATGC 
ACGGCAAGGA 
TGGGCAGCGC 
GGCGACCCAT 
TGGACTCTAT 
AGCCACCCGT 
GGAAGCCCAC 
CCGACACGGG 
GCATCGGCAA 
AGCTGCATCT 
CCATCCGGGA 
GCCAGACCGA 
GCACCGTGAC 
AGCAGCTGCA 
AGGACCGCAT 
ACCTGGTGCG 
AG CAGGATGC 
TGCTGCATGA 
TTCTGCAGGA 
ATCATGTCCT 
TGCTCAATGT 
TCATTGAGAG 
ACAGCTTCGG 



420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



223 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 

85 



WO 02/086443 

GGGTGAGTGG AGTGCACCGQ 
TGGGGTCCGG ACATCATACC 
GAAGAATTTC AACAATCTCT 
CTCCTCCAGC ATTCAGAACT 
CTCCCTGAAA GGCTATCCCT 
TTGGAAATCT GGCAAGCAGA 
CAACGGGATT GGGTCCAACG 
CCCCTGCTCT TCCTCCTGAC 
TGGGAGGGAG CCTGGTCCTG 
GTTCCGGCCT CTCCGACTTC 
TGACCTCAGA TGGTCACCAT 
TAGGTTGGGG CCTGCCCTAA 
CAGTGAGTAC CCGCATGGTA 
CTATAGACGT TTCTCTCCAA 
ACAGCCACCC ATCTCCCATT 
GTGCTCTCTC TCGTCCTACC 
CTGCAGATGG AAACCTCTCA 
ACAGCCACTT TGAGTCTGTG 
TAGCCAAGAT ATTCCTCTGT 
TCCACCCATG CAAATAGCTA 
TTCAGTCTAC ACTTTGGCAT 
GTGCCTTACA CACTGCCCCC 
TGATTACCCC CCATGTTGCA 
AGCAGCACAG TGGGGACATC 
GATGGGCCAG CTTGCAGGGG 
ATAAACCATT GGTCTGTC 



Seq ID NO: 95 Protein sequence: 
Protein Accession #: NP_036233.1 



ACACCATGAA 
AGCCCTCX3TC 
ATGGCACCAA 
CTGACAATGA 
CCCTCATGCG 
CTATGCTGTC 
AAGCCCCATG 
CCTGCTGCTC 
CACCTGCCCT 
CCCACTGGCC 
CATTCCTGTG 
CCCGCCAGCC 
TCAGCCTGCC 
GGCCCTATCC 
CACATGGCCC 
TATCAATGCC 
GTGTCTTGAC 
GTCCCTGGAG 
TCCCTCTGCT 
CTGGCCCAGC 
TCTCTCTGGC 
ACCCTCAGCC 
TATCAGGGTG 
TCCCGTCTCA 
TTGGGGAGGG 



GAGATACTCC 
TCCTGGCCGC 
AGGTAACTAC 
CCTGCCCGTC 
GAGCCAAAGC 
TCACTACCGG 
AGCTCCTGGC 
TTGCCTTCTA 
CTGCAGCCCT 
ACACTCCATT 
CTCAGAGGCC 
TCCTCCTCTC 
TCTCCCGCCC 
CCCAATGTTG 
ACCTCCTGCT 
CAGCATGGCA 
ATCACCCTAC 
GGTGGCTTCT 
GAGATAAAGA 
TACCATTTAC 
GATGGAGTGT 
GTTGCCCCAT 
CTCAAGGATT 
ACAGCCCCAG 
AGACATCCAG 



ATGTACCTGA 
TTCACCAAGG 
ACCTCCCGGG 
GTCCAAGGCA 
CCCAAGGCCC 
CCATTCTACG 
GGAAGGAACG 
AGCTACTGTG 
CTGCCAGCCT 
CAGACTCCTT 
AACCCATCAC 
GGGCTGGATC 
ACGCCCTGCT 
TCAGCAGATG 
TCCCAGAGGA 
GAACCTGCAG 
CCAGGCGGTG 
CCTGACTGGC 
ATTCCCTTAA 
CATTTGCCTA 
GGCTGGGCTG 
CAGAGGCTGC 
GGAGAGGAGA 
GCCTATGGGG 
CTTGGGCTTT 



CACCCAAAGG 
AGACCACCCA 
TCTGGGAGTA 
GCTCCTCCTT 
AGCCCCAGAC 
TCAACAAAGG 
AGGCGCCACA 
CTTGTCTGGG 
CTTGGGGGCA 
TCCTGCCTTG 
AGGGGTGAGA 
TGGGGGCTAG 
GTCTCCAGGC 
CCTGGACAGC 
CTGGCCCTAC 
TGGCCAAGGG 
GGTCTCCACC 
AGGATGACCT 
CATGATATAA 
CAGAATTTCA 
ACCGCAAAAG 
CTCCTCCTTC 
CAAAACCAGG 
GCTCTGGAAG 
CCCCTTTGGA 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



PCTYUS02/12476 



i 

1 

MEAADASRSN 
KPGEGRSAI»F 
FAEKGDVRKS 
QXAVKSCIiVC 
TCICYLCMFQ 
S FTTNEKAI h 
KQTREQLHSI 
MRHVEKMCKA 
VRTSYQPSSP 
LKGYPSLMRS 



11 

I 

GSSPEARDAR 
AGNEWRRPII 
IPSESRKPTV 
QASFCELHLK 
EHKNHSTVTV 
EQNPRDLVRD 
SDSVLFLQEF 
DLSRNFIERN 
GRFTKETTQK 
QSPKAQPQTW 



21 
I 

SPSGPSGSLE 
QFVESGDDKN 
SIMEPGETRR 
PHLEGAAFRD 
EBAXAEKETE 
LEKQKEEVRA 
GALMSNYSLP 
HMENGGDHRY 
NFNNLYGTKG 
KSGKQTMLSH 



31 
I 

NGTKADGKDA 
SNYFSMDSME 
NSYPRADTGL 
HQLLEPIRDF 
LSLQKEQLQL 
ALEQREQDAV 
PFIiPTYHVLL 
VNNYTNSFGG 
NYTSRVWEYS 
YRPFYVNKGN 



41 



KTTNGHGGEA 
GKRSPYAGIiQ 



EARKCPVHGK 
KIIEIEDEAE 
DQVKVIMDAL 
EGEGLGQSLG 
EWSAPDTMKR 
SSIQNSDNDIi 
GIGSNEAP 



51 
I 

AEGKSLGSAL 
LGAAKKPPVT 
VDCDSCIGNK 
TMELFCQTDQ 
KWQKEKDRIK 
DERAKVLHED 
NFKDDLLNVC 
YSMYLTPKGG 



Seq ID NO: 96 DNA sequence 

Nucleic Acid Accession NM_080668.1 

Coding sequence: 83-841 



GGCACGAGGG 
GAGCTCGAGA 
GCGCTCCGGG 
AGGCTCTGAA 
AGTCAGAAAG 
CCAATCACCT 
TGGCAGGGAG 
CAGCACTCCT 
CAGAGACTTG 
CTCTGCCTCT 
GGCAGAAGAC 
GGTTTGTGCA 
GAAACAGAAA 
GGCTGCGGCC 
AGATGCAGTG 
CCTGTGGAGA 
GCTGCGCATG 
TTAGGAAATG 
CCTTCCTATC 
GAATCAGTGG 
CTTGTTGGGA 
CCCTCCTGAG 
TTCAGTTCCG 
GAGGGCAATT 
AAGGATGTAG 
GTTTTCTAGT 
GCTGCTTGGA 
GCTCATGGCC 
TGCAGGGGTG 
ACCGAGTCTA 
CAACTATG CT 
ATAGCAATTT 
CTCATGATCT 
TTTGGATTTG 
TTTGATGTTT 
CAGAGAATGC 
AGGCGGGCCC 



11 
I 

CAGCGAGTGG 
CGGAGCCTAG 
CCAAGGGCCC 
CTCCCGAGCA 
CCCATCGTCT 
CGCAGGAGCC 
CTTACTAAGG 
GTGCCGAACC 
GAAATGTCTA 
ACCTCCACCC 
TTGTCCGGAG 
AAGCCCTGGG 
CGTAAGAAGA 
ATGAATGCCG 
GGGGGTGCAC 
GGACACTTAG 
AGGACTGTCT 
GGGCCGCCTG 
TCCCCAAAGT 
CCCAGCCAGA 
GGGGTGGCTG 
TTGCCTTCTG 
TGGGTTTCCC 
CTGTCTTGGA 
GGAGCCTTAG 
TCTGGTCTGC 
GCAAAGGGTG 
TCTGGGCTGG 
GAAGGTGGCC 
CTTGGCCCTA 
TGTAAAGTCC 
TAGTTTTTGG 
CTGGAGAATT 
AAGGCTGCCC 
AGAAGTTCGT 
CAGGGAAGAT 
TAATAAAACC 



21 
I 

CCTTCCCGGT 
TTATGTCTGG 
CATCTCCTAC 
TCCTCCCTGA 
TAAAGAGGAT 
CTAGGATTTC 
AGGACCTTTT 
CTGAGGCCGA 
AGAAAGTCAG 
CAGGCCGCCG 
TCTCGCCAGT 
CCCCAGACAT 
AGAAAATGCC 
AGTTTGAAGC 
CTGGCCAGAC 
GGTCCCCTCC 
GCCTTTGAGG 
GCCCAGCCAC 
ACCATAGCCA 
AGTTAAAGGG 
CTTGGAAATA 
TTGGTGGCTT 
CTTTGATTCT 
GGAAGAAGCT 
TTGGGTCTCA 
TGTTGAGATG 
CATTTCAGGG 
TCCCTTGCAC 
CCTCTTGTCA 
GAAGAGAAAG 
TCGGAAAGTT 
CCTCCCTATC 
TCCCCATCTT 
TGCCCCCGAC 
GGAAGTAGAC 
GAGTGCTGGG 
CTCTGCCAGG 



31 
I 

TGGCGCGCGC 
GAGGCGAACG 
TAAGCCTCTG 
AATCTGGCCG 
CGTGGCCCAT 
CTTTTTCTTG 
CAAGACACAC 
GTCCAGCTCC 
GCGTTCCTAC 
GTCCTGCTTT 
GGTGTGCTCC 
GACTCTCCCT 
AGAGAT CTTG 
TGCTGAGCAG 
TCTCCCTCCT 
CCTGGTCTTG 
GCTTGGGCAG 
TCACTGGTGT 
GTTTCCAGAT 
CTGAGGGTTG 
GGCCGAGGGG 
TCTTCTTGAA 
GTAAATAGTC 
GGACATTCAG 
GACCATAAGT 
TTTGGTAAAT 
TGTGGCCACC 
AGGGCCCACG 
CCCATACCCA 
TTGAAGAGTC 
TCCTCGCGTA 
CTCTCACATG 
TCTCTTCTTT 
TCTCCTGCCG 
GCTGAGGTGT 
TCAGGGTACT 
TCTGGGAGTC 



41 

I 

CCGGGGCGGC 
CGGTCCGGAG 
CGGAGGTCCC 
AAGACACCCA 
GCTGTAGAGG 
GAGAAAGAAA 
AGCGTCCCTG 
AAGGAAGGAG 
AGCCGGCTGG 
GGCTTCGAGG 
AAACTCACCG 
GGAATCTCCC 
AAAACGGAGC 
TTTGATCTCC 
GTCCTGTACA 
TTACCTGTGT 
CAGCGGCAGC 
CCTGTCTCTT 
GGGCCACAGA 
AGGTGAGAGG 
CTCTGCCAGC 
CCCACCTGTG 
CCAGAGAGAA 
CCTGTGGAGT 
GTGTACTACA 
GCCAGGTTGA 
AGGTGCTGTG 
CTGGAGTCTT 
TTTCTTACAA 
CCAGACCTAC 
CCAGACAGCG 
AGAACACTGC 
CCATCGTGTG 
CACCCCTGGC 
GCAGAGGAGC 
TGGATGAAAC 
CCAGGCCATC 



51 
I 

GGCGCTGGAG 
GAGCCGCTCA 
AGCGGAAATC 
GTGCGGCTGC 
TCCCAGCTGT 
ACGAGCCCCC 
CCACCCCCAC 
AGCTGGACGC 
AGACCCTGGG 
GGCTGCTGGG 
AGGTCCCCAG 
CACCACCCGA 
TGGATGAGTG 
TGGTTGAATG 
TAGCCACCTC 
GTGTGCTGGT 
CATCTTGGTT 
GTCGTCCTGT 
CTGGGGAGGA 
CACCTCTGCT 
CTCGGCCTCT 
TAAAGAGGTT 
TTCGTGGGCT 
CTGAGTTTTG 
CAGAAGCTGT 
TAGGGCGCTG 
AGTTTCTGTG 
ACCACTCTGC 
AATAAGTTAC 
TAGCATTTTG 
GCGGGGGCTG 
CTGGATGCAT 
GATTCAATAG 
CATTGTACCT 
TGGTGGATAA 
GGTGCAGGCC 
TGCTCAACGC 



60 
120 
180 
240 
300 
360 
420 
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540 
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120 
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300 
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600 
660 
720 
780 
840 
900 
960 
1020 
1080 ' 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



224 



WO 02/086443 

TCTGTGGTTT GTCAGACCTG CAAGCAAGCC CCCTGCTGGG GAAGCCTAGG TGTCCTTGAG 2280 

CTGAACCGCA CTGAAGAACT CTTGTCCTCA CTGGCTGATG CAGCAGAACT CTTGGGAAAT 2340 

GTCTTAGTCC TGCAGAATCA GGAGTCACCA GATGATGCAG AGTTGAGATC ATCATTGCAA 2400 

AGTTCTCTGT TCCTGAGGAA CTAAATTTAA GGAAAAAATG GGATTTTGTT TTAGAGTTGG 2460 
AAAAAAAGCC TGATTAAAGA GTTTCTGCCT GTTAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO t 97 Protein sequence i 
Protein Accession #: NP_542399.1 

1 11 21 31 41 51 

I I I I I I 

MSGHRTRSGG AAQRSGPRAP SPTKPLRRSQ RKSGSELPSI LPEIWPKTPS AAAVRKPIVL 60 

KRIVAHAVEV PAVQSPRRSP RISFFLEKEN EPPGRELTKE DLFKTHSVPA TPTSTPVPKP 120 

EAE8SSKEGE LDARDLEMSK KVRRSYSRLE TLGSASTSTP GRRSCFGFEG LLGAEDLSGV 180 

SPWCSKLTE VPRVCAKPWA PDMTLPGISP PPEKQKRKKK KMPEILKTEL DEWAAAMHAE 240 
FEAAEQFDLL VE 

Seq id no t 98 DNA sequence 

Nucleic Acid Accession #> Eos sequence 

Coding sequence: 58-12444 

1 11 21 31 41 51 

I I I I 1 I 

GGGGCATTTC CGGGTCCGGG CCGAGCGGGC GCACGCGCGG GAGCGGGACT CGGCGGCATG 60 

GCGGGCTCCG GAGCCGGTGT GCGTTGCTCC CTGCTGCGGC TGCAGGAGAC CTTGTCCGCT 120 

GCGGACCGCT GCGGTGCTGC CCTGGCCGGT CAT CAACTG A TCCGCGGCCT GGGGCAGGAA 180 

TGCGTCCTGA GCAGCAGCCC CGCGGTGCTG GCATTACAGA CATCTTTAGT TTTTTCCAGA 240 

GATTTCGGTT TGCTTGTATT TGTCCGGAAG TCACTCAACA GTATTGAATT TCGTGAATGT 300 

AGAGAAGAAA TCCTAAAGTT TTTATGTATT TTCTTAGAAA AAATGGGCCA GAAGATCGCA 360 

CCTTACTCTG TTGAAATTAA GAACACTTGT ACCAGTGTTT ATACAAAAGA TAGAGCTGCT 420 

AAATGTAAAA TTCCAGCCCT GGACCTTCTT ATTAAGTTAC TTCAGACTTT TAGAAGTTCT 4 80 

AGACTCATGG ATGAATTTAA AATTGGAGAA TTATTTAGTA AATTCTATGG AGAACTTGCA 540 

TTGAAAAAAA AAATACCAGA TACAGTTTTA GAAAAAGTAT ATGAGCTCCT AGGATTATTG 600 

GGTGAAGTTC ATCCTAGTGA GATGATAAAT AATGCAGAAA ACCTGTTCCG CGCTTTTCTG 660 

GGTGAACTTA AGACCCAGAT GACATCAGCA GTAAGAGAGC CCAAACTACC TGTTCTGGCA 720 

GGATGTCTGA AGGGGTTGTC CT CACTTCTG TGCAACTTCA CTAAGTCCAT GGAAGAAGAT 780 

CCCCAGACTT CAAGGGAGAT TTTTAATTTT GTACTAAAGG CAATTCGTCC TCAGATTGAT 840 

CTGAAQAGAT ATGCTGTGCC CTCAGCTGGC TTGCGCCTAT TTGCCCTGCA TGCATCTCAG 900 

TTTAGCACCT GCCTTCTGGA CAACTACGTG TCTCTATTTG AAGTCTTGTT AAAGTGGTGT 960 

GCCCACACAA ATGTAGAATT GAAAAAAGCT GCACTTTCAG CCCTGGAATC CTTTCTGAAA 1020 

CAGGTTTCTA ATATGGTGGC GAAAAATGCA GAAATGCATA AAAATAAACT GCAGTACTTT 1080 

ATGGAGCAGT TTTATGGAAT CATCAGAAAT GTG GATTCGA ACAACAAGGA GTTATCTATT 1140 

GCTATCCGTG GATATGGACT TTTTGCAGGA COGTGCAAGG TTATAAACGC AAAAGATGTT 1200 

GACTTCATGT ACGTTGAGCT CATTCAGCGC TGCAAGCAGA TGTTCCTCAC CCAGACAGAC 1260 

ACTGGTGACG ACCGTGTTTA TCAGATGCCA AGCTTCCTCC AGTCTGTTGC AAGCGTCTTG 1320 

CTGTACCTTG ACACAGTTCC TGAGGTGTAT ACTCCAGTTC TGGAGCACCT CGTGGTGATG 1380 

CAGATAGACA GTTTCCCACA GTACAGTCCA AAAATGCAGC TGGTGTGTTG CAGAGCCATA 1440 

GTGAAGGTGT TCCTAGCTTT GGCAGCAAAA GGGCCAGTTC TCAGGAATTG CATTAGTACT 1500 

GTGGTGCATC AGGGTTTAAT CAGAATATGT TCTAAACCAG TGGTCCTTCC AAAGGGCCCT 1560 

GAGTCTGAAT CTGAAGACCA CCGTGCTTCA GGGGAAGTCA GAACTGGCAA ATGGAAGGTG 1620 

CCCACATACA AAGACTACGT GGATCTCTTC AGACATCTCC TGAGCTCTGA CCAGATGATG 1680 

GATTCTATTT TAGCAGATGA AGCATTTTTC TCTGTGAATT CCTCCAGTGA AAGTCTGAAT 1740 

CATTTACTTT ATGATGAATT TGTAAAATCC GTTTTGAAGA TTGTTGAGAA ATTGGATCTT 1800 

ACACTTGAAA TACAGACTGT TGGGGAACAA GAGAATGGAG ATGAGGCGCC TGGTGTTTGG 1860 

ATGATCCCAA CTTCAGATCC AGCGGCTAAC TTGCATCCAG CTAAACCTAA AGATTTTTCG 1920 

GCTTTCATTA ACCTGGTGGA ATTTTGCAGA GAGATTCTCC CTGAGAAACA AG CAGAATTT 1980 

TTTGAACCAT GGGTGTACTC ATTTTCATAT GAATTAATTT TGCAATCTAC AAGGTTGCCC 2040 

CTCATCAGTG GTTTCTACAA ATTGCTTTCT ATTACAGTAA GAAATGCCAA GAAAATAAAA 2100 

TATTTCGAGG GAGTTAGTCC AAAGAGTCTG AAACACTCTC CTGAAGACCC AGAAAAGTAT 2160 

TCTTGCTTTG CTTTATTTGT GAAATTTGGC AAAGAGGTGG CAGTTAAAAT GAAGCAGTAC 2220 

AAAGATGAAC TTTTGGCCTC TTGTTTGACC TTTCTTCTGT CCTTGCCACA CAACATCATT 2280 

GAACTCGATG TTAGAGCCTA CGTTCCTGCA CTGCAGATGG CTTTCAAACT GGGCCTGAGC 2340 

TATACCCCCT TGGCAGAAGT AGGCCTGAAT GCTCTAGAAG AATGGTCAAT TTATATTGAC 2400 

AGACATGTAA TGCAGCCTTA TTACAAAGAC ATTCTCCCCT GCCTGGATGG ATACCTGAAG 2460 

ACTTCAGCCT TGTCAGATGA GACCAAGAAT AACTGGGAAG TGTCAGCTCT TTCTCGGGCT 2520 

GCCCAGAAAG GATTTAATAA AGTGGTGTTA AAGCATCTGA AGAAGACAAA GAACCTTTCA 2580 

TCAAACGAAG CAATATCCTT AGAAGAAATA AGAATTAGAG TAGTACAAAT GCTTGGATCT 2640 

CTAGGAGGAC AAATAAACAA AAATCTTCTG ACAGTCACGT CCTCAGATGA GATGATGAAG 2700 

AGCTATGTGG CCTGGGACAG AGAGAAGCGG CTGAGCTTTG CAGTGCCCTT TAGAGAGATG 2760 

AAACCTGTCA TTTTCCTGGA TGTGTTCCTG CCTCGAGTCA CAGAATTAGC GCTCACAGCC 2820 

AGTGACAGAC AAACTAAAGT TGCAGCCTGT GAACTTTTAC ATAGCATGGT TATGTTTATG 2880 

TTGGGCAAAG CCACGCAGAT GCCAGAAGGG GGACAGGGAG CCCCACCCAT GTACCAGCTC 2940 

TATAAGCGGA CGTTTCCTGT GCTGCTTCGA CTTGCGTGTG ATGTTGATCA GGTGACAAGG 3 000 

CAACTGTATG AGCCACTAGT TATGCAGCTG ATTCACTGGT TCACTAACAA CAAGAAATTT 3 050 

GAAAGTCAGG AT ACTGTTG C CTTACTAGAA GCTATATTGG ATGGAATTGT GGACCCTGTT 3120 

GACAGTACTT TAAGAGATTT TTGTGGTCGG TGTATTCGAG AATTCCTTAA ATGGTCCATT 3180 

AAGCAAATAA CACCACAGCA GCAGGAGAAG AGTCCAGTAA ACACCAAATC GCTTTT CAAG 3240 

CGACTTTATA GCCTTGCGCT TCACCCCAAT GCTTTCAAGA GGCTGGGAGC ATCACTTGCC 3300 

TTTAATAATA TCTACAGGGA ATTCAGGGAA GAAGAGTCTC TGGTGGAACA GTTTGTGTTT 3360 

GAAGCCTTGG TGATATACAT GGAGAGTCTG GCCTTAGCAC ATGCAGATGA GAAGTCCTTA 3420 

GGTACAATTC AACAGTGTTG TGATGCCATT GATCACCTAT GCCGCATCAT TGAAAAGAAG 3480 

CATGTTT CTT TAAATAAAGC AAAQAAACGA CGTTTGCCGC GAGGATTTCC ACCTTCCGCA 3540 

TCATTGTGTT TATTGGATCT GGTCAAGTGG CTTTTAGCTC ATTGTGGGAG GCCCCAGACA 3 600 

GAATGTCGAC ACAAATCCAT TGAACTCTTT TATAAATTCG TTCCTTTATT GCCAGGCAAC 3660 

AGATCCCCTA ATTTGTGGCT GAAAGATGTT CTCAAGGAAG AAGGTGTCTC TTTTCTCATC 3720 

AACACCTTTG AGGGGGGTGG CTGTGGCCAG CCCTCGGGCA TCCTGGCCCA GCCCACCCTC 3780 

TTGTACCTTC GGGGGCCATT CAGCCTGCAG G CCACGCT AT GCTGGCTGGA CCTGCTCCTG 3840 
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GCCGCGTTGG AGTGCTACAA CACGTTCATT GGCGAGAGAA CTGTAGGAGC GCTCCAGGTC 3900 

CTAGGTACTG AAGCCCAGTC TTCACTTTTG AAAGCAGTGG CTTTCTTCTT AGAAAGCATT 3 960 

GCCATGCATG ACATTATAGC AGCAGAAAAG TGCTTTGGCA CTGGGGCAGC AGGTAACAGA 4020 

ACAAGCCCAC AAGAGGGAGA AAGGTACAAC TACAGCAAAT GCACCGTTGT GGTCGGGATT 4080 

ATGGAGTTTA CCACGACTCT GCTAAACACC TCCCCGGAAG GATGGAAGCT CCTGAAGAAG 4140 

GACTTGTGTA ATACACACCT GATGAGAGTC CTGGTGCAGA CGCTGTGTGA GCCCGCAAGC 4200 

ATAGGTTTCA ACATCGGAGA CGTCCAGGTT ATGGCTCATC TTCCTGATGT TTGTGTGAAT 4260 

CTGATGAAAG CTCTAAAGAT GTCCCCATAC AAAGATATCC TAGAGACCCA TCTGAGAGAG 432 0 

AAAATAACAG CACAGAGCAT TGAGGAGCTT TGTGCCGTCA ACTTGTATGG CCCTGACGCG 43 80 

CAAGTGGACA GGAGCAGGCT GQCTGCTQTT GTGTCTGCCT GTAAACAGCT TCACAGAGCT 4440 

GGGCTTCTGC ATAATATATT ACCGTCTCAG TCCACAGATT TGCATCATTC TGTTGGCACA 4500 

GAACTTCTTT CCCTGGTTTA TAAAGGCATT GCCCCTGGAG ATGAGAGACA GTGTCTGCCT 4S60 

TCTCTAGACC TCAGTTGTAA GCAGCTGGCC AGCGGACTTC TGGAGTTAGC CTTTGCTTTT 4620 

GGAGGACTGT GTGAGCGCCT TGTGAGTCTT CTCCTGAACC CAGCGGTGCT GTCCACGGCG 4680 

TCCTTGGGCA GCTCACAGGG CAGCGTCATC CACTTCTCCC ATGGGGAGTA TTTCTATAGC 4740 

TTGTTCTCAG AAACGATCAA CACGGAATTA TTGAAAAATC TGGATCTTGC TGTATTGGAG 4800 

CTCATGCAGT CTTCAGTGGA TAATACCAAA ATGGTGAGTG CCGTTTTGAA CGGCATGTTA 4860 

GACCAGAGCT TCAGGGAGCG AG CAAACCAG AAACACCAAG GACTGAAACT TGCGACTACA 4920 

ATTCTGCAAC ACTGGAAGAA GTGTGATTCA TGGTGGGCCA AAGATTCCCC TCTCGAAACT 4980 

AAAATGGCAG TGCTGGCCTT ACTGGCAAAA ATTTTACAGA TTGATTCATC TGTATCTTTT 5040 

AATACAAGTC ATGGTTCATT CCCTGAAGTC TTTACAACAT ATATTAGTCT ACTTGCTGAC 5100 

ACAAAGCTGG ATCTACATTT AAAGGGCCAA GCTGTCACTC TTCTTCCATT CTTCACCAGC 5160 

CTCACTGGAG GCAGTCTGGA GGAACTTAGA CGTGTTCTGG AGCAGCTCAT CGTTGCTCAC 5220 

TTCCCCATGC AGTCCAGGGA ATTTCCTCCA GGAACTCCGC GGTTCAATAA TTATGTGGAC 5280 

TGCATGAAAA AGTTTCTAGA TGCATTGGAA TTATCTCAAA GCCCTATGTT GTTGGAATTG 5340 

ATGACAGAAG TTCTTTGTCG GGAACAGCAG CATGTCATGG AAGAATTATT TCAATCCAGT 5400 

TTCAGGAGGA TTG CCAGAAG GGGTTCATGT GTCACACAAG TAGGCCTTCT GGAAAGCGTG 5460 

TATGAAATGT TCAGGAAGGA TGACCCCCGC CTAAGTTTCA CACGCCAGTC CTTTGTGGAC 5520 

CGCTCCCTCC TCACTCTGCT GTGGCACTGT AGCCTGGATG CTTTGAGAGA ATTCTTCAGC 5580 

ACAATTGTGG TGGATGCCAT TGATGTGTTG AAGTCCAGGT TTACAAAGCT AAATGAATCT 5640 

ACCTTTGATA CTCAAATCAC CAAGAAGATG GGCTACTATA AGATTCTAGA CGTGATGTAT 5700 

TCTCGCCTTC CCAAAGATGA TGTTCATGCT AAGGAATCAA AAATTAATCA AGTTTTCCAT 5760 

GGCTCGTGTA TTACAGAAGG AAATGAACTT ACAAAGACAT TGATTAAATT GTGCTACGAT 5820 

GCATTTACAG AGAACATGGC AGGAGAGAAT CAGCTGCTGG AGAGGAGAAG ACTTTACCAT 5880 

TGTGCAGCAT ACAACTGCGC CATATCTGTC ATCTGCTGTG TCTTCAATGA GTTAAAATTT 5940 

TACCAAGGTT TTCTGTTTAG TGAAAAACCA GAAAAGAACT TGCTTATTTT TGAAAATCTG 6000 

ATCGACCTGA AGCGCCGCTA TAATTTTCCT GTAGAAGTTG AGGTTCCTAT GGAAAGAAAG 6060 

AAAAAGTACA TTGAAATTAG GAAAGAAGCC AGAGAAGCAG CAAATGGGGA TTCAGATGGT 6120 

CCTTCCTATA TGTCTTCCCT GTCATATTTG GCAGACAGTA CCCTGAGTGA GGAAATGAGT 6180 

CAATTTGATT TCTCAACCGG AGTTCAGAGC TATTCATACA GCTCCCAAGA CCCTAGACCT 6240 

GCCACTGGTC GTTTTCGGAG ACGGGAGCAG CGGGACCCCA CGGTGCATGA TGATGTGCTG 6300 

GAGCTGGAGA TGGACGAGCT CAATCGGCAT GAGTGCATGG CGCCCCTGAC GGCCCTGGTC 6360 

AAGCACATGC ACAGAAGCCT GGGCCCGCCT CAAGGAGAAG AGGATTCAGT GCCAAGAGAT 6420 

CTTCCTTCTT GGATGAAATT CCTCCATGGC AAACTGGGAA ATCCAATAGT ACCATTAAAT 6480 

ATCCGTCTCT TCTTAGCCAA GCTTGTTATT AATACAGAAG AGGTCTTTCG CCCTTACGCG 6540 

AAGCACTGGC TTAGCCCCTT GCTGCAGCTG GCTGCTTCTG AAAACAATGG AGGAGAAGGA 660 0 

ATTCACTACA TGGTGGTTGA GATAGTGGCC ACTATTCTTT CATGGACAGG CTTGGCCACT 6660 

CCAACAGGGG TCCCTAAAGA TGAAGTGTTA GCAAATCGAT TGCTTAATTT CCTAATGAAA 6720 

CATGTCTTTC ATCCAAAAAG AGCTGTGTTT AGACACAACC TTGAAATTAT AAAGACCCTT 6780 

GTCGAGTGCT GGAAGGATTG TTTATCCATC CCTTATAGGT TAATATTTGA AAAGTTTTCC 6840 

GGTAAAGATC CTAATTCTAA AGACAACTCA GTAGGGATTC AATTGCTAGG CATCGTGATG 6900 

GCCAATGACC TGCCTCCCTA TGACCCACAG TGTGGCATCC AGAGTAGCGA ATACTTCCAG 6960 

GCTTTGGTGA ATAATATGTC C T TTGTAAGA TATAAAGAAG TGTATGCCGC TGCAGCAGAA 7020 

GTTCTAGGAC TTATACTTCG ATATGTTATG GAGAGAAAAA ACATACTGGA GGAGTCTCTG 7080 

TGTGAACTGG TTGCGAAACA ATTGAAGCAA CATCAGAATA CTATGGAGGA CAAGTTTATT 7140 

GTGTGCTTGA ACAAAGTGAC CAAGAGCTTC CCTCCTCTTG CAGACAGGTT CATGAATGCT 7200 

GTGTTCTTTC TGCTGCCAAA ATTTCATGGA GTGTTGAAAA CACTCTGTCT GGAGGTGGTA 7260 

CTTTGTCGTG TGGAGGGAAT GACAGAGCTG TACTTCCAGT TAAAGAGCAA GGACTTCGTT 7320 

CAAGTCATGA GACATAGAGA TGATGAAAGA CAAAAAGTAT GTTTGGACAT AATTTATAAG 7380 

ATGATGCCAA AGTTAAAACC AGTAGAACTC CGAGAACTTC TGAACCCCGT TGTGGAATTC 7440 

GTTTCCCATC CTTCTACAAC ATGTAGGGAA CAAATGTATA ATATTCTCAT GTGGATTCAT 7500 

GATAATTACA GAGATCCAGA AAGTGAGACA GATAATGACT CCCAGGAAAT ATTTAAGTTG 7560 

GCAAAAGATG TGCTGATTCA AGGATTGATC GATGAGAACC CTGGACTTCA ATTAATTATT 7620 

CGAAATTTCT GGAGCCATGA AACTAGGTTA CCTTCAAATA CCTTGGACCG GTTGCTGGCA 7680 

CTAAATTCCT TATATTCTCC TAAGATAGAA GTGCACTTTT TAAGTTTAGC AACAAATTTT 7740 

CTGCTCGAAA TGACCAGCAT GAGCCCAGAT TATCCAAACC CCATGTTCGA GCATCCTCTG 7800 

TCAGAATGCG AATTTCAGGA ATATACCATT GATTCTGATT GGCGTTTCCG AAGTACTGTT 7860 

CTCACTCCGA TGTTTGTGGA GACCCAGGCC TCCCAGGGCA CTCTCCAGAC CCGTACCCAG 7920 

GAAGGGTCCC TCTCAGCTCG CTGGCCAGTG GCAGGGCAGA TAAGGGCCAC CCAGCAGCAG 7980 

CATGACTTCA CACTGACACA GACTGCAGAT GGAAGAAGCT CATTTGATTG GCTGACCGGG 8040 

AGCAGCACTG ACCCGCTGGT CGACCACACC AGTCCCTCAT CTGACTCCTT GCTGTTTGCC 8100 

CACAAGAGGA GTGAAAGGTT ACAGAGAGCA CCCTTGAAGT CAGTGGGGCC TGATTTTGGG 8160 

AAAAAAAGGC TGGGCCTTCC AGGGGACGAG GTGGATAACA AAGTGAAAGG TGCGGCCGGC 8220 

CGGACGGACC TACTACGACT GCGCAGACGG TTTATGAGGG ACCAGGAGAA GCTCAGTTTG 8280 

ATGTATGCCA GAAAAGGCGT TGCTGAGCAA AAACGAGAGA AGGAAATCAA GAGTGAGTTA 8340 

AAAATGAAGC AGGATGCCCA GGTCGTTCTG TACAGAAGCT ACCGGCACGG AGACCTTCCT 8400 

GACATTCAGA TCAAGCACAG CAGCCTCATC ACCCCGTTAC AGGCCGTGGC CCAGAGGGAC 8460 

CCAATAATTG CAAAACAGCT CTTTAGCAGC TTGTTTTCTG GAATTTTGAA AGAGATGGAT 8520 

AAATTTAAGA CACTGTCTGA AAAAAACAAC ATCACTCAAA AGTTGCTTCA AGACTTCAAT 8580 

CGTTTTCTTA ATACCACCTT CTCTTTCTTT CCACCCTTTG TCTCTTGTAT TCAGGACATT 8640 

AGCTGTCAGC ACGCAGCCCT GCTGAGCCTC GACCCAGCGG CTGTTAGCGC TGGTTGCCTG 8700 

GCCAGCCTAC AGCAGCCCGT GGGCATCCGC CTGCTAGAGG AGGCTCTGCT CCGCCTGCTG 8760 

CCTGCTGAGC TGCCTGCCAA GCGAGTCCGT GGGAAGGCCC GCCTCCCTCC TGATGTCCTC 8820 

AGATGGGTGG AGCTTGCTAA GCTGTATAGA TCAATTGGAG AATACGACGT CCTCCGTGGG 8880 

ATTTTTACCA GTGAGATAGG AACAAAGCAA AT CACTCAGA GTGCATTATT AGCAGAAGCC 8940 

AGAAGTGATT ATTCTGAAGC TGCTAAGCAG TATGATGAGG CTCTCAATAA ACAAGACTGG 9000 

GTAGATGGTG AGCCCACAGA AGCCGAGAAG GATTTTTGGG AACTTGCATC CCTTGACTGT 9060 
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TACAACCACC TTGCTGAGTG GAAATCACTT GAATACTGTT CTACAGCCAG TATAGACAGT 9120 
GAGAACCCCC CAGACCTAAA TAAAATCTGG AGTGAACCAT TTTATCAGGA AACATATCTA 9180 
CCTTACATGA TCCGCAGCAA GCTGAAGCTG CTGCTCCAGG GAGAGGCTGA CCAGTCCCTG 9240 
CTGACATTTA TTGACAAAGC TATGCACGGG GAGCTCCAGA AGGCGATTCT AGAGCTTCAT 9300 
TACAGTCAAG AGCTGAGTCT GCTTTACCTC CTGCAAGATG ATGTTGACAG AGCCAAATAT 9360 
TACATTCAAA ATGGCATTCA GAGTTTTATG CAGAATTATT CTAGTATTGA TGTCCTCTTA 9420 
CACCAAAGTA GACTCACCAA ATTGCAGTCT GTACAGGCTT TAACAGAAAT TCAGGAGTTC 9480 
ATCAGCTTTA TAAGCAAACA AGGCAATTTA TCATCTCAAG TTCCCCTTAA GAGACTTCTG 9540 
AACACCTGGA CAAACAGATA TCCAGATGCT AAAATGGACC CAATGAACAT CTGGGATGAC 9600 
ATCATCACAA ATCGATGTTT CTTTCTCAGC AAAATAGAGG AGAAGCTTAC CCCTCTTCCA 9660 
GAAGATAATA GTATGAATGT GGATCAAGAT GGAGACCCCA GTGACAGGAT GGAAGTGCAA 9720 
GAGCAGGAAG AAGATATCAG CTCCCTGATC AGGAGTTGCA AGTTTTCCAT GAAAATGAAG 9780 
ATGATAGACA GTGCCCGGAA GCAGAACAAT TTCTCACTTG CTATGAAACT ACTGAAGGAG 9840 
CTGCATAAAG AGTCAAAAAC CAGAGACGAT TGGCTGGTGA GCTGGGTGCA GAGCTACTGC 9900 
CGCCTGAGCC ACTGCCGGAG CCGGTCCCAG GGCTGCTCTG AGCAGGTGCT CACTGTGCTG 9960 
AAAACAGTCT CTTTGTTGGA TGAGAACAAC GTGTCAAGCT ACTTAAGCAA AAATATTCTG 10020 
GCTTTCCGTG ACCAGAACAT TCTCTTGGGT ACAACTTACA GGATCATAGC GAATGCTCTC 10080 
AGCAGTGAGC CAGCCTGCCT TGCTGAAATC GAGGAGGACA AGGCTAGAAG AATCTTAGAG 10140 
CTTTCTGGAT CCAGTTCAGA GGATTCAGAG AAGGTGATCG CGGGTCTGTA CCAGAGAGCA 10200 
TTCCAGCACC TCTCTGAGGC TGTGCAGGCG GCTGAGGAGG AGGCCCAGCC TCCCTCCTGG 10260 
AGCTGTGGGC CTGCAGCTGG GGTGATTGAT GCTTACATGA CGCTGGCAGA TTTCTGTGAC 10320 
CAACAGCTGC GCAAGGAGGA AGAGAATGCA TCAGTTATTG ATTCTGCAGA ACTGCAGGCG 10380 
TATCCAGCAC TTGTGGTGGA GAAAATGTTG AAAGCTTTAA AATTAAATTC CAATGAAGCC 10440 
AGATTGAAGT TTCCTAGATT ACTTCAGATT ATAGAACGGT ATCCAGAGGA GACTTTGAGC 10500 
CTCATGACAA AAGAGATCTC TTCCGTTCCC TGCTGGCAGT TCATCAGCTG GATCAGCCAC 10560 
ATGGTGGCCT TACTGGACAA AGACCAAGCC GTTGCTGTTC AGCACTCTGT GGAAGAAATC 10620 
ACTGATAACT ACCCGCAGGC TATTGTTTAT CCCTTCATCA TAAGCAGCGA AAGCTATTOC 10680 
TTCAAGGATA CTTCTACTGG TCATAAGAAT AAGGAGTTTG TGGCAAGGAT TAAAAGTAAG 10740 
TTGGATCAAG GAGGAGTGAT TCAAGATTTT ATTAATGCCT TAGATCAGCT CTCTAATCCT 10800 
GAACTGCTCT TTAAGGATTG GAGCAATGAT GTAAGAGCTG AACTAGCAAA AACCCCTGTA 10860 
AATAAAAAAA ACATTGAAAA AATGTATGAA AGAATGTATG CAGCCTTGGG TGACCCAAAG 10920 
GCTCCAGGCC TGGGGGCCTT TAGAAGGAAG TTTATTCAGA CTTTTGGAAA AGAATTTGAT 10980 
AAACATTTTG GGAAAGGAGG TTCTAAACTA CTGAGAATGA AGCTCAGTGA CTTCAACGAC 11040 
ATTACCAACA TGCTACTTTT AAAAATGAAC AAAGACTCAA AGCCCCCTGG GAATCTGAAA 11100 
GAATGTTCAC CCTGGATGAG CGACTTCAAA GTGGAGTTCC TGAGAAATGA GCTGGAGATT 11160 
CCCGGTCAGT ATGACGGTAG GGGAAAGCCA TTGCCAGAGT ACCACGTGCG AATCGCCGGG 11220 
TTTGATGAGC GGGTGACAGT CATGGCGTCT CTGCGAAGGC CCAAGCGCAT CATCATCCGT 11280 
GGCCATGACG AGAGGGAACA CCCTTTCCTG GTGAAGGGTG GCGAGGACCT GCGGCAGGAC 11340 
CAGCGCGTGG AGCAGCTCTT CCAGGTCATG AATGGGATCC TGGCCCAAGA CTCCGCCTGC 11400 
AGCCAGAGGG CCCTGCAGCT GAGGACCTAT AGCGTTGTGC CCATGACCTC CAGGTTAGGA 11460 
TTAATTGAGT GGCTTGAAAA TACTGTTACC TTGAAGGACC TTCTTTTGAA CACCATGTCC 11520 
CAAGAGGAGA AGGCGGCTTA CCTGAGTGAT CCCAGGGCAC CGCCGTGTGA ATATAAAGAT 11580 
TGGCTGACAA AAATGTCAGG AAAACATGAT GTTGGAGCTT ACATGCTAAT GTATAAGGGC 11640 
GCTAATCGTA CTGAAACAGT CACGTCTTTT AGAAAACGAG AAAGTAAAGT GCCTGCTGAT 11700 
CTCTTAAAGC GGGCCTTCGT GAGGATGAGT ACAAGCCCTG AGGCTTTCCT GGCGCTCCGC 11760 
TCCCACTTCG CCAGCTCTCA CGCTCTGATA TGCATCAGCC ACTGGATCCT CGGGATTGGA 11820 
GACAGACATC TGAACAACTT TATGGTGGCC ATGGAGACTG GCGGCGTGAT CGGGATCGAC 11880 
TTTGGGCATG CGTTTGGATC CGCTACACAG TTTCTGCCAG TCCCTGAGTT GATG CCTTTT 11940 
CGGCTAACTC GCCAGTTTAT CAATCTGATG TTACCAATGA AAGAAACGGG CCTTATGTAC 12000 
AGCATCATGG TACACGCACT CCGGGCCTTC CGCTCAGACC CTGGCCTGCT CACCAACACC 12060 
ATGGATGTGT TTGTCAAGGA GCCCTCCTTT GATTGGAAAA ATTTTGAACA GAAAATGCTG 12120 
AAAAAAGGAG GGTCATGGAT TCAAGAAATA AATGTTGCTG AAAAAAATTG GTACCCCCGA 12180 
CAGAAAATAT GTTACGCTAA GAGAAAGTTA GCAGGTGCCA ATCCAGCAGT CATTACTTGT 12240 
GATGAGCTAC TCCTGGGTCA TGAGAAGGCC CCTGCCTTCA GAGACTATGT GGCTGTGGCA 12300 
CGAGGAAGCA AAGATCACAA CATTCGTGCC CAAGAACCAG AGAGTGGGCT TT CAGAAG AG 12360 
ACTCAAGTGA AGTGCCTGAT GGACCAGGCA ACAGACCCCA ACATCCTTGG CAGAACCTGG 12420 
GAAGGATGGG AGCCCTGGAT GTGAGGTCTG TGGGAGTCTG CAGATAGAAA GCATTACATT 12480 
GTTTAAAGAA TCTACTATAC TTTGGTTGGC AGCATTCCAT GAGCTGATTT TCCTGAAACA 12540 
CTAAAGAGAA ATGTCTTTTG TGCTACAGTT TCGTAGCATG AGTTTAAATC AAGATTATGA 12600 
TGAGTAAATG TGTATGGGTT AAATCAAAGA TAAGGTTATA GTAACATCAA AG ATT AGGTG 12660 
AGGTTTATAG AAAGATAGAT ATCCAGGCTT ACCAAAGTAT TAAGTCAAGA ATATAATATG 12720 
TGATCAGCTT TCAAAGCATT TACAAGTGCT GCAAGTTAGT GAAACAGCTG TCTCCGTAAA 12780 
TGGAGGAAAT GTGGGGAAGC CTTGGAATGC CCTTCTGGTT CTGGCACATT GGAAAGCACA 12840 
CTCAGAAGGC TTCATCACCA AGATTTTGGG AGAGTAAAGC TAAGTATAGT TGATGTAACA 12900 
TTGTAGAAGC AGCATAGGAA CAATAAGAAC AATAGGTAAA GCTATAATTA TGGCTTATAT 12960 
TTAGAAATGA CTGCAT TTGA TATTTTAGGA TATTTTTCTA GGTTTTTTCC TTTCATTTTA 13020 
TTCTCTTCTA GTT TTGA CAT TTTATGATAG ATTTGCTCTC TAGAAGGAAA CGTCTTTATT 13080 
TAGGAGGGCA AAAATTTTGG TCATAGCATT CACTTTTGCT ATTCCAATCT ACAACTGGAA 13140 
GATACATAAA AGTGCTTTGC ATTGAATTTG GGATAACTTC AAAAATCCCA TGGTTGTTGT 13200 
TAGGGATAGT ACTAAG CATT TCAGTTCCAG GAGAATAAAA GAAATTCCTA TTTGAAATGA 13260 
ATTCCTCATT TGGAGGAAAA AAAGCATGCA TTCTAGCACA ACAAGATGAA ATTATGGAAT 13320 
ACAAAAGTGG CTCCTTCCCA TGTGCAGTCC CTGTCCCCCC CCGCCAGTCC TCCACACCCA 13380 
AACTGTTTCT GATTGGCTTT TAGCTTTTTG TTGTTTTTTT TTTTCCTTCT AACACTTGTA 13440 
TTTGGAGGCT CTTCTGTGAT TTTGAGAAGT ATACTCTTGA GTGTTTAATA AAGTTTTTTT 13500 
CCAAAAGTA 

Seq ID NO: 99 Protein sequence: 
Protein Accession #: NP_008835.5 

1 11 21 31 41 51 

1 I I I I I 

MAGSGAGVRC SLLRLQETLS AADRCGAALA GHQLIRGLGQ ECVLSSSPAV LALQTSIiVFS 60 
RDFGLLVFVR KSLNS I E PRE CREE1LKFLC IPLEKMGQKI APYSVEIKNT CTSWTKDRA 120 
AKCKIPALOL LIKLLQTFRS SRLMDEFKIG ELFSKFYGEL ALKKKIPDTV LEKVYELLGL 180 
LGEVHPSEMI NNAENLPRAF LGELKTQMTS AVREPKLPVL AGCLKGLSSL LCNFTKSMEE 240 
DPQTSREIFN FVLKAIRPQI DLKRYAVPSA GLRLFALHAS QFSTCLLDNY VSLFEVLLKW 300 
CAHTNVELKK AALSALESFL, KQVSNMVAKN AEMHKNKLQY FMEQFYGIIR NVDSNNKELS 360 
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IAIRGYGLFA GPCKVINAKD VDPMYVELIQ RCKQMPLTQT DTGDDRVYQM PSFLQSVASV 420 

LLYLDTVPEV YTPVLEHLW MQIDSPPQYS PKMQLVCCRA IVKVFLALAA KGPVLRNCIS 480 

TWHQGLIRI CSKPWLPKG PESESEDHRA SGEVRTGKWK VPTYKDYVDL FRHLLSSDQM 540 

MDSILADEAF FSVNSSSESL NHLLYDEFVK SVLKIVEKLD LTLEIQTVGE QENGDEAPGV 600 

WMIPTSDPAA NLHPAKPKDF SAPINLVEPC REILPEKQAE FFEPWVYSFS YELILQSTRL 660 

PLISGFYKLL SITVRNAKKI KYPEGVSPKS LKHSPEDPEK YSCPALFVKF GKEVAVKMKQ 720 

YKDEIiLASCL TFLLSLFHNI IELDVRAYVP ALQMAPKLGL SYTPLAEVGL NALEEWSIYI 780 

DRHVMQPYYK DILPCLDGYL KTSALSDETK NNWEVSALSR AAQKGFNKW LKHLKKTKNL 840 

SSNEAISLEE IRIRWQMLG SLGGQINKNL LTVTSSDEMM KSYVAWDREK RLSFAVPFRE 900 

MKPVIFLDVP LPRVTELALT ASDRQTKVAA CELLHSMVMP MLGKATQMPE GGQGAPPMYQ 960 

LYXRTFPVLL RLACDVDQVT RQLYEPLVMQ LIHWFTNNKK FESQDTVALL EAILDGIVDP 1020 

VDSTLRDFCG RCIREPLKWS IKQITPQQQE KSPVNTKSLF KRLYSLALHP NAFKRLGASL 1080 

APNNIYREFR EEESLVEQFV FEALVIYMES LALAHADEKS LGTIQQCCDA IDHLCRIIEK 1140 

KHVSLNKAKK RRLPRGFPPS AS LCLLDLVK WLLAHCGRPQ TECRHKSIEL PYKPVPLLPG 1200 

NRSPNLWLKD VLKEEGVSFL INTFEGGGCG QPSGILAQPT LLYLRGPFSL QATLCWLDLL 1260 

LAALECYNTF IGERTVGALQ VLGTEAQSSL LKAV APPLES IAMHDIIAAE KCFGTGAAGN 1320 

RTSPQEGERY NYSKCTVWR IMEFTTTLLN TSPEGWKLLK KDLCNTHLMR VLVQTLCEPA 1380 

SIGFNIGEVQ VMAHLFDVCV NLMKALKMSP YKDILETHLR EKITAQSIEE LCAVNLYGPD 1440 

AQVDRSRLAA WSACKQLHR AGLLHNILPS QSTDLHHSVG TELLSLVYKG IAPGDERQCL 1500 

PSLDLSCKQL ASGIiLELAFA FGGLCERLVS LLLNP AVL ST ASLGSSQGSV IHFSHGEYFY 1560 

SLFSETINTE LLKNLDLAVL ELMQSSVDNT KMVSAVLNGM LDQSFRERAN QKHQGLKLAT 1620 

TILQHWKKCD SWWAKDSPLE TKMAVLALLA KILQIDSSVS FNTSHGSFPE VFTTYISLLA 1680 

DTKLDLHLKG QAVTLLPFFT SLTGGSLEEL RRVLEQLIVA HFPMQSREFP PGTPRFNNYV 1740 

DCMKKFLDAL ELSQSPMLLE LMTEVLCREQ QHVMEELFQS SFRRIARRGS CVTQVGLLES 1800 

VYEMFRKDDP RLSFTRQSFV DRSLLTLLWH CSLDALREFF STIWDAIDV LKSRFTKLNE 1860 

STFDTQ1TKK MGYYKILDVM YSRLPKDDVH AKESKINQVP HGSCITEGNE LTKTLIKLCY 1920 

DAFTENMAGE NQLLERRRLY HCAAYNCAIS VICCVFNELK PYQGFLFSEK PEKNLLIFEN 1980 

LIDLKRRYNF PVEVEVPMER KKKYIEIRKE AREAANGDSD GPSYMSSLSY LAD STL SEEM 2040 

SQFDFSTGVQ SYSYSSQDPR PATGRPRRRE QRDPTVHDDV LELEMDELNR HECMAPLTAL 2100 

VKHMHRSLGP PQGEEDSVPR DLPSWMKFLH GKLGNPIVPL NIRLPLAKLV INTEEVFRPY 2160 

AKHWLSPLLQ LAASENNGGE GIHYMWEIV ATI LSWTGLA TPTGVPKDEV LANRLLNFLM 2220 

KHVFHPKRAV FRHNLEIIKT LVECWKDCLS IPYRLIPEKF SGKDPNSKDN SVGIQLLGIV 2280 

MANDLPPYDP QCGIQSSEYF QALVNNMSPV RYKEVYAAAA EVLGL I LRYV MERKNILEES 2340 

LCELVAKQLK QHQNTMEDKF IVCLNKVTKS FPPLADRFMN AVFFLLPKFH GVLKTLCLEV 2400 

VLCRVEGMTE LYFQLKSKDF VQVMRHRDDE RQKVCLDIIY KMMPKLKPVE LRELLNPWE 2460 

FVSHPSTTCR EQMYNILMWI HDNYRDPESE TDNDSQEIFK LAKDVLIQGL IDENPGLQLI 2520 

rRNFWSHETR LPSMTLDRLL ALNSLYSPKI EVHFLSLATN FLLEMTSMSP DYPNPMFEHP 2580 

LSECEFQEYT IDSDWRFRST VLTPMFVETQ ASQGTLQTRT QEGSLSARWP VAGQIRATQQ 2640 

QHDFTLTQTA DGRSSFDWLT GSSTDPLVDH TSPSSDSLLF AHKRSERLQR APLKSVGPDF 2700 

GKKRLGLPGD EVDNKVKGAA GRTDLLRLRR RFMRDQEKLS LMYARKGVAE QKREKEIKSE 2760 

LKMKQDAQW LYRSYRHGDL PDIQIKHSSL ITPLQAVAQR DPIIAKQLFS SLFSGILKEM 2820 

DKFKTLSEKN NITQKLLQDF NRFLNTTFSF FPPFVSCIQD ISCQHAALL9 LDPAAVSAGC 2880 

LASLQQPVGI RLLEEALLRL LPAELPAKRV RGKARLPPDV LRWVELAKLY RSIGEYDVLR 2 940 

GIFTSEIGTK QITQSALLAE ARSDYSEAAK QYDEALNKQD WVDGEPTEAE KDFWELASLD 3000 

CYNHLAEWKS LEYCSTASID SENPPDLNKI WSEPFYQETY LPYMIRSKLK LLLQGEADQS 3060 

LLTFIDKAMH GELQKAILEL HYSQELSLLY LLQDDVDRAK YYIQNGIQSP MQNYSSIDVL 3120 

LHQSRLTKLQ SVQALTEIQE FISFISKQGN LSSQVPLKRL LNTWTNRYPD AKMDPMNIWD 3180 

DIITNRCFFL SKIEEKLTPL PEDNSMNVDQ DGDPSDRMEV QEQEEDISSL IRSCKFSMKM 3240 

KMIDSARKQN NF S LAMKLLK ELHKES KTRD DWLVSWVQSY CRLSHCRSRS QGCSEQVLTV 3300 

LKTVSLLDEN NVSSYLSKNI LAFRDQNILL GTTYRIIANA LSSEPACLAE IEEDKARRIL 3360 

ELSGSSSEDS EKVIAGLYQR AFQHLSEAVQ AAEEEAQPPS WSCGPAAGVI DAYMTLADFC 3420 

DQQLRKEEEN ASVIDSAELQ AYPALWEKM LKALKLNSNE ARLKFPRLLQ IIERYPEETL 3480 

SLMTKEISSV PCWQFISWIS HMVALLDKDQ AVAVQHSVEE ITDNYPQAIV YPFIISSESY 3540 

SFKDTSTGHK NKEFVARIKS KLDQGGVIQD FINALDQLSN PELLFKDWSN DVRAELAKTP 3600 

VNKKNIEKMY ERMYAALGDP KAPGLGAFRR KFIQTPGKEF DKHFGKGGSK LLRMKLSDFN 3660 

DITNMLLLKM NKDSKPPGNL KECSPWMSDF KVEPLRNELE IPGQYDGRGK PLPEYHVRIA 3720 

GFDERVTVMA SliRRPKRIII RGHDEREHPF LVKGGEDLRQ DQRVEQLFQV MNGILAQDSA 3780 

CSQRALQLRT YSWPMTSRL GLIEWLENTV TLKDLLLNTM SQEEKAAYLS DPRAPPCEYK 3840 

DWLTKMSGKH DVGAYMLMYK GANRTETVTS FRKRESKVPA DLLKRAFVRM STSPEAFLAL 3900 

RSHFASSHAL ICISHWILGI GDRHLNNFMV AMETGGVIGI DFGHAFGSAT QFLPVPELMP 3960 

FRLTRQPIWL MLPMKBTGLM YSIMVHALRA FRSDPGLLTN TMDVFVXEPS FDWKNFEQKM 4020 

LKKGGSWIQE INVAEKNWYP RQKICYAKRK LAGANPAVIT CDELLLGHEK APAFRDYVAV 4080 
ARGSKDHNIR AQEPESGLSE ETQVKCLMDQ ATDPN I LGRT WEGWEPWM 

Seq ID NO: 100 DNA sequence 
Nucleic Acid Accession #: NM_000673 
Coding sequence: 101-1225 

1 11 21 31 41 • 51 

I II I 1 I 

ATGTGAAGGC ACAAGCTGCT GTTATATACA ACAGAGTGAA CTGAG CATCA GTCAGAAAAA 60 

GTCTATGTTT GCAGAAATAC AGAT CCAAGA CAAAGACAGG ATGGGCACTG CTGGAAAAGT 120 

TATTAAATGC AAAGCAGCTG TGCTTTGGGA GCAGAAGCAA CCCTTCTCCA TTGAGGAAAT 180 

AGAAGTTGCC CCACCAAAGA CTAAAGAAGT TCGCATTAAG ATTTTGGCCA CAGGAATCTG 240 

TCGCACAGAT GACCATGTGA TAAAAGGAAC AATGGTGTCC AAGTTTCCAG TGATTGTGGG 300 

ACATGAGGCA ACTGGGATTG T AG AGAG CAT TGGAGAAGGA GTGACTACAG TGAAACCAGG 360 

TGACAAAGTC ATCCCTCTCT TTCTGCCACA ATGTAGAGAA TGCAATGCTT GTCGCAACCC 420 

AGATGGCAAC CTTTGCATTA GG AG CO AT AT TACTGGTCGT GGAGTACTGG CTGATGGCAC 480 

CACCAGATTT ACATG CAAGG GCAAACCAGT ACACCACTTC ATGAACACCA GTACATTTAC 540 

CGAGTACACA GTGGTGGATG AATCTTCTGT TGCTAAGATT GATGATGCAG CTCCTCCTGA 600 

GAAAGTCTGT TTAATTGGCT GTGGGTTTTC CACTGGATAT GGCGCTGCTG TTAAAACTGG 660 

CAAGGTCAAA CCTGGTTCCA CTTGCGTCGT CTTTGGCCTG GGAGGAGTTG GCCTGTCAGT 720 

CATCATGGGC TGTAAGTCAG CTGGTGCATC TAGGATCATT GGGATTGACC TCAACAAAGA 780 

CAAATTTGAG AAGGCCATGG CTGTAGGTGC CACTGAGTGT ATCAGTCCCA AGGACTCTAC 840 

CAAACCCATC AGTGAGGTGC TGTCAGAAAT GACAGGCAAC AACGTGGGAT ACACCTTTGA 900 

AGTTATTGGG CATCTTGAAA CCATGATTGA TGCCCTGGCA TCCTGCCACA TGAACTATGG 960 

GACCAGCGTG GTTGTAGGAG TTCCTCCATC AGCCAAGATG CTCACCTATG ACCCGATGTT 1020 
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GCTCTTCACT GGACGCACAT GGAAGGGATG TGTCTTTGGA GGTTTGAAAA GCAGAGATGA 
TGTCCCAAAA CTAGTGACTG AGTTCCTGGC AAAGAAATTT GACCTGGACC AGTTGATAAC 
TCATGTTTTA CCATTTAAAA AAATCAGTGA AGGATTTGAG CTGCTCAATT CAGGACAAAG 
CATTCGAACG GTCCTGACGT TTTGAGATCC AAAGTGGCAG GAGGTCTGTG TTGT CATGGT 
GAACTGGAGT TTCTCTTGTG AGAGTTCCCT CATCTGAAAT CATGTATCTG TCTCACAAAT 
ACAAGCATAA GTAGAAGATT TGTTGAAGAC ATAGAACCCT TATAAAGAAT TATTAACCTT 
TATAAACATT TAAAGTCTTG TGAGCACCTG GGAATTAGTA TAATAACAAT GTTAATATTT 
TTGATTTACA TTTTGTAAGG CTATAATTGT ATCTTTTAAG AAAACATACA CTTGGATTTC 
TATGTTGAAA TGGAGATTTT TAAGAGTTTT AACCAGCTGC TGCAGATATA TAACTCAAAA 
CAGATATAGC GTATAAAGAT ATAGTAAATG CATCTCCCAG AGTAATATTC ACTTAACACA 
TTGAAACTAT TATTTTTTAG ATTTGAATAT AAATGTATTT TTTAAACACT TGTTATGAGT 
TAACTTGGAT TACATTTTGA AATCAGTTCA TTCCATGATG CATATTACTG GATTAGATTA 
AGAAAGACAG AAAAGATTAA GGGACGGGCA CATTTTTCAA CGATTAAGAA TCATCATTAC 
ATAACTTGGT GAAACTGAAA AAGTATATCA TATGGGTACA CAAGGCTATT TGCCAGCATA 
TATTAATATT TTAGAAAATA TTCCTTTTGT AATACTGAAT ATAAACATAG AGCTAGAGTC 
AT ATT AT CAT ACTTATCATA ATGTT CAATT TGATACAGTA GAATTGCAAG TCCGTAAGTC 
CCTATTCACT GTGCTTAGTA GTGACTCCAT TTAATAAAAA GTGTTTTTAG TTTTTAACAA 
CTAAACCG 

Seq ID NO: 101 Protein sequence: 
Protein Accession tt: NP_000664 

1 11 21 31 41 51 

I | I I I I 

MGTAGKVIKC KAAVLWEQKQ PFSIEEIEVA PPKTKEVRIK ILATGICRTD DHVIKGTMVS 60 

KPPVIVGHBA TGIVESIGEG VTTVKPGDKV IPLFLPQCRE CNACRNPDGN LCIRSDITGR 120 

GVLADGTTRF TCKGKPVHHF MNT9TPTEYT WDESSVAKI DDAAPPEKVC IiIGCGFSTGY 180 

GAAVKTGKVK PGSTCWFGI* GGVGLSVIMG CKSAGASRII GIDIjNKDKFE KAMAVGATEC 240 

ISPKDSTKPI SEVLSEMTGN NVGYTFEVIG HLETMIDALA SCHMNYGTSV WGVPPSAKM 300 

LTYDPMLLFT GRTWKGCVFG GLKSRDDVPK LVTEFLAXKF DLDQLITHVL PFKKISEGPE 360 
LLNSGQSIRT VLTF 

Seq ID NO: 102 DNA sequence 

Nucleic Acid Accession NM_006783.1 

Coding sequence : 1 . . 786 

1 11 21 31 41 51 

i I I I I 1 

ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 60 
GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC 120 
CAGGAAGTGT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA ACCGGGATGC 180 
AAAAATGTGT GCTATGACCA CTTTTTCCCG GTGTCCCACA TCCGGCTGTG GGCCCTCCAG 240 
CTGATCTTCG TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 300 
GAAACCACTC GCAAGTTCAG GCGAGGAGAG AAGAGGAATG ATTTCAAAGA CATAGAGGAC 360 
ATTAAAAAGC ACAAGGTTCG GATAGAGGGG TCGCTGTGGT GGACGTACAC CAGCAGCATC 420 
TTTTTCCGAA TCATCTTTGA AGCAGCCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 480 
TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 540 
TTTATTTCTA GGCCAACAGA GAAGACCGTG TTTACCATTT TTATGATTTC TGCGTCTGTG 600 
ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG 660 
AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GGAGAGTAAG 720 
CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AGGTTTCCCA 780 
AGCTAA 

Seq ID NO: 103 Protein sequence: 
Protein Accession #: NP_006774.1 

1 11 21 31 41 51 

I I I I I I 

MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILWAA QBVWGDEQED FVCNTLQPGC 60 
KNVCYDHFPP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 120 
IKKHKVRIEG SLWWTYTSSI FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 180 
FISRPTEKTV FTIFMISASV ICMLLNVAEb CYLLLKVCFR RSKRAQTQKN HPNHALKESK 240 
QNEMNELISD SGQNAITGFP S 

Seq ID NO: 104 DNA sequence 
Nucleic Acid Accession #: NM_020411 
Coding sequence: 86-526 

1 11 21 31 41 51 

I I I I I I 

GGACCTGGGA AGGAGCATAG GACAGGGCAA GGCGGGATAA GGAGGGGCAC CACAGCCCTT 60 
AAGGCACGAG GGAACCTCAC TGCGCATGCT CCTTTGGTGC CCACCTCAGT GCGCATGTTC 120 
ACTGGGCGTC TTCCCATCGG CCCCTTCGCC AGTGTGGGGA ACGCGGCGGA GCTGTGAGCC 180 
GGCGACTCGG GTCCCTGAGG TCTGGATTCT TTCTCCGCTA CTGAGACACG GCGGACACAC 240 
ACAAACACAG AACCACACAG CCAGTCCCAG GAGCCCAGTA ATGGAGAGCC CCAAAAAGAA 300 
GAACCAGCAG CTGAAAGTCG GGATCCTACA CCTGGGCAGC AGACAGAAGA AGATCAGGAT 360 
ACAGCTGAGA TCCCAGTGCG CGACATGGAA GGTGATCTGC AAGAGCTGCA TCAGTCAAAC 420 
ACCGGGGATA AATCTGGATT TGGGTTCCGG CGTCAAGGTG AAGATAATAC CTAAAGAGGA 480 
ACACTGTAAA ATGCCAGAAG CAGGTGAAGA GCAACCACAA GTTTAAATGA AGACAAGCTG 540 
AAACAACGCA AGCTGGTTTT ATATTAGATA TTTGACTTAA ACTATCTCAA TAAAGTTTTG 600 
CAGCTTTCAC CAAAAAAAAA AAAAAA 



Seq ID NO: 105 Protein sequence: 
Protein Accession #: NP_065144.l 

1 li 21 31 41 51 
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10 
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r i i i i i 

MLLWCPPQCA CSLGVFPSAP SPVWGTRRSC EPATRVPEVW ILSPLLRHGG HTQTQNHTAS 
PRSPVMESPK KXNQQLKVGI LHLGSRQKKI RIQLRSQCAT WKVICKSCIS QTPGINLDLG 
SGVKVKIIPK EEHCKMPEAG EEQPQV 

Seq ID NO: 106 DNA sequence 
Nucleic Acid Accession #: J04129 
Coding sequence: 99-587 



PCT/US02/12476 



i 
I 

CATCCCTCTG 
TCACCCTGGG 
AGGACCTGGA 
ACATCTCCCT 
CCACCCCCGA 
AGAAGAAGGT 
TGGCGAACGA 
AGGACACCAC 
AGGACGATGA 
GGTACTTGCT 
CCAGGAAGAC 
TTTCAAAGAA 
TCCTGCTGCA 
GCAGAGGTTA 



11 
I 

GCTCCAGAGC 
CGTGGCCCTG 
GCTCCCAAAG 
CATGGCGACA 
GGACAACCTG 
CCTTGGAGAG 
GGCCACGCTG 
CACCCCCATC 
GATCATGCAG 
GGACTTGAAA 
CAGACTCCCA 
TAACCACAGC 
CACCTGCACC 
TTAATAAACC 



21 

I 

TCAGAGCCAC 
GTCTGTGGTG 
TTGGCAGGGA 
CTGAAGGCCC 
GAGATCGTTC 
AAGACTGGGA 
CTCGATACTG 
CAGAGCATGA 
GGATTCAT CA 
CAGATGGAAG 
CCCTTCCACA 
TCAGAAGACG 
ATTGCCATGG 
CTTGGAGCAT 



31 

I 

CCACAGCCGC 
TCCCGGCCAT 
CCTGGCACTC 
CTCTGAGGGT 
TGCACAGATG 
ATCCAAAGAA 
ACTACGACAA 
TGTGCCAGTA 
GGGCTTTCAG 
AGCCGTGCCG 
CCTCCAGAGC 
ATGACGTGGT 
GGAGGCTGCT 
G 



41 

I 

AGCCATGCTG 
GGACATCCCC 
CATGGCCATG 
CCACATCACC 
GGAGAACAAC 
GTTCAAGATC 
TTTCCTGTTT 
CCTGGCCAGA 
GCCCCTGCCC 
TTTCTAGCTC 
AGTGGGACTT 
CATCTGTGTC 
CCCTGGGGGC 



51 

I 

TGCCTCCTGC 
CAGACCAAGC 
GCGACCAACA 
TCACTGTTGC 
AGCTGTGTTG 
AACTATACGG 
CTCTGCCTAC 
GTCCTGGTGG 
AGGCACCTAT 
ACCTCCGCCT 
CCTCCTGCCC 
GCCATCCCCT 
AGAGTCTCTG 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO: 107 Protein sequence: 
Protein Accession <h AAA60147 

1 li 21 31 41 51 

I I I I I I 

MDIPQTKQDI* ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR 
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLPLCLQDT TTPIQSMMCQ 
YIARVLVEDD EIMQGFIRAF RFliPRHLWYL LDLKQMEEPC RP 

Seq ID NO: 108 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 48-794 



TCCCAGGCAG 
GTCTGATCCA 
TCATGAAAGG 
CAGTAG CCTA 
TTGAGCAGAA 
GGGAGAAGGT 
GCCACCTCAT 
GTGACTACTA 
ACTCAGCCCG 
CCAACCCCAT 
ACAGCCCCGA 
TGCACACCCT 
ACAACCTGAC 
AGCCCCAGAG 
TGCCGAGAGG 
CTCCAAAGGG 
CACTCTTCTT 
CGCACCCGCT 
CTGCCCCTGC 
GGACAGTGGC 
CGCGCGCGCC 
TTCCTCTCAA 



11 
I 

CAGTTAGCCC 
GAAGGCCAAG 
CGCCGTGGAG 
TAAGAACGTG 
AAGCAACGAG 
GGAGACTGAG 
CAAGGAGGCC 
CCGCTACCTG 
GTCAGCCTAC 
CCGCCTGGGC 
GGAGGCCATC 
CAGCGAGGAC 
ACTGTGGACG 
CTGAGTGTTG 
ACTAGTATGG 
CTCCGTGGAG 
GCAGCTGTTG 
TCCTCCCGAC 
TGCCTCTGAT 
AGGGGCTGGA 
AGTGCAAGAC 
TAAAGTTCCC 



21 
I 

GCCGCCCGCC 
CTGGCAGAGC 
AAGGGCGAGG 
GTGGGCGGCC 
GAGGGCTCGG 
CTCCAGGGCG 
GGGGACGCCG 
GCCGAGGTGG 
CAGGAGGCCA 
CTGGCCCTGA 
TCTCTGGCCA 
TCCTACAAAG 
GCCGACAACG 
CCCGCCACCG 
GGTGGGAGGC 
AGGGACTGGC 
AGCGCACCTA 
CCCAGGACCA 
CGTAGGAATT 
GATGGGTGTG 
CGAGATTGAG 
CTGTGACACT 



31 
I 

TGTGTGTCCC 
AGGCCGAACG 
AGCTCTCCTG 
AGAGGGCTGC 
AGGAGAAGGG 
TGTGCGACAC 
AGAGCCGGGT 
CCACCGGTGA 
TGGACATCAG 
ACTTTTCCGT 
AGACCACTTT 
ACAGCACCCT 
CCGGGGAAGA 
CCCCGCCCTG 
CCCACCCTTC 
AGAGCTGAGG 
ACCACTGGTC 
GGCTACTTCT 
GAGGAGTGTC 
TGTGTGTGTG 
GGAAAGCATG 
C 



41 
I 

CAGAGCCATG 
CTATGAGGAC 
CGAAGAGCGA 
CTGGAGGGTG 
GCCCGAGGTG 
CGTGCTGGGC 
CTTCTACCTG 
CGACAAGAAG 
CAAGAAGGAG 
CTTCCACTAC 
CGACGAGGCC 
GATCATGCAG 
GGGGGGCGAG 
CCCCCTCCAG 
TCCCCTAGGC 
CCACCTGGGG 
ATGCCCCCAC 
CCCCTCCTCT 
CCGCCTTGTG 
TGTGTGTGTG 
TCTG CTGGGT 



51 
I 

GAGAGAGCCA 
ATGGCAGCCT 
AACCTGCTCT 
CTGTCCAGTA 
CGTGAGTACC 
CTGCTGGACA 
AAGATGAAGG 
CGCATCATTG 
ATGCCGCCCA 
GAGATCGCCA 
ATGGCTGATC 
CTGCTGCGAG 
GCTCCCCAGG 
TCCCCCACCC 
GCTGTTCTTG 
CTGGGGATCC 
CCCTGCTCTC 
TGCCTCCCTC 
GCTGAGAACT 

GTGACCATGT 



Seq ID NO: 109 Protein sequence: 
Protein Accession #: NP_006133.l 

1 11 



41 



51 



21 31 

I I J I 

MERASLIQKA KLAEGAERYE DMAAFMKGAV EKGEELSCEE RNLLSVAYKN WGGQRAAWR 
VLSSIEQKSN EEGSEEKGPE VREYRBKVET ELQGVCDTVL GLLDSHLIKE AGDAESRVFY 
LKMKGDYYRY LAEV ATGDD K KRI IDSARSA YQEAMDISKK EMPPTNPIRL GIiALNPSVFH 
YEIANSPEEA ISLAKTTFDE AMADLHTLSE DSYKDSTLIM QLLRDNLTLW TADNAGEEGG 
EAPQEPQS 

Seq ID NO: 110 DNA sequence 
Nucleic Acid Accession #: NM_000695 
Coding sequence: 407-1564 



CACGAGTTGG 
GAGGCCTGGG 
TGGAGGTGCA 
CACACTGCGG 
GGCTGCGCAG 



11 

1 

TTTGGGAGCT 
GGTAGGAGCA 
GCGAAGGACC 
CGGCTGCGTG 
CTCCAGGGCC 



21 
I 

GCCAGTCTCC 
GAGCCTGCGC 
CAGGGGCAGA 
AGGCCTTCAA 
TGGGCCACTT 



31 
I 

TGGGAGGATC 
ATCTGGAGGC 
GCCCACGCTG 
CTGAGGGCGC 
CCTTCAAGAA 



41 
I 

GCAGTCAGCA 
AG CAT GT CCA 
GGGATGGACC 
ACGCGGCCGG 
AACAAGCAGC 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 
180 
240 



51 
I 

GAGCAGGGCT 
AGAAAGGGAG 
CCTTCGAGGA 
CCGAGTTCCG 
TTCTGCGCGA 



60 
120 
180 
240 
300 



230 
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CGTGCTGGCC CAGGACCTGC ATAAGCCAGC TTTCGAGGCA GACATATCTG AGCTCATCCT 360 

TTGCCAGAAC GAGGTTGACT ACGCTCTCAA GAACCTTCAG GCCTGGATGA AGGATGAACC 420 

ACGGTCCACG AACCTGTTCA TGAAGCTGGA CTOGGTCTTC ATCTGGAAGG AACCCTTTGG 480 

CCTGGTCCTC ATCATCGCAC CCTGGAACTA CCCATTGAAC CTGACCCTGG 'TGCTCCTGGT 540 

GGGCACCCTC CCCGCAGGGA ATTGCGTGGT GCTGAAGCCG TCAGAAATCA GCCAGGGCAC 600 

AGAGAAGGTC CTGGCTGAGG TGCTGCCCCA GTACCTGGAC CAGAGCTGCT TTGCCGTGGT 660 

GCTGGGCGGA CCCCAGGAGA CAGGGCAGCT GCTAGAGCAC AAGTTGGACT ACATCTTCTT 720 

CACAGGGAGC CCTCGTGTGG GCAAGATTGT CATGACTGCT GCCACCAAGC ACCTGACGCC 780 

TGTCACCCTG GAGCTGGGGG GCAAGAACCC CTGCTACGTG GACGACAACT GCGACCCCCA 840 

GACCGTGGCC AACCGCGTGG CCTGGTTCTG CTACTTCAAT GCCGGCCAGA CCTGCGTGGC 900 

CCCTGACTAC GTCCTGTGCA GCCCCGAGAT GCAGGAGAGG CTGCTGCCCG CCCTGCAGAG 960 

CACCATCACC CGTTTCTATG GGGACGACCC CCAGAGCTCC CCAAACCTGG GCCGCATCAT 1020 

CAACCAGAAA CAGTTCCAGC GGCTGCGGGC ATTGCTGGGC TGCGGCCGCG TGGCCATTGG 1080 

GGGCCAGAGC AACGAGAGCG ATCGCTACAT CGCCCCCACG GTGCTGGTGG ACGTGCAGGA 1140 

GACGGAGCCT GTGATGCAGG AGGAGATCTT CGGGCCCATC CTGCCCATCG TGAACGTGCA 1200 

GAGCGTGGAC GAGGCCATCA AGTTCATCAA CCGGCAGGAG AAGCCCCTGG CCCTGTACGC 1260 

CTTCTCCAAC AGCAGACAGG TTGTGAACCA GATGCTGGAG CGGACCAGCA GCGGCAGCTT 1320 

TGGAGGCAAT GAGGGCTTCA CCTACATATC TCTGCTGTCC GTGCCATTCG GGGGAGTCGG 13 80 

CCACAGTGGG ATGGGCCGGT ACCACGGCAA GTTCACCTTC GACACCTTCT CCCACCACCG 1440 

CACCTGCCTG CTCGCCCCCT CCGGCCTGGA GAAATTAAAG GAGATCCGCT ACCCACCCTA 1500 

TAOCGACTGG AACCAGCAGC TGTTACGCTG GGGCATGGGC TCCCAGAGCT GCACCCTCCT 1560 

GTGAGCGTCC CACCCGCCTC CAACGGGTCA CACAO AGAAA CCTGAGTCTA GCCATGAGGG 1620 

GCTTATGCTC CCAACTCACA TTGTTCCTCC AGACCGCAGG CTCCCCCAGC CTCAGGTTGC 1680 

TGGAGCTGTC ACATGACTGC ATCCTGCCTG CCAGGGCTGC AAAGCAAGGT CTTGCTTCTA 1740 

TCTGGGGGAC GCTGCTCGAG AGAGGCCGAG AGGCCGCAGA ACATGCCAGG TGTCCTCACT 1800 

CACCCCACCC TCCCCAATTC CAGCCCTTTG CCCTCTCGGT CAGGGTTGGC CAGGCCCAGT 1860 

CACAGGGGCA GTGTCACCCT GGAAAATACA GTGCCCTGCC TTCTTAGGGG CATCAGCCCT 1920 

GAACGGTTGA GAGCGTGGAG CCCTCCAGGC CTTTGCTCTC CCCTCTAGGC ACACGCGCAC 1980 

TTCCACCTCT GCCCCATCCC AACTGCACCA GCACTGCCTC CCCCAGGGAT CCTCTCACAT 2040 

CCCACACTGG TCTCTGCACC ACCCCTCTGG TTCACACCGC ACCCTGCACT CACCCACAGC 2100 

AGCTCCATCC ACTGGGAAAA CTGGGGTTTG CATCACTCCA CTGCACAGTG TTAGTGGGAC 2160 

CTGGGGGCAA GTCCCTTGAC TTCTCTGAGC CTCAGTTTCC TTATGTGAAA GTTGCTGGAA 2220 

CCAAAATGGA GTCACTTATG CCAAACTCTA ATAAAATGGA GTCGGGGGGG CACATAGAAG 2280 

CCCTCACACA CACATGCCCG TAACAGGATT TATCACCAAG ACACGCCTGC ATGTAAGACC 2340 

AGACACAGGG CGTATGGAAA AGCACGTCCT CAAAGACTGT AGTATTCCAG ATGAGCTGCA 2400 

GATGCTTACC TACCACGGCC GTCTCCACCA GAAAACCATC GCCAACTCCT GCGATCAGCT 2460 

TGTGACTTAC AAACCTTGTT TAAAAGCTGC TTACATGGAC TTCTGTCCTT TAAAACGTTC 2520 

CCCTTGGCTG TGGCCCTCTG TGTATGCCTG GGATCCTTCC AAGCACTCAT AGCCCAGATA 2580 
GGAATCCTCT GCTCCTCCCA AATAAATTCA TCTGTTC 



Seq ID NOt 111 Protein sequence: 
protein Accession #: NP_000686 

1 11 21 31 41 51 

I I I I I I 

KKDEPRSTNLi FMKLDSVPIW KEPFGLVLII APWNYPLNLT LVLLVGTLPA GNCWLKPSE 60 

ISQGTEKVLA EVLPQYLDQS CPAWLGGPQ ETGQLLEKKL DYIFFTGSPR VGKIVMTAAT 120 

KHLTPVTLEL GGKNPCYVDD NCDPQTVANR VAWFCYFNAG QTCVAPDYVL CSPEMQERLL 180 

PALQSTITRP YGDDPQSSPN LGRI INQKQF QRLRALLGCG RVAIGGQSNE SDRYIAPTVL 240 

VDVQETEPVM QEEIFGPILP IVNVQSVDEA IKFINRQEKP LALYAFSNSR QWNQMLERT 300 

SSGSFGGNEG FTYISLLSVP FGGVGHSGMG RYHGKFTFDT FSHHRTCLLA PSGIiEKLKEI 360 
RYPPYTDWNQ QLLRWGMGSQ SCTl.Ii 

Seq ID NO: 112 DNA sequence 
Nucleic Acid Accession #t NM_0044S6 
Coding sequence: 58-2298 

1 11 21 31 41 51 

I I I I I I 

GAATTCCGGG CGACGCGCGG GAACAACGCG AGTCGGCGCG CGGGACGAAG AATAATCATG 60 

GGCCAGACTG GGAAGAAATC TGAGAAGGGA CCAGTTTGTT GGCGGAAGCG TGTAAAATCA 120 

GAGTACATGC GACTGAGACA GCTCAAGAGG TTCAGACGAG CTGATGAAGT AAAGAGTATG 180 

TTTAGTTCCA AT CGTCAGAA AATTTTGGAA AGAACGGAAA TCTTAAACCA AGAATGGAAA 240 

CAGCOAAGGA TACAG CCTGT GCACATCCTG ACTTCTGTGA GCTCATTGCG CGGGACTAGG 300 

GAGTGTTCGG TGACCAGTGA CTTGGATTTT CCAACACAAG TCATCCCATT AAAGACTCTG 360 

AATGCAGTTG CTTCAGTACC CATAATGTAT TCTTGGTCTC CCCTACAGCA GAATTTTATG 420 

GTGGAAGATG AAACTGTTTT ACATAACATT CCTTATATGG GAGATGAAGT TTTAGATCAG 480 

GATGGTACTT TCATTGAAGA ACTAATAAAA AATTATGATG GGAAAGTACA CGGGGATAGA 540 

GAATGTGGGT TTATAAATGA TGAAATTTTT GTGGAGTTGG TGAATGCCCT TGGTCAATAT 600 

AATGATGATG ACGATGATGA TGATGGAGAC GATCCTGAAG AAAGAGAAGA AAAGCAGAAA 660 

GATCTGGAGG ATCACCGAGA TGATAAAGAA AGCCGCCCAC CTCGGAAATT TCCTTCTGAT 720 

AAAATTTTGG AGGCCATTTC CTCAATGTTT CCAGATAAGG GCACAGCAGA AGAACTAAAG 780 

GAAAAATATA AAGAACTCAC CGAACAGCAG CTCCCAGGCG CACTTCCTCC TGAATGTACC 840 

CCCAACATAG ATGGACCAAA TGCTAAATCT GTTCAGAGAG AGCAAAGCTT ACACTCCTTT 900 

CATACGCTTT TCTGTAGGCG ATGTTTTAAA TATGACTGCT TCCTACATCC TTTTCATGCA 960 

ACACCCAACA CTTATAAGCG GAAGAACACA GAAACAGCTC TAGACAACAA ACCTTGTGGA 1020 

CCACAGTGTT ACCAGCATTT GGAGGGAGCA AAGGAGTTTG CTGCTGCTCT CACCGCTGAG 1080 

CGGATAAAGA CCCCACCAAA ACGTCCAGGA GGCCGCAGAA GAGGACGGCT TCCCAATAAC 1140 

AGTAGCAGGC CCAGCACCCC CACCATTAAT GTGCTGGAAT CAAAGGATAC AGACAGTGAT 1200 

AGGGAAGCAG GGACTGAAAC GGGGGGAGAG AACAATGATA AAGAAGAAGA AGAGAAGAAA 1260 

GATGAAACTT CGAGCTCCTC TGAAGCAAAT TCTCGGTGTC AAACACCAAT AAAGATGAAG 1320 

CCAAATATTG AACCTCCTGA GAATGTGGAG TGGAGTGGTG CTGAAGCCTC AATGTTTAGA 1380 

GTCCTCATTG GCACTTACTA TGACAATTTC TGTGCCATTG CTAGGTTAAT TGGGACCAAA 1440 

ACATGTAGAC AGGTGTATGA GTTTAGAGTC AAAGAATCTA GCATCATAGC TCCAGCTCCC 1500 

GCTGAGGATG TGGATACTCC TCCAAGGAAA AAGAAGAGGA AACACCGGTT GTGGGCTGCA 1560 

CACTGCAGAA AGATACAGCT GAAAAAGGAC GGCTCCTCTA ACCATGTTTA CAACTATCAA 1620 
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CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTCGTGCC CTTQTOTGAT 
TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAGTGTCAAA ACCGCTTTCC 
TGCAAAGCAC AGTGCAACAC CAAGCAGTGC CCGTGCTACC TGGCTGTCCG 
CCTGACCTCT GTCTTACTTG TGGAGCCGCT GACCATTGGG ACAGTAAAAA 
AAGAACTGCA GTATTCAGCG GGGCTCCAAA AAGCATCTAT TGCTGGCACC 
GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT 
TGTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTGTA 
ATGTGCAGCT TTCTGTTCAA CTTGAACAAT GATTTTGTGG TGGATGCAAC 
AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGCAAA 
GTTAAOGGTG ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC 
CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG 
GAAATGGAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTGAAACA 
CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT 
AATTTGCAAA GTACTGTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA 
GCCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA 
TACATTTTTC AACTTTGAAT AAAGAATACT TGAACTTGAA AAAAAAAAAA 



Seq ID NOt 113 Protein sequence i 
Protein Accession 8: NP_004447 



PCT/US02/12476 



MGQTGKKSEK 
KQRRIQPVHI 
MVEDETVIiHN 
YNDDDDDDDG 
KEKYKELTEQ 
ATPNTYKRKN 
N3SRPSTPTI 
XPNIEPPENV 
PAEDVDTPPR 
NFCEKFCQCS 
CKNCSIQRGS 
YMCSFLFNLN 
ELPVDYRYSQ 



11 
I 

GPVCWRKRVK 
LTSVSSLRGT 
IPYMGDEVLD 
DDPEEREEKQ 
QLPGALPPEC 
TETALDNKPC 
NVLESKDTDS 
EWSGAEASMP 
KKKRKHRLWA 
SECQNRPPGC 
KKHLLLAPSD 
NDPWDATRK 
ADAliKYVGIB 



21 
I 

SEYMRLRQLK 
RECSVTSDLD 
QDGTFIEELI 
KDLEDHRDDK 
TPNIDGPNAK 
GPQCYQHLEG 
DREAGTETGG 
RVLIGTYYDN 
AHCRKIQLKK 
RCKAQCNTKQ 
VAGWGIFIKD 
GNKIRFANHS 
REMEIP 



31 
I 

RFRRADEVKS 
FPTQVIPLKT 
KNYDGKVHGD 
ESRPPRKFPS 
SVQREQSLHS 
AKEFAAALTA 
ENNDKEEEEK 
FCAIARLIGT 
DGSSNHVYNY 
CPCYLAVREC 
PVQKNEFISE 
VNFNCYAKVM 



Seq ID NO j 114 DMA sequence 
Nucleic Acid Accession #: NM__001827 
Coding sequence: 96-335 



1 
I 

AGTCTCCGGC 
CGCTCTCGTT 
CGGACAAGTA 
CCAAACAAGT 
AACAGAGTCT 
TTAGACGACC 
TTTTCAAATT 
ACAAATCTTT 
AAATGCAACT 
TTTCTCTTAA 
TATGTTGCAT 



11 

I 

GAGTTGTTGC 
TCATTTTCTG 
CTTCGACGAA 
ACCTAAAACT 
AGGCTGGGTT 
TCTTCCAAAA 
TAATGTATAT 
CATCCATACC 
GCAAGTAGGT 
GTGCCTGTTT 
TTAAAAAAAA 



21 
I 

CTGGGCTGGA 
CAGCGCGCCA 
CACTACGAGT 
CATCTGATGT 
CATTACATGA 
GATCAACAAA 
GTGTATATAA 
TGTGCATGAG 
TACTGTAAGA 
GAGTTTACTG 
AAAAAAA 



31 
I 

CGTGGTTTTG 
CGAGGATGGC 
ACCGGCATGT 
CTGAAGAGGA 
TTCATGAGCC 
AATGAAGTTT 
GGTAGTATTC 
CTGTATTCTT 
TGTTTAAGAT 
AAACAGTTTA 



41 

I 

MFSSNRQKIL 
LNAVASVPIM 
RECGFINDEI 
DKILEAISSM 
FHTLFCRRCF 
ERIKTPPKRP 
KDETSSSSEA 
KTCRQVYEPR 
QPCDHPRQPC 
DPDLCLTCGA 
YCGEI ISQDE 
MVNGDHRIGI 



41 
I 

TCTGCTGCGC 
CCACAAGCAG 
TATGTTACCC 
GTGGAGGAGA 
AGAACCACAT 
ATCTGGGGAT 
AGTGAATACT 
CACAGCAACA 
AAAAGTTCTT 
CTTTTGTTCA 



AGCACAAAAT 
GGGATGCCGC 
AGAGTGTGAC 
TGTGTCCTGC 
ATCTGACGTG 
CT CAGAAT AC 
TGATAAATAC 
CCGCAAGGGT 
AGTTATGATG 
TGGCGAAGAG 
CATCGAAAGA 
GCTGCCTTAG 
TGAAATTCTG 
ACTTTTTATT 
TGCAGTATGG 
AAAAAA 



51 
I 

ERTEILNQEW 
YSWSPLQQNF 
FVELVNALGQ 
FPDKGTAEEL 
KYDCFLHPFH 
GGRRRGRLPN 
NSRCQTPIKM 
VKESSIIAPA 
DSSCPCVIAQ 
ADHWDSKNVS 
ADRRGKVYDK 
FAKRAIQTGE 



51 
I 

CCGCTCTTCG 
ATCTACTACT 
AGAGAACTTT 
CTTGGTGTCC 
ATTCTTCTCT 
CGTCAAATCT 
TGAGAAATGT 
GAGCTCAGTT 
CCAGTCAGTT 
ATAAAGTTTG 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



Seq ID NO i 115 Protein sequence: 
Protein Accession ft: NP_001818 



51 



1 11 21 31 41 

I I I I I I 

MAHKQIYYSD KYFDEHYEYR HVMLPRELSK QVPKTHLMSE EEWRRLGVQQ SLGWVHYMIH 
EPEPHILLFR RPLPKDQQK 

Seq ID NO: 116 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



TCAGACCTCA 
GCATCTGGAC 
AGAGGTGTGT 
GCAGCCAACA 
AAGGACTGAT 
CCTTGGAAGG 
TCAAGAATTC 
AGTGTGGGAG 
CCCATCTCTA 
CCTGTAGTTC 
AGGCTGCAAT 
CCTGTCTCAA 
TTTGAGGTGC 
CCTGAAGGAG 
GACAGACCTT 
CTCTTCCCCC 
CCGCCTCCCA 
CAGCCGGATA 
TCCTGCTCCG 



11 
I 

TGAGTCACTT 
CCTTGGTGCT 
TCCAGGGAAA 
GAGTTCAAAA 
CCACATTCCC 
ACCTGGCTCA 
TTTGCTGAGC 
GATCTCTTGA 
AAATAATAAT 
CAGCTACCCA 
GAACTGTGAT 
ATAATAATAA 
CATTTGGGTA 
CAQAGGGATG 
GTCCTTCTTC 
TCCCTGTCCC 
TGTCTGCTGT 
CAGAGTGAAT 
TGTAAAGAGG 



21 
I 

GGACT CTTGA 
ATCGACGAAG 
GCCCCTATCT 
TGCAGGCTTG 
ACCAGGAAGT 
GGCTGGACCA 
ATGGTGCCTC 
GCCCAGGAGT 
AATAATAAAA 
GGAGGCTGAG 
TACCCCACTG 
TAATAATAAT 
GAAAGAAAAG 
CATCGCTGGA 
CTTGTGGAAA 
AGGGAACCAA 
GCCTTTGTAC 
AGTTAACCAC 
CCAGTGTTTG 



31 
I 

GCCACCTCTG 
CTTGGGTGGG 
CTCTGCAGAG 
GAAAGTACAG 
TTAGCAGAAC 
CCTCTTGAGA 
ATGCCTATAA 
TCAAGACTAG 
T AAAAAA TT A 
GCAAGAGGAT 
CACTCCAGCC 
CTTATTTTGG 
ACGTTTACAC 
GGTGACCTAC 
GTGTTTCCTC 
AGGGCTTTCT 
TCAGCAATTC 
ACTTAGGTCA 
TGTGTTGCAA 



41 

I 

GGGGTGGAGT 
GCTCTTAGCT 
GTCAAGTGAA 
GGGGCTCTGT 
CCCCGCGTGC 
GGGAGGAGCT 
TACCAACACT 
CCTGGGCAAC 
GCAGGGCATG 
GGCTGGAGCC 
TGGGCAAAAG 
AGAATAAAGA 
CGAGAAATAG 
AGTTGAAGAA 
TGCTGCTACT 
ACCACACCCT 
TTGTTTGCTC 
AATAGGATCT 
GCAGCCTTGG 



51 
I 

CTCTCTCCTG 
GCTATGTGCA 
AGCGACGGCC 
GGAGGATGGG 
CAACTGGACC 
CTGGATTTGA 
TTGGGAGGCC 
ACAGAGAGAA 
GTGGCATGTG 
TGGGATGTTG 
AGCGAGAGAA 
GACCTCTGGA 
TCTGTGTTGC 
GACTCATTAT 
GCTCATGAGA 
TTCTTGCCCC 
CATTATCTTC 
AAATTTTTGT 
AATAGTAACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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CTTCTCATTT GTTTGGGATC TGGCCACCAA GTTCCAGAAT GATACACGGA TCAGTGCAGA 1200 

AGTTCATCAG GCTCTCGGAC CTTAGGGCTG TTGGAGAAGG CTTCAGCAGC AGAACTGATG 1260 

GTGAAGGCTC GTGTTCTCCA TCCTCAACTT TCTTTGCTTC GATCATACAC AAGAATACAT 1320 

TTGGAAGGGC AAAAAATGAA CACTGTCGTT CATTGCAGCC GTGTTTTGTG ACACAGATGC 1380 

ACAGTCTGCT GTGAAGACCT TCTCTCAAGT GGCATTTGGG AGTCCATGCC AGATCATGGT 1440 

GCTTCATGAG AGACTGACAG CTATCAGGGG TTGTGGCACT TAGTGAGGAC TCTCCTCCCC 1500 

CAGTGTGTGC TGATGACACA TACACACCTG ACAATAGCTT GAGTCTTCTC TGTTCCTTTT IS 60 

ACTCTGTAGC CAACATACAC ATGATTTAAA ACCCTTTCTA AATATCTATC ATGGTTCATC 1620 

CTTGTCCAAA TGCAGAGTCA GAGCTATTTG TACTTCATTA TTATTTCCAA GGCGAATAGT 1680 

TGG CT TTCTT TTTGCAAAAA TAATTAAAGT TTTTGTATGT TGCAAAAAAA AAAAAAAAAA 1740 
AAACAAAAAA 

Seq ID NO: 117 DNA sequence 

Nucleic Acid Accession #: BC012178.1 

Coding sequence i 204-2285 

1 11 21 31 41 51 

I I I I I I 

CTTCTCTCCC GGGGCGCTGG GGCCCGCGCT CCGCTGCTGT TGCTCCATTC GGCGCTTTTC 60 

TGGCGGCTGG CTCCTCTCCG CTGCCGGCTG CTCCTCGACC AGGCCTCCTT CTCAACCTCA 120 

GCCCGCGGCG CCGACCCTTC CGGCACCCTC CCGCCCCGTC TCGTACTGTC GCCGTCACCG 180 

CCGCGGCTCC GGCCCTGGCC CCGATGGCTC TGTGCAACGG AGACTCCAAG CTGGAGAATG 240 

CTGGAGGAGA CCTTAAGGAT GGCCACCACC ACTATGAAGG AGCTGTTGTC ATTCTGGATG 300 

CTGGTGCTCA GTACGGGAAA GTCATAGACC GAAGAGTGAG GGAACTGTTC GTGCAGTCTG 3 60 

AAATTTTCCC CTTGGAAACA CCAGCATTTG CTATAAAGGA ACAAGGATTC CGTGCTATTA 420 

TCATCTCTGG AGGACCTAAT TCTGTGTATG CTGAAGATGC TCCCTGGTTT GATCCAGCAA 480 

TATTCACTAT TGGCAAGCCT GTTCTTGGAA TTTGCTATGG TATGCAGATG ATGAATAAGG 540 

TATTTGGAGG TACTGTGCAC AAAAAAAGTG TCAGAGAAGA TGGAGTTTTC AACATTAGTG 600 

TGGATAATAC ATGTTCATTA TTCAGGGGCC TTCAGAAGGA AGAAGTTGTT TTGCTTACAC 660 

ATGGAGATAG TGTAGACAAA GTAGCTGATG GATTCAAGGT TGTGGCACGT TCTGGAAACA 720 

TAGTAGCAGG CAT AG CAAAT GAATCTAAAA AGTTATATGG AGCACAGTTC CACCCTGAAG 780 

TTGGCCTTAC AGAAAATGGA AAAGTAATAC TGAAGAATTT CCTTTATGAT ATAGCTGGAT 640 

GCAGTGGAAC CTTCACCGTG CAGAACAGAG AACTTGAGTG TATTCGAGAG ATCAAAGAGA 900 

GAGTAGGCAC GTCAAAAGTT TTGGTTTTAC TCAGTGGTGG AGTAGACTCA ACAGTTTGTA 960 

CAGCTTTGCT AAATCGTGCT TTGAACCAAG AACAAGTCAT TGCTGTGCAC ATTGATAATG 1020 

GCTTTATGAG AAAACGAGAA AGCCAGTCTG TTGAAGAGGC CCTCAAAAAG CTTGGAATTC 1080 

AGGTCAAAGT GATAAATGCT GCTCATTCTT TCTACAATGG AACAACAACC CTACCAATAT 1140 

CAGATGAAGA TAGAACCCCA CGGAAAAGAA TTAGCAAAAC GTTAAATATG ACCACAAGTC 1200 

CTGAAGAGAA AAGAAAAATC ATTGGGGATA CTTTTGTTAA GATTGCCAAT GAAGTAATTG 12 60 

GAGAAATGAA CTTGAAACCA GAGGAGGTTT TCCTTGCCCA AGGTACTTTA CGGCCTGATC 1320 

TAATTGAAAG TGCATCCCTT GTTGCAAGTG GCAAAGCTGA ACTCATCAAA ACCCATCACA 13 80 

ATGACACAGA GCTCATCAGA AAGTTGAGAG AGGAGGGAAA AGTAATAGAA CCTCTGAAAG 1440 

ATTTTCATAA AGATGAAGTG AGAATTTTGG GCAGAGAACT TGGACTTCCA GAAGAGTTAG 1500 

TTTCCAGGCA TCCATTTCCA GGTCCTGGCC TGGCAATCAG AGTAATATGT GCTGAAGAAC 1560 

CTTATATTTG TAAGGACTTT CCTGAAA CCA ACAATATTTT GAAAATAGTA GCTGATTTTT 1620 

CTGCAAGTGT TAAAAAGCCA CATACCCTAT TACAGAGAGT CAAAGCCTGC ACAACAGAAG 1680 

AGGATCAGGA GAAGCTGATG CAAATTACCA GTCTGCATTC ACTGAATGCC TTCTTGCTGC 1740 

CAATTAAAAC TGTAGGTGTG CAGGGTGACT GTCGTTCCTA CAGTTACGTG TGTGGAATCT 1800 

CCAGTAAAGA TGAACCTGAC TGGGAATCAC TTATTTTTCT GGCTAGGCTT ATACCTCGCA 1860 

TGTGTCACAA CGTTAACAGA GTTGTTTATA TATTTGGCCC ACCAGTTAAA GAACCTCCTA 1920 

CAGATGTTAC TCCCACTTTC TTGACAACAG GGGTGCTCAG TACTTTACGC CAAGCTGATT 1980 

TTGAGGCCCA TAACATTCTC AGGGAGTCTG GGTATGCTGG GAAAATCAGC CAGATGCCGG 2040 

TGATTTTGAC ACCATTACAT TTTGATCGGG ACCCACTTCA AAAG CAGCCT TCATGCCAGA 2100 

GATCTGTGGT TATTCGAACC TTTATTACTA GTGACTTCAT GACTGGTATA CCTGCAACAC 2160 

CTGGCAATGA GATCCCTGTA GAGGTGGTAT TAAAGATGGT CACTGAGATT AAGAAGATTC 2220 

CTGGTATTTC TCGAATTATG TATGACTTAA CATCAAAGCC CCCAGGAACT ACTGAGTGGG 2280 
AGTAATAAAC TTCTTGTTCT ATTAAAA 



Seq ID NO : 118 Protein sequence: 
Protein Accession #: AAH12178.1 

1 11 21 31 41 51 

I I I I I I 

MALCNGDSKL ENAGGDLKDG HHHYEGAWI LDAGAQYGKV IDRRVRELFV QSEIFPLETP 60 

APAIKEQGFR AHISGGPNS VYAEDAPWFD PAIPTIGKPV LGICYGMQMM NKVFGGTVHK 120 

KSVREDGVFN ISVPNTCSIiP RGLQKESWL LTHGD9VDKV ADGFKWARS GNIVAGIANK 180 

SKKLYGAQPH PEVGLTENGK VILKNFLYDI AGCSGTFTVQ NRKLECIREI KERVGTSKVL 240 

VLLSGGVDST VCTALLNRAI* NQEQVIAVHI DNGFMRKRES QSVEEALKKL GIQVKVINAA 3 00 

HSFYNGTTTI* PISDEDRTPR KRISKTLNMT TSPEEKRKII GDTPVKIANE VIGEMNLKPE 360 

EVFLAQGTLR PDLIESASLV ASGKAELIKT HHNDTELIRK LREEGKVIEP LKDFHKDEVR 420 

ILGRELGIjPE ELVSRHPPPG PGLAIRVICA EEPYICKDFP BTNNILKIVA DPSASVKKPH 480 

TLLQRVKACT TEBDQEKLMQ ITSLHSLNAP LLPIKTVGVQ GDCRSYSYVC GISSKDEPDW 540 

ESLIFLARLI PRMCHNVNRV VYIFGPPVKE PPTDVTPTFIj TTGVLSTLRQ ADFEAHNILR 600 

ESGYAGKISQ MPVILTPLHF DRDPLQKQPS CQRSWIRTF ITSDFMTGIP ATPGNEIPVE 660 
WLKMVTEIK KIPGISRIMY DLTSKPPGTT EWE 

Seq ID NO* 119 DNA sequence 

Nucleic Acid Accession tt; NM_006500.1 

Coding sequence: 27.. 1967 

1 ll 21 31 41 51 

) I I I I I 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTT CCCAG GCTGGTCTGC GCCTTCTTGC 60 
TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 
CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 
AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 



233 
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TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 

TCAGCCTCCA GGACAGAGGG GCTACTCTGQ CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 

GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 

TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

TTCAGTTGCA TGA CCTG AAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC. 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740. 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGGCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TACAAAAAAA AAT TAG CT AG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 



Seq ID NOi 120 Protein sequence: 
Protein Accession #: NP_006491.1 

1 11 21 31 41 51 

1 I I I I I 

MGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGHLSHV 60 

DWFSVHKEKR TItlFRVRQGQ GQSBPGEYEQ RLSLQDRGAT LALTQVTPQD BRIFLCQGKR 120 

PRSQEYRIQL RVYKAPBBPN IQVNPLGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

LKEEKNRVHI QSSQTVESSG LYTLQS ILKA QLVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVFYPTB KVWLEVEPVG MLKEGDRVEI RCLADGNPPP HFSISKQNPS TREAEEETTN 300 

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS 360 

LTLTCEAESS QDLEPQWLRE ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGLNRT 420 

QLVKLAIFGP PWMAFKERKV WVKENMVLNL SCRASGHPRP T I SWNVNGTA SEQDQDPQRV 480 

LSTLNVLVTP EIiLBXGVECT ASNDLGKNTS ILFLELVNLT TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTER KLPEPESRGV VIVAVIVCTI, VLAVLGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTELW EVKSDKLPEE MGLLQGSSGD KRAPGDQGEK YIDLRH 

Seq ID NO i 121 DNA sequence 
Nucleic Acid Accession «» NM_018306 
Coding sequence: 60-671 

41 51 
1 I 

AAGGACCACA GGAAAAGCCA 60 
TCAAGTCCAC AGAGAAACAG 120 
CGGGAAGGCT GGACTCTTTT 180 
CTTCTCTTCC TCCTCATCCT 240 
TGGGGAGCCT GACGTTTTGA 300 



1 11 21 31 

1 I I I 

ATAGTCTACA CAGAGCTCCC CTTGCTGCCC AGACAAGCTG 
TGGAGACTTC AGCATCCTCC TCCCAGCCTC AGGACAACAG 
AAGATGTAGA CTATGGAGAG ACAGATTTCC ACAAGCAAGA 
CCCAAGAACA ATATGAGAGA AACAAGTCTT CTTCCTCCTC 
CCTCATCTTC TTCATCCTCC TCCTCCTCAG GTCCTGGGCA 



234 
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AGGATGAGCT TCAACTCTAT GQAGATQCTC CTGGAGAGGT GGTACCCTCT GGGGAATCAG 360 

GACTCCGAAG GAGAGGCTCT GACCCAGCAA GTGGAGAAGT GGAGGCCTCT CAGTTAAGAA 420 

GACTGAATAT AAAGAAAGAT GATGAGTTTT TCCATTTCGT CCTCCTGTGC TTTGCCATCG 480 

GGGCCTTGCT GGTGTGTTAT CACTATTACG CAGACTGGTT CATGTCTCTT GGGGTCGGCC 540 

5 TGCTCACCTT CGCCTCCCTG GAAACCGTTG GCATCTACTT CGGACTAGTG TACCGTATCC 600 

ACAGCGTCCT CCAAGGCTTC ATCCCCCTCT TCCAGAAGTT TAGGCTGACA GGGTTCAGGA 660 

AGACTGACTG AGGCCACTTC CAGGTGGGCA GCAGAGGCAG GCCCCAGTGT GACCACCACT 720 

GCGACCCCTG AGCCCACAAG GGCAGAGCAG CATTCTGAGA GACGCACAGG AGACCAAGCC 780 

AGACCAATAA ACAGAACACT TTTCCTTCCA TGTGGTCTGA ATGTTGGCAC CAGCCCGGGC 840 

10 AGGGGCATCT CATTTGGGCA GTACTGCTGT GCAACCCAGC TGCAAGGATG GAAGGCAGAG 900 

GGTGGGTGTG GGGCCTGAGG CTTCACAGTA CCTGGACCAG CAGGAAGATT CTGGGAGGTC 960 

ACTGCTCTCA GAGGACAGCA AGGGACCCTG AGCTCTGCAA GCTGTGATCT GTCTGGGTTC 1020 

ATGGTTTTTC TCAAATCCCA GGCTATCTGC ATGCGCTCTC AGGTGCTACC GAGCCATCCT 1080 

GGGAGAGATG GATGGTCCAC TGCTTTGAGG CAGGGAGCCA TCGGGCTGGG GCCCCTTGGT 1140 

15 GAACCTGATG CAGGTAAGAT GCTGAGGACT AAAACCATTT TTTTTGCACC CAAAAAAAAA 1200 

GGCAGGAAAA TGATCATCAG AAACTAAATG GCAGCCAGGC ATGGGGGCTC ACGACTGTAA 1260 

TCCTCGCACT TTGGGAGGCT CAGGCTAAGG GTCGCTTGAA GCTGAGAGTT CAAGACCAAC 1320 

CTGGGCAACA TAGTGAGACC CCCATCTCTA CAATTTTTTT TTAATGACCA AATGTGGCGG 1380 

TACATACCTG TACATACCTG CGGTTCCAGC TACTCAAGAG GCTGAGGCAG GAGGACTGCT 1440 

20 TGAGCCCAGG AGTTCAGGGC TGCAGTGAGG TACGATCAAG CCACTGCACT CCAGCCTGGG 1500 
CGACAGAGCA AGATCGTTTC TCTAAAATT 
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Seq ID NO: 122 Protein sequence » 
Protein Accession #: NPJ>60776 

1 11 21 31 41 51 

I i I I 1 I 

METSASSBQP QDNSQVHRET EDVDYGETDF HKQDGKAGLF SQEQYERNKS SSSSFSSSSS 60 

SSSSSSSSSS GPGHGEPDVL KDELQLYGDA PGEWPSGES GLRRRGSDPA SGEVEASQLR 120 

RLNI KKDDEF FHFVLLCFAI GALLVCYHYY ADWFMSLGVG LLTFASLETV GIYFGLVYRI 180 
HSVLQGFIPL FQKFRLTGFR KTD 

Seq ID NO: 123 DNA sequence 
Nucleic Acid Accession tf: BC022542 
Coding sequence: 243.. 896 

1 11 21 31 41 51 

I I I I I I 

ACTTGGTCCC AGCCGATAAA TCTGGGGCAG CGCGCGGTAG GAGCTGCGGG CGGCCAGGCC 60 

CCTTCCTGCG TCCGCACCTG GCCCCGCGCG CCCCTCTCGG GCGTCCGGCT TCCGGCGTCC 120 

TGGCGGCTCG GGTGGCGGCG GTTCGGGCGG CCGCCTGGCT GCTCCTCGGG GCGGCGACGG 180 

GGCTCACGCG CGGGCCCGCC ACGGCCTTCA CCGCCGCGCG CTCTGACGCC GGCATAAGGG 240 

CCATGTGTTC TGAAATTATT TTGAGGCAAG AAGTTTTGAA AGATGGTTTC CACAGAGACC 300 

TTTTAATCAA AGTGAAGTTT GGGGAAAGCA TTGAGGACTT GCACACGTGC CGTCTCTTAA 360 

TTAAACAGGA CATTCCTGCA GGACTTTATG TGGATCCGTA TGAGTTGGCT TCATTACGAG 420 

A GAG AAA CAT AACAGAGGCA GTGATGGTTT CAGAAAATTT TGATATAGAG GCCCCTAACT 480 

ATTTGTCCAA GGAGTCTGAA GTTCTCATTT ATGCCAGACG AGATTCACAG TGCATTGACT 540 

GTTTTCAAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCACAGT GAAGATGGAG 600 

AAGCCTCGAT TGTGGTCAAT AACCCAGATT TGTTGATGTT TTGTGACCAA GAGTTCCCGA 660 

TTTTGAAATG CTGGGCTCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AATGAGGATA 720 

TATGCCAATG GAACAAGATG AAGTATAAAT CAGTATATAA GAATGTGATT CTACAAGTTC 780 

CAGTGGGACT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATT ACAAT C C 840 

TGTGCTCTAC ATTGATCCTT GTAGCAGTTT TCAAATATGG CCATTTTTCC CTATAAGTTT 900 

TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTG ACGAGAGGTG 960 

TTCTTCTAGA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGCCAAAA TTATGTTTAC 1020 

TAGAGGAAAT TTGGGATCAT TCTCAGCTAA TTCCAAAATG TAGTGCTCTA TTGCATGGAT 1080 

CCTTGGTAAT CCTCAAGCAT CAGATGCCAT AAGGGGAAAC TTAATTCTGC TAAATTAATG 1140 

TTTATTTTGT GAGAAGTGAC TTTATCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 1200 

GTAGAGACAA ATTATTCTCA TTTTGCAAGT ACTTTCAATT TAAGCTACAA ATTGAGAAAA 1260 

CCGTTATAAA TAAGAATAAA ATAGGCCAGG CACAGTGGCT CACACCTGTA ATCCCAGCAC 1320 

TTTGGGAGGC CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGGTGAA 1380 

ACCCTGTCTC TACTAAAAAT ACAAAAGTTA GCTGGGGCTG GTGGTGGGCA TCTGTAGTCC 1440 

CAGCTAATTG GAAGGGTGAG GCGGGAGGAT CGCTTGAACC TGGGAGGCGG AGGTTCCAGA 1500 

GAGCCAAGAT CGCACCACTG CACTACAGCC TGGGCGACAG AACGAGACCC TGTCTCCAAA 1560 

GGAAAAACAA AAAAGAAGAA TAAAATAATT TGGATGAAAA TCATGTTTAT TTAAATAGTA 1620 

ATGTCATGAG ACTATTAAAG ATGTGCCAGA GTTTCAATGA AAATCATTAA AGTAGGACAG 1680 

CTAAGAAATT AATATTAATA TAAAAATTAT TGATAATCTT AAATTATTGA TTATTCCTTA 1740 

ACGCACTCCA TTCTCCTTTT ACATTTTATC ATGTTTCTTT TGAATATATG AATTGGCAAA 1800 

GGACTTGATG AAACTGAGTA CTAAGATTTG GTACAGAGTA TGTCAGGAAG ACAACTCAGA 1860 
TTGCCATTTT AAATAAAGTT GTACATGAAC AAAAAAAAAA AAAAAA 



Seq ID NO: 124 Protein sequence: 
75 Protein Accession &: AAH22542 

1 11 21 31 41 51 

1 1 I I I I 

MCSEIILRQE VLKDGFHRDL LIKVKFGESI EDLHTCRLLI KQDIPAGLYV DPYELASLRE 60 
80 RNITEAVMVS ENFDI EAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGE 120 

ASIWNNPDL LMFCDQAGSR RMIRFRFDSF DKTIEFPILK CWAHSEVAAP CALENEDICQ 180 
WNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTLLITILCS KKKKK 

0 Seq ID NO: 125 DNA sequence 

85 Nucleic Acid Accession #: NM_004994.1 
Coding sequence: 20.. 2143 



235 



WO 02/086443 

11 



5 

10 
15 
20 
25 
30 
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60 
65 
70 
75 
80 
85 



1 

I I 
AGACACCTCT GCCCTCACCA 
GGGCTGCTGC TTTGCTGCCC 
CCTGAGAACC AATCTCACCG 
CACTCGGGTG GCAGAGATGC 
CCAGAAGCAA CTGTCCCTGC 
GCGAACCCCA CGGTGCGGGG 
CAAGTGGCAC CACCACAACA 
GGCGGTGATT GACGACGCCT 
CACCTTCACT CGCGTGTACA 
GCACGGAGAC GGGTATCCCT 
TGGCCCCGGC ATTCAGGGAG 
GGGCGTCGTG GTTCCAACTC 
CATCTTCGAG GGCCGCTCCT 
CTGGTGCAGT ACCACGGCCA 
GAGACTCTAC ACCCGGGACG 
CCAAGGCCAA TCCTACTCCG 
CGCCACCACC GCCAACTACG 
CTCGACGGTG ATGGGGGGCA 
GGGTAAGGAG TACTCGACCT 
TACCACCTCG AACTTTGACA 
TTTGTTCCTC GTGGCGGCGC 
GCCGGAGGCG CTCATGTACC 
CGACGTGAAT GGCATCCGGC 
AACCACCACC ACACCGCAGC 
TGTCCACCCC TCAGAGCGCC 
AGGTCCCCCC ACTGCTGGCC 
TGCCTGCAAC GTGAACATCT 
CAAGGATGGG AAGTACTGGC 
CCTTATCGCC GACAAGTGGC 
GCTCTCCAAG AAGCTTTTCT 
GGTGCTGGGC CCGAGGCGTC 
CGGGGCCCTC CGGAGTGGCA 
GTTCGACGTG AAGGCGCAGA 
CCCCGGGGTG CCTTTGGACA 
CCAGGACCGC TTCTACTGGC 
GGGCTACGTG ACCTATGACA 
GCAGTGCCAT GTAAATCCCC 
CAAACTGGTA TTCTGTTCTG 
TCACCTTTGT 



21 
I 

TGAGCCTCTG 
CCAGACAGCG 
ACAGGCAGCT 
GTGGAGAGTC 
CCGAGACCGG 
TCCCAGACCT 
TCACCTATTG 
TTGCCCGCGC 
GCCGGGACGC 
TCGACGGGAA 
ACGCCCATTT 
GGTTTGGAAA 
ACTCTGCCTG 
ACTACGACAC 
GCAATGCTGA 
CCTGCACCAC 
ACCGGGACAA 
ACTCGGOGGG 
GTACCAGCGA 
GCGACAAGAA 
ATGAGTTCGG 
CTATGTACCG 
ACCTCTATGG 
CCACGGCTCC 
CCACAGCTGG 
CTTCTACGGC 
TCGACGCCAT 
GATTCTCTGA 
CCGCGCTGCC 
TCTTCTCTGG 
TGGACAAGCT 
GGGGGAAGAT 
TGGTGGATCC 
CGCACGACGT 
GCGTGAGTTC 
TCCTGCAGTG 
ACTGGGACCA 
GAGGAAAGGG 
AGTGTTTCTA 



31 
I 

GCAGCCCCTG 
CCAGTCCACC 
GGCAGAGGAA 
GAAATCTCTG 
TGAGCTGGAT 
GGGCAGATTC 
GATCCAAAAC 
CTTCGCACTG 
AGACATCGTC 
GGACGGGCTC 
CGACGATGAC 
CGCAGATGGC 
CACCACCGAC 
CGACGACCGG 
TGGGAAACCC 
GGACGGTCGC 
GCTCTTCGGC 
GGAGCTGTGC 
GGGCCGCGGA 
GTGGGGCTTC 
CCACGCGCTG 
CTTCACTGAQ 
TCCTCGCCCT 
CCCGACGGTC 
CCCCACAGGT 
CACTACTGTG 
CGCGGAGATT 
GGGCAGGGGG 
CCGCAAGCTG 
GCGCCAGGTG 
GGGCCTGGGA 
GCTGCTGTTC 
CCGGAGCGCC 
CTTCCAGTAC 
CCGGAGTGAG 
CCCTGAGGAC 
ACCCTGGGGA 
AGGAGTGGAG 
ATAAACTTGG 



41 
I 

GTCCTGGTG C 
CTTGTGCTCT 
TACCTGTACC 
GGGCCTGCGC 
AGCGCCACGC 
CAAACCTTTG 
TACTCGGAAG 
TGGAGCGCGG 
ATCCAGTTTG 
CTGGCACACG 
GAGTTGTGGT 
GCGGCCTGCC 
GGTCGCTCGG 
TTTGGCTTCT 
TGCCAGTTTC 
TCCGACGGCT 
TTCTGCCCGA 
GTCTTCCCCT 
GATGGGCGCC 
TGCCCGGACC 
GGCTTAGATC 
GGGCCCCCCT 
GAACCTGAGC 
TGCCCCACCG 
CCCCCCTCAG 
CCTTTGAGTC 
GGGAACCAGC 
AGCCGGCCGC 
GACTCGGTCT 
TGGGTGTACA 
GCCGACGTGG 
AGCGGGCGGC 
AGCGAGGTGG 
CGAGAGAAAG 
TTGAACCAGG 
TAGGGCTCCC 
AGGAGCCAGT 
GTGGGCTGGG 
ATTCTCTAAC 



51 
I 

TCCTGGTGCT 
TCCCTGGAGA 
GCTATGGTTA 
TGCTGCTTCT 
TGAAGGCCAT 
AGGGCGACCT 
ACTTGCCGCG 
TGACGCCGCT 
GTGTCGCGGA 
CCTTTCCTCC 
CCCTGGGCAA 
ACTTCCCCTT 
ACGGCTTGCC 
GCCCCAGGGA 
CATTCATCTT 
ACCGCTGGTG 
CCCGAGCTGA 
TCACTTTCCT 
TCTGGTGCGC 
AAGGATACAG 
ATTCCTCAGT 
TGCATAAGGA 
CACGGCCTCC 
GACCCCCCAC 
CTGGCCCCAC 
CGGTGGACGA 
TGTATTTGTT 
AGGGCCCCTT 
TTGAGGAGCC 
CAGGCGCGTC 
CCCAGGTGAC 
GCCTCTGGAG 
ACCGGATGTT 
CCTATTTCTG 
TGGACCAAGT 
GTCCTGCTTT 
TTGCCGGATA 
CCCTCTCTTC 
CTTT 
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Seq ID NO: 126 Protein sequence: 
Protein Accession NP 004985.1 



MSLWQPLVLV 
RGESKSLGPA 
1TYWIQNYSE 
PDGKDGIiLAH 
YSACTTDGRS 
ACTTDGRSDG 
CTSEGRGDGR 
PMYRPTEGPP 
PTAGPTGPPS 
RFSEGRGSRP 
LDKLGLGADV 
THDVFQYREK 



11 
I 

LLVLGCCFAA 
LLLIjQKQLSL 
DLPRAVIDDA 
AFPPGPGIQG 
DGLPWCSTTA 
YRWCATTANY 
LWCATTSNFD 
LHKDDVNGIR 
AGPTGPPTAG 
QGPFLIADKW 
AQVTGALRSG 
AYPCQDRFYW 



21 
I 

PRQRQSTLVL 
PETGELDSAT 
PARAPALWSA 



NYDTDDRFGF 
DRDKLFGFCP 
SDKKWGFCPD 
HLYGPRPEPE 
PSTATTVPLS 
PALPRKLDSV 
RGKMLLFSGR 
RVSSRSELNQ 



31 
I 

F PGDLRTNIiT 
LKAMRTP RCG 
VTPLTFTRVY 
SIiGKGVWPT 
CPSERLYTRD 
TRADSTVMGG 
QGYSLFLVAA 
PRPPTTTTPQ 
PVDDACNVNI 
FEEFLSKKLF 
RLWRFDVKAQ 
VDQVGYVTYD 



Seq ID NO: 127 DNA sequence 
Nucleic Acid Accession #i NM_004181 
Coding sequence: 32-670 



GCAGAAATAG 
GGTCGCCGGC 
GGTGCCAGCG 
CTTCAGGAAA 
CATGAAGCAG 
TAATCAAGAC 
AGAGAAAATG 
AGCCCATGAT 
TTTTATTCTG 
TCCGGTGAAC 
CAGAGAATTC 
GGCAGCCTAA 
AATATATACC 
TGTTCTGCAG 
ACAGCTGTCC 
TATGTCTTGT 
AAGACCTTGG 



11 
I 

CCTAGGGAGA 
CAGTGGCGCT 
CCTGCCTGCG 
AAGCAGATTG 
ACCATTGGGA 
AAACTGGGAT 
TCCCCTGAAG 
GCCGTGGCAC 
TTTAACAACG 
CATGGCGCCA 
ACCGAGCGTG 
TGCTCTGTGG 
CCCCATGCAG 
ACACGCCTTC 
ACTGGGCCAT 
ATCCGATATC 
ATGTGGTTAT 



21 
1 

TCAACCCCGA 
TCGTGGACGT 
CGCTGCTGCT 
AAGAGCTGAA 
ATTCCTGTGG 
TTGAGGATGG 
ACAGAGCAAA 
AGGAAGGCCA 
TGGATGGCCA 
GTTCAGAGGA 
AGCAAGGAGA 
GAGGGACTTT 
TCTAAAATGC 
CCCTCAGCCA 
TGTGGTGTGA 
TAACGCTTTA 
GTTGTCCTAA 



31 
I 

GATGCTGAAC 
GCTGGGGCTG 
GCTGTTTCCC 
GGGACAAGAA 
CACAATCGGA 
ATCAGTTCTG 
ATGCTTTGAA 
ATGTCGGGTA 
CCTCTATGAA 
CACCCTGCTG 
AGTCCGCTTC 
GCTGATTTCC 
TTCAGTACTT 
CACCCAGGCA 
GCTTCAGATG 
AATGGCTACT 
AGAATAAATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
186.0 
1920 
1980 
2040 
2100 
2160 
2220 
22B0 



41 


51 




1 

DRQLAEEYIiY 


1 

RYGYTRVAEM 


60 


VPDLGRFQTF 


EGDLKWHHHN 


120 


SRDADIVIQF 


GVAEHGDGYP 


180 


RFGNADGAAC 


HFPFIFEGRS 


240 


GNADGKPCQF 


PFIFQGQSYS 


300 


NSAGELCVFP 


FTFLGKEYST 


360 


HEFGHALGLD 


HSSVPEALMY 


420 


PTAPPTVCPT 


GPPTVHPSER 


480 


FDAIAEIGNQ 


LYLFKDGKYW 


540 


FFSGRQVWVY 


TGASVLGPRR 


600 


MVDPRSASEV 


DRMFPGVPLD 


660 


XLQCPED 






41 


51 




1 

AAAGTGCTGT 


1 

CCCGGCTGGG 


60 


GAAGAGGAGT 


CTCTGGGCTC 


120 


CTCACGGCCC 


AGCATGAGAA 


180 


GTTAGTCCTA 


AAGTGTACTT 


240 


CTTATTCACG 


CAGTGGCCAA 


300 


AAACAGTTTC 


TTTCTGAAAC 


360 


AAGAATGAGG 


CCATACAGGC 


420 


GATGACAAGG 


TGAATTTCCA 


480 


CTTGATGGAC 


GAATGCCTTT 


540 


AAGGAGGCTG 


CCAAGGTGTG 


600 


TCTGCCGTGG 


CTCTCTGCAA 


660 


CCTCTTCCCT 


TCAACATGAA 


720 


GTGAAACACA 


GCTGTTCTTC 


780 


CTTAAGCACA 


AGCAGAGTGC 


840 


GTGAAGCATT 


CTCCCCAGTG 


900 


TTGGTTTCTG 


TCTGTAAGTT 


960 


TTGCTGATAG 


TAGC 





Seq ID NO: 128 Protein sequence-. 
Protein Accession ft: NP_004172 
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i WO 02/086443 n 

I I I I I I 

MLNKVLSRLG VAGQWRFVDV LGI/EEESLGS VPAPACALLL LFPLTAQHEN FRKKQIEELK 60 

GQEVSPKVYF MKQTIGNSCG TIGLIHAVAN NQDKLGFEDG SVLKQFLSET EKMSPEDRAX 120 

CPEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILFNNVDGH LYELDGRMPF PVNH GASSED 180 
TLLKDAAKVC REFTEREQGE VRFSAVALCK AA 

Seq ID NOt 129 DNA sequence 
Nucleic Acid Accession #: NM_000213 
Coding sequence* 127-5385 

1 11 21 31 41 51 

I I I I 1 1 

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGGGGAGGG AGCGAGTCCG 60 

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 120 

AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180 

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300 

CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360 

GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 

AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 

TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600 

GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660 

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720 

CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840 

ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900 

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960 

GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020 

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGT CCAACAT CGTGGAGCTG 1200 

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260 

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320 

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380 

GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440 

TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 1500 

CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560 

TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620 

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680 

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800 

ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 

AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 1920 

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980 

TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040 

GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100 

GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 2220 

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 

CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 

TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 

GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGG CAGATC 2700 

TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820 

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880 

GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2 940 

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 

CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060 

GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 

TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180 

GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240 

TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 

GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 

TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 

ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480 

TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3540 

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 

GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660 

CCCTCAGTGG AGCTCACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 

TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780 

GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840 

AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900 

CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960 

AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020 

AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGOGGGAGG CCATCATCAA CCTGGCCACC 4080 

CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 4200 

GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260 

TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 

TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 4380 
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ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 

GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 

GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 

CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 

GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 46 BO 

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 

TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920 

GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 

GCATTCCGGG 7GGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 

AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 

GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 

GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 

AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA S280 

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 

GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 

CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 

TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 55 BO 

AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
ACTG 

Seq ID NO: 130 Protein sequence: 
Protein Accession #: NP_000204 

1 11 21 31 41 51 

I I 1 I I I 

MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60 

CNTQAELLAA GCQRESIWM ESSFQITEET QIDTTLRRSQ MSPQGLRVRL RPGEERHPEL 120 

EVFEPLESPV DLYILMDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180 

PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240 

QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVL.AGI MSRNDERCHL DTTGTYTQYR 300 

TQDYPSVPTL VRLLAKHNII P I F AVTNYSY SYYEKLHTYF PVSSLGVUJE DSSNIVELLE 360 

EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420 

THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCTCELQ KEVRSARCSF NGDFVCGQCV 480 

CSEGWSGQTC NCSTG8L.SDI QPCLREGEDK PCSGRGBCQC GHCVCYGEGR YEGQFCEYDN 540 

FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 600 

CHCHQQSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 660 

LXRAEEWVR CSFRDEDDDC TYSYTMBGDG APGPNSTVLV HKKKDCPPGS FVWIjIPIiljMj 720 

LLPLLlALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780 

SGNLKGRDW RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NIjLKPDTREC 840 

AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALI»K 900 

LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960 

LVEAI DVPAG TATLGRRLVN ITIIKEQARD WSFEQPEFS VSRGDQVARI PVIRRVLDGG 1020 

KSQVSYRTQD GTAQGNRDYI P VEGELL FQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 1080 

VQLSMPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140 

RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 1200 

AQGEGPYSSL VSCRTHQEVP SBPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 1260 

NDONRPIGPM KKVLVDNPKN RMLLIENLRB SQPYRYTVKA RNGAGWGPER EAIINLATQP 1320 

KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 1380 

FPGSTNSLHR MTTT9AAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440 

STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLLNGG 1500 

E1HRLN1PNP AQTSVWEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 1560 

LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620 

RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITXESQPGG PFPQLGSRAG 1680 

LFQHPLQSEY SSITTTHTBA TEPFLVDGPT LGAQHLEAGG SLTRHVTQEF VSRTLTTSGT 1740 
LSTHMDQQFF QT 

Seq ID NOt 131 DNA sequence 
Nucleic Acid Accession #i BC004372 
Coding sequence: 132. .2231 

1 ll 21 31 , 41 51 

I I I 1 I I 

CCTCGTGCCG CGGACCCCAG CCTCTGCCAG GTTCGGTCCG CCATCCTCGT CCCGTCCTCC 60 

GCCGGCCCCT GCCCCGCGCC CAGGGATCCT CCAGCTCCTT TCGCCCGCGC CCTCCGTTCG 120 

CTCCGGACAC CATGGACAAG TTTTGGTGGC ACGCAGCCTG GGGACTCTGC CTCGTGCCGC 180 

TGAGCCTGGC GCAGATCGAT TTGAATATAA CCTGCCGCTT TGCAGGTGTA TTCCACGTGG 240 

AGAAAAATGG TCGCTACAGC ATCTCTCGGA CGGAGGCCGC TGACCTCTGC AAGGCTTTCA 300 

ATAGCACCTT GCCCACAATG GCCCAGATGG AGAAAGCTCT GAGCATCGGA TTTGAGACCT 360 

GCAGGTATGG GTTCATAGAA GGGCATGTGG TGATTCCCCG GATCCACCCC AACTCCATCT 420 

GTGCAGCAAA CAACACAGGG GTGTACATCC TCACATCCAA CACCTCCCAG TATGACACAT 480 

ATTGCTTCAA TGCTTCAGCT CCACCTGAAG AAGATTGTAC ATCAGTCACA GACCTGCCCA 540 

ATGCCTTTGA TGGACCAATT ACCATAACTA TTGTTAACCG TGATGGCACC CGCTATGTCC 600 

AGAAAGGAGA ATACAGAACG AATCCTGAAG ACATCTACCC CAGCAACCCT ACTGATGATG 660 

ACGTGAGCAG CGGCTCCTCC AGTGAAAGGA GCAGCACTTC AGGAGGTTAC ATCTTTTACA 720 

CCTTTTCTAC TGTACACCCC ATCCCAGACG AAGACAGTCC CTGGATCACC GACAGCACAG 780 

ACAGAATCCC TGCTACCAGT ACGTCTTCAA ATACCATCTC AGCAGGCTGG GAGCCAAATG B40 

AAGAAAATGA AGATGAAAGA GACAGACACC TCAGTTTTTC TGGATCAGGC ATTGATGATG 900 

ATGAAGATTT TATCTCCAGC ACCATTTCAA CCACACCACG GGCTTTTGAC CACACAAAAC 960 

AGAACCAGGA CTGGACCCAG TGGAACCCAA GCCATTCAAA TCCGGAAGTG CTACTTCAGA 1020 

CAACCACAAG GATGACTGAT GTAGACAGAA ATGGCACCAC TGCTTATGAA GGAAACTGGA 1080 

ACCCAGAAGC ACACCCTCCC CTCATTCACC ATGAGCATCA TGAGGAAGAA GAGACCCCAC 1140 

ATTCTACAAG CACAATCCAG GCAACTCCTA GTAGTACAAC GGAAGAAACA GCTACCCAGA 1200 

AGGAACAGTG GTTTGGCAAC AGATGGCATG AGGGATATCG CCAAACACCC AGAGAAGACT 1260 
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CCCATTCGAC AACAGGGACA GCTGCAGCCT 
GGACAACACC AAGCCCAGAG GACAGTTCCT 
CCATGGGACG AGGTCATCAA GCAGGAAGAA 
CGCTTCAGCC TACTGCAAAT CCAAACACAfl 
CTCTTTCAAT GACAACGCAG CAGAGTAATT 
TGGAAGAAGA TAAAGACCAT CCAACAACTT 
TCACAGGTGG AAGAAGAGAC CCAAATCATT 
ATACCTCTCA TTACCCACAC ACGAAGGAAA 
AGACTGGGTC CTTTGGAGTT ACTGCAGTTA 
GTTCCTTATC AGGAGACCAA GACACATTCC 
GATCTGAATC AGATGGACAC TCACATGGGA 
GTCCTATAAG GACACCCCAA ATTCCAGAAT 
TGGCTTTGAT TCTTGCAGTT TGCATTGCAG 
AAAAGCTAGT GATCAACAGT GGCAATGGAG 
ACGGAGAGGC CAGCAAGTCT CAGGAAATGG 
CTCCAGACCA GTTTATGACA GCTGATGAGA 
TTGGGGTGTA ACACCTACAC CATTATCTTG 
TACAGGGAGC TGGGACACTT AACAGATGCA 
TTTTTAGCAT AAAATTTTCT ACTCTTAAAA 



Beq ID NO: 132 Protein sequence-. 
Protein Accession #: AAH04372 



1 
I 

MDKFWWHAAW 
PTMAQMEKAL 
ASAPPEEDCT 
GSSSERSSTS 
DERDHHIiSFS 
MTDVDRNGTT 
FGNRWHEGYR 
GHQAGRRMDM 
KDHPTT8TLT 
FGVTAVTVGD 
TPQIPEWLII 
SKSQEMVHLV 



11 

I 

GLCIiVPLSLA 
SIGFETCRYG 
SVTDLPNAFD 
GGYIPYTFST 
GSGIDDDEDF 
AYEGNWNPEA 
QTPREDSHST 
DSSHSTTLQP 
SSNRNDVTGG 
SNSNVNRSLS 
LASLLALALI 
NKESSETPDQ 



21 
I 

QIDLNITCRP 
PIEGHWIPR 
GPITITIVNR 
VHPIPDEDSP 
ISSTISTTPR 
HPPLIHHEHH 
TGTAAASAHT 
TANPNTGLVE 



GDQDTFHPSG 
LAVCIAVNSR 
FMTADETRNL 



CAGCTCATAC 
GGACTGATTT 
GGATGGATAT 
GTTTGGTGGA 
CTCAGAGCTT 
CTACTCTGAC 
CTGAAGGCTC 
GCAGGACCTT 
CTGTTGGAGA 
ACCCCAGTGG 
GTCAAGAAGG 
GGCTGATCAT 
TCAACAGTCG 
CTGTGGAGGA 
TGCATTTGGT 
CAAGGAACCT 
GAAAGAAACA 
ATGTGCTACT 
AAAAAAAAAA 



31 
I 

AGVFHVEKNG 
IHPNSICAAN 
DGTRYVQKGE 
WITDSTDRIP 
AFDHTKQNQD 
EEEETPHSTS 
SHPKQGRTTP 
DLDRTGPLSM 
TTLLEGYTSH 
GSHTTHGSES 
RRCGQKKKLV 
QNVDMKIGV 



CAGCCATCCA 
CTTCAACCCA 
GGACTCCAGT 
AGATTTGGAC 
CTCTACATCA 
ATCAAGCAAT 
AACTACTTTA 
CATCCCAGTG 
TTCCAACTCT 
GGGGTCCCAT 
TGGAGCAAAC 
CTTGGCATCC 
AAGAAGGTGT 
CAGAAAGCCA 
GAACAAGGAG 
GCAGAATGTG 
ACCGTTGGAA 
GATTGTTTCA 
AAAAAAA 



41 
I 

RYSISRTEAA 
NTGVYILTSN 
YRTNPEDIYP 
ATSTSSNTIS 
WTQWNPSHSN 
TIQATPSSTT 
SPEDSSWTDF 
TTQQSNSQSF 
YPHTKESRTF 
DGHSHGSQEG 
INSGNGAVED 



ATGCAAGGAA 
ATCTCACACC 
CATAGTACAA 
AGGACAGGAC 
CATGAAGGCT 
AGGAATGATG 
CTGGAAGGTT 
ACCTCAGCTA 
AATGTCAATC 
ACCACTCATG 
ACAACCTCTG 
CTCTTGGCCT 
GGGCAGAAGA 
AGTGGACTCA 
TCGTCAGAAA 
GACATGAAGA 
ACATAACCAT 
TTGCGAATCT 



51 
I 

DLCKAFNSTL 
TSQYDTYCFN 
SNPTDDDVSS 
AGWEPNEENE 
PEVLLQTTTR 
EETATQKEQW 
FNPISHPMGR 



1PVTSAKTGS 
GANTTSGPIR 
RKPSGLNGEA 



Seq ID NO i 133 DNA sequence 
Nucleic Acid Accession NM_002882 
Coding sequence: 150-755 



CGAGGTTCGG 
GCGGAGGGAA 
AGCCGAGCGG 
ATGATACTTC 
TTTCTCTTCC 
TGCGGGCAAA 
GCACTGGTGA 
GGAGGGACAA 
AGCCCAACGC 
AGTGCCCCAA 
TCAAAACAAA 
CAGGCAAAAA 
AGGAGACCAA 
TCTCTTTCCT 
ATTCTTTCAT 



11 

I 

GTCGTGGGGC 
GGAGCTACGA 
CCGCCGCCGC 
CACTGAGAAT 
TGAGCAAGAA 
ACTGTT CCG A 
CGTCAAGCTC 
GACCCTGAAG 
AGGTAGCGAC 
GCCAGAGCTG 
GTTTGAAGAA 
TGATCATGCC 
GGAGGATGCT 
TTCCTTTTTT 
TTTTACAAGG 



21 
I 

GGAGGGAAGA 
GTAGCCGCCG 
CGCGCCCCCA 
ACAGACGAGT 
ATTAAAACAC 
TTTGCCTCTG 
CTGAAGCACA 
ATCTGTGCCA 
CGTGCCTGGG 
CTGGCCATCC 
TGCAGGAAAG 
GAAAAAGTGG 
GAGGAGAAGC 
TAAAAAATTT 
GACGTTATAT 



31 
I 

GCGGGCGGGC 
AGAGGCCGCG 
TGGCGGCCGC 
CCAACCATGA 
TGGAAGAAGA 
AGAACGATCT 
AGGAGAAAGG 
ACCACTACAT 
TCTGGAACAC 
GCTTCCTGAA 
AGATCGAAGA 
CGGAAAAGCT 
AATAAATCGT 
TACCCTGCCC 
AAAGAACTGA 



41 
I 

GGGAGGCGCC 
GAGCCAGCGA 
CAAGGACACT 
CCCTCAGTTT 
TGAAGAGGAA 
CCCAGAATGG 
GGCCATCCGC 
CACGCCGATG 
CCACGCTGAC 
TGCTGAGAAT 
GAGAGAAAAG 
AGAAGCTCTC 
CTTATTTTAT 
CTCTTTTTCG 
ACTC 



51 
I 

GGCGCCAGAC 
CGACCGACCC 
CATGAGGACC 
GAG CCAAT AG 
CTTTTTAAAA 
AAGGAGCGAG 
CTCCTCATGC 
ATGGAGCTGA 
TTCGCCGACG 
GCACAGAAAT 
AAAGCAGGAT 
TCGGTGAAGG 
TTTCTTTTCC 
GTTTGTTTTT 



Seq ID NOt 134 Protein sequence: 
Protein Accession #: NP 002873 



31 



41 



51 



1 11 21 

11(111 
MAAAKDTHED HDTSTBNTDE SNHDPQFEPI VSLPEQEIKT LEEDEEELFK MRAKLFRFAS 
ENDLPEWKER GTGDVKLLKH KEKGAIRLLM RRDKTLKICA NHYITPMMEL KPNAGSDRAW 
VWNTHADFAD ECPKPELLAI RFLNAENAQK FKTKFEECRK EIEEREKKAG SGKNDHAEKV 
AEKLEALSVK EETKEDAEEK Q 

Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM_O0O077.2 

Coding sequence: 277-742 



1 
I 

CCCAACCTGG 
TCCTCCGAGC 
GGATTTGAGG 
GGGCTGGCTG 
GGAGAGCAGG 
GCCTTCGGCT 
GCTGCTGGAG 
GGTCATGATG 



11 
I 

GGCGACTTCA 
ACTCGCTCAC 
GACAGGGTCG 
GTCACCAGAG 
CAGCGGGCGG 
GACTGGCTGG 
GCGGGGGCGC 
ATGGGCAGCG 



21 

I 

GGTGTGCCAC 
GGCGTCCCCT 
GAGGGGGCTC 
GGTGGGGCGG 
CGGGGAGCAG 
CCACGGCCGC 
TGCCCAACGC 
CCCGAGTGGC 



31 
I 

ATTCGCTAAG 
TGCCTGGAAA 
TTCCGCCAGC 
ACCGCGTGCG 
CATGGAGCCG 
GGCCCGGGGT 
ACCGAATAGT 
GGAGCTGCTG 



41 

I 

TGCTCGGAGT 
GATACCGCGG 
ACCGGAGGAA 
CTCGGCGGCT 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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CGGGTAGAGG 
TACGGTCGGA 
CTGCTCCACG 



51 
I 

TAATAGCACC 
TCCCTCCAGA 
GAAAGAGGAG 
GCGGAGAGGG 
GCAGCATGGA 
AGGTGCGGGC 
GGCCGATCCA 
GCGCGGAGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
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CAACTGCGCC GACCCCGCCA CTCTCACCOG ACCCGTGCAC GACQCTGCCC GGQA6GGCTT 540 

CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC GCGATGCCTG 600 

GGGCCGTCTG CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGCGATGTCG CACGGTACCT 660 

GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG TAACCATGCC CGCATAGATG CCGCGGAAGG 720 

5 TCCCTCAGAC ATCCCCGATT GAAAGAACCA GAGAGGCTCT GAGAAACCTC GGGAAACTTA 780 

GATCATCAGT CACCGAAGGT CCTACAGGGC CACAACTGCC CCCGCCACAA CCCACCCCGC 840 

TTTCGTAGTT TTCATTTAGA AAATAGAGCT TTTAAAAATG TCCTGCCTTT TAACGTAGAT 900 

ATATGCCTTC CCCCACTACC GTAAATGTCC ATTTATATCA TTTTTTATAT ATTCTTATAA 960 

AAATGTAAAA AAGAAAAACA CCGCTTCTGC CTTTTCACTG TGTTGGAGTT TTCTGGAGTG 1020 

10 AG CACTCACG CCCTAAGCGC ACATTCATGT GGGCATTTCT TGCGAGCCTC GCAGCCTCCG 1080 

GAAGCTGTCG ACTTCATGAC AAGCATT T TG TGAACTAGGG AAGCTCAGGG GGGTTACTGG 1140 

CTTCTCTTGA GTCACACTGC TAGCAAATGG CAGAACCAAA GCTCAAATAA AAATAAAATA 1200 
ATTTTCATTC ATTCACTC 

15 Seq ID HOt 136 Protein sequence: 
Protein Accession #: NP_000068.1 

1 11 21 31 41 51 

20 MEPAAGSSME PSADWLATAA ARGRVEEVRA LLEAGALPNA PNSYGRRPIQ VMMMGSARVA 60 

ELLLLHGAEP NCADPATLTR PVHDAAREGP LDTL.WLHRA GARLDVRDAW GRIiPVDLAEE 120 
LGHRDVARYL RAAAGGTRGS NHARIDAAEG PSDIPD 



25 Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: NM_05B196.1 
Coding sequence: 104-421 



30 i ii 21 

1 I I 

TGTGTGGGGG TCTGCTTGGC GGTGAGGGGG 
GCCCCCACCC TGGCTCTGAC CATTCTGTTC 
CCGAGTGGCG GAGCTGCTGC TGCTCCACGG 

35 TCTCACCCGA CCCGTGCACG ACGCTGCCCG 
GCACCGGGCC GGGGCGCGGC TGGACGTGCG 
GGCTGAGGAG CTGGGCCATC GCGATGTCGC 
CAGAGGCAGT AACCATGCCC GCATAGATGC 
AAAGAACCAG AGAGGCTCTG AGAAACCTCG 

40 CTACAGGGCC ACAACTGCCC CCGCCACAAC 
AATAGAGCTT TTAAAAATGT CCTGCCTTTT 
TAAATGTCCA TTTATATCAT TTTTTATATA 
CGCTTCTGCC XTTTCACTGT GTTGGAGTTT 
CATTCATGTG GGCATTTCTT GCGAGCCTCG 

45 AGCATTTTGT GAACTAGGGA AGCTCAGGGG 

AGCAAATGGC AGAACCAAAG CTCAAATAAA 



31 41 51 

I I I 

CTCTACACAA GCTTCCTTTC CGTCATGCCG 60 

TCTCTGGCAG GTCATGATGA TGGGCAGCGC 120 

CGCGGAGCCC AACTGCGCCG ACCCCGCCAC 180 

GGAGGGCTTC CTGGACACGC TGGTGGTGCT 240 

CGATGCCTGG GGCCGTCTGC CCGTGGACCT 300 

ACGGTACCTG CGCGCGGCTG CGGGGGGCAC 360 

CGCGGAAGGT CCCTCAGACA TCCCCGATTG 420 

GGAAACTTAG ATCATCAGTC ACCGAAGGTC 480 

CCACCCCGCT TTCGTAGTTT TCATTTAGAA 540 

AACGTAGATA TAAGCCTTCC CCCACTACCG 600 

TTCTTATAAA AATGTAAAAA AGAAAAACAC 660 

TCTGGAGTGA GCACTCACGC CCTAAGCGCA 720 

CAGCCTCCGG AAGCTGTCGA CTTCATGACA 7 B0 

GGTTACTGGC TTCTCTTGAG TCACACTGCT 840 
AATAAAATAA TTTTCATTCA TTCACTC 



Seq ID NO: 138 Protein sequence: 
50 Protein Accession ft: NP_478103.1 

1 11 21 31 41 51 

I I I I I I 

MMMGSARVAE LLLLHGAEPN CADPATLTRP VHDAAREGFL DTXjWLHRAG ARLDVRDAWG 
55 RLPVDIiAEEL GHRDVARYLR AAAGGTRGSN HARIDAAEGP SDIPD 



60 



Seq ID NO: 139 DNA sequence 

Nucleic Acid Accession #t NM_058197.1 

Coding sequence: 272-684 



1 11 21 31 41 51 

I I I I I I 

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60 

TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120 

65 GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180 

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300 

GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCACGG CCGCGGCCCG 360 

GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA ACGCACCGAA 420 

70 TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 480 

CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT 540 

CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 600 

TCCTGGCGAC GCCCTGGGGG CTTGGGAAAC CAAGGAAGAG GAATGAGGAG CCACGCGCGT 660 

ACAGATCTCT CGAATGCTGA GAAGATCTGA AGGGGGGAAC ATATTTGTAT TAGATGGAAG 720 

75 TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC GCGGAGCCCA 780 

ACTGCGCCGA CCCCGCCACT CTCACCCGAC CCGTGCACGA CGCTGCCCGG GAGGGCTTCC 840 

TGGACACGCT GGTGGTGCTG CACCGGGCCG GGGCGCGGCT GGACGTGCGC GATGCCTGGG 900 

GCCGTCTGCC CGTGGACCTG GCTGAGGAGC TGGGCCATCG CGATGTCGCA CGGTACCTGC 960 

GCGCGGCTGC GGGGGGCACC AGAGGCAGTA ACCATGCCCG CATAGATGCC GCGGAAGGTC 1020 

80 CCTCAGACAT CCCCGATTGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAACTTAGAT 1080 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 1140 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 1200 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT OTTATATATT CTTATAAAAA 1260 

TGTAAAAAAG AAAAACACOG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1320 

85 ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1380 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1440 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1500 



240 



WO 02/086443 

TTCATTCATT CACTC 
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Seq ID NO: 140 Protein sequence: 
Protein Accession #i NP_478104.1 

1 11 21 31 41 51 

I I I I I I 

MEPAAGSSME PAAGSSMEPS ADWLATAAAR GRVEEVRALL EAGALPNAPN SYGRRPIQVG 60 
RRSAAGAGDG GRLWRTKFAG ELESGSASIIi RKKGRLPGEF SEGVCNHRPP PGDALGAWET 120 



Seq ID NO: 141 DNA sequence 

Nucleic Acid Accession If t NM_058195.1 

Coding sequence: 163-684 



1 11 21 31 41 51 

I I 1 1 I I 

CCTCCCTACG GGCGCCTCCG GCAGCCCTTC CCGCGTGCGC AGGGCTCAGA GCCGTTCCGA 60 

GATCTTGGAG GTCCGGGTGG GAGTGGGGGT GGGGTGGGGG TGGGGGTGAA GGTGGGGGGC 120 

20 GGGCGCGCTC AGGGAAGGCG GGTGCGCGCC TGCGGGGCGG AGATGGGCAG GGGGCGGTGC 180 

GTGGGTCCCA GTCTGCAGTT AAGGGGGCAG GAGTGGCGCT GCTCACCTCT GGTGCCAAAG 240 

GGCGGCGCAG CGQCTGCCGA GCTCGGCCCT GGAGGCGGCG AGAACATGGT GCGCAGGTTC 300 

TTGGTGACCC TCCGGATTCG GCGCGCGTGC GGCCCGCCGC GAGTGAGGGT TTTCGTGGTT 360 

_ _ CACATCCCGC GGCTCACGGG GGAGTGGGCA GCGCCAGGGG CGCCCGCCGC TGTGGCCCTC 420 

25 GTGCTGATGC TACTGAGGAG CCAGCGTCTA GGGCAGCAGC CGCTT CCT AG AAGACCAGGT 480 

CATGATGATG GGCAGCGCCC GAGTGGCGGA GCTGCTGCTG CTCCACGGCG CGGAGCCCAA 540 

CTGCGCCGAC CCCGCCACTC TCACCCGACC CGTGCACGAC GCTGCCCGGG AGGGCTTCCT 600 

GGACACGCTG GTGGTGCTGC ACCGGGCCGG GGCGCGGCTG GACGTGCGCG ATGCCTGGGG 660 

CCGTCTGCCC GTGGACCTGG CTGAGGAGCT GGGCCATCGC GATGTCGCAC GGTACCTGCG 720 

30 CGCGGCTGCG GGGGGCACCA GAGGCAGTAA CCATGCCCGC ATAGATGCCG CGGAAGGTCC 780 

CTCAGACATC CCCGATTGAA AGAACCAGAG AGGCTCTGAG AAACCTCGGG AAACTTAGAT 840 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 900 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 960 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1020 

35 TQTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1080 

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1140 

GCTGTCGACT TCATGACAAG CATT TTGTG A ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1200 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1260 
TTCATTCATT CACTC 



Seq ID NO: 142 Protein sequence: 
Protein Accession #: NP_478102.1 



45 1 11 21 31 41 51 

I I I I I I 

MGRGRCVGPS LQLRGQEWRC SPLVPKGGAA AAELGPGGGE NMVRRPLVTL RIRRACGPPR 60 
VRVFWHIPR IjTGEWAAPGA PAAVAIiVLML LRSQRLGQQP LPRRPGHDDG QRPSGGAAAA 120 
PRRGAQLRRP RHSHPTRARR CPGGLPGHAG GAAPGRGAAG RARCLGPSAR GPG 



Seq ID NO: 143 DNA sequence 
Nucleic Acid Accession #: NM_018131 
Coding sequence: 412.. 1107 



1 11 21 31 41 51 

I I 1 I I I 

GAAATTGCAC ACTTAAAGAC ATCAGTGGAT GAAATCACAA GTGGGAAAGG AAAGCTGACT 60 

GATAAAGAGA GACAGAGACT TTTGGAGAAA ATTCGAGTCC TTGAGGCTGA GAAGGAGAAG 120 

60 AATGCTTATC AACTCACAGA GAAGGACAAA GAAATACAGC GACTGAGAGA CCAACTGAAG 180 

GCCAGATATA GTACTACCGC ATTGCTTGAA CAGCTGGAAG AGACAACGAG AGAAGGAGAA 240 

AGGAGGGAGC AGGTGTTGAA AGCCTTATCT GAAGAGAAAG ACGTATTGAA ACAACAGTTG 300 

TCTGCTGCAA CCTCACGAAT TGCTGAACTT GAAAGCAAAA CCAATACACT CCGTTTATCA 360 

CAGACTGTGG CTCCAAACTG CTTCAACTCA TCAATAAATA ATATTCATGA AATGGAAATA 420 

65 CAGCTGAAAG ATGCTCTGGA GAAAAATCAG CAGTGGCTCG TGTATGATCA GCAGCGGGAA 480 

GTCTATGTAA AAGGACTTTT AGCAAAGATC TTTGAGTTGG AAAAGAAAAC GGAAACAGCT 540 

GCTCATTCAC TCCCACAGCA GACAAAAAAG CCTGAATCAG AAGGTTATCT TCAAGAAGAG 600 

AAGCAGAAAT GTTACAACGA TCTCTTGGCA AGTGCAAAAA AAGATCTTGA GGTTGAACGA 660 

CAAACCATAA CTCAGCTGAG TTTTGAACTG AGTGAATTTC GAAGAAAATA TGAAGAAACC 720 

70 CAAAAAGAAG TTCACAATTT AAATCAGCTG TTGTATTCAC AAAGAAGGGC AGATGTGCAA 780 

CATCTGGAAG ATGATAGGCA TAAAACAGAG AAGATACAAA AACTCAGGGA AGAGAATGAT 840 

ATTGCTAGGG GAAAACTTGA AGAAGAGAAG AAGAGATCCG AAGAGCTCTT ATCTCAGGTC 900 

CAGTCTCTTT ACACATCTCT GCTAAAGCAG CAAGAAGAAC AAACAAGGGT AGCTCTGTTG 960 

GAACAACAGA TGCAGGCATG TACTTTAGAC TTTGAAAATG AAAAACTCGA CCGTCAACAT 1020 

75 GTGCAGCATC AATTGCATGT AATTCTTAAG GAGCTCCGAA AAGCAAGAAA AAATAACACA 1080 

GTTGGAATCC TTGAAACAGC TTCATGAGTT TGCCATCACA GAGCCATTAG TCACTTTCCA 1140 

AGGAGAGACT GAAAACAGAG AAAAAGTTGC CGCCTCACCA AAAAGTCCCA CTGCTGCACT 1200 

CAATGGAAGC CTGGTGGAAT GTCCCAAGTG CAATATACAG TATCCAGCCA CTGAGCATCG 1260 

CGATCTGCTT GTCCATGTGG AATACTGTTC AAAGTAGCAA AATAAGTATT TGTTTTGATA 1320 

80 TTAAAAGATT CAATACTGTA TTTTCTGTTA GCTTGTGGGC ATTTTGAATT ATATATTTCA 1380 

CATTTTGCAT AAAACTGCCT ATCTACCTTT GACACTCCAG CATGCTAGTG AATCATGTAT 1440 

CTTTTAGGCT GCTGTGCATT TCTCTTGGCA GTGATACCTC CCTGACATGG TTCATCATCA 1500 

GGCTGCAATG ACAGAATGTG GTGAGCAGCG TCTACTGAGA TACTAACATT TTGCACTGTC 1560 

_ _ AAAATACTTG GTGAGGAAAA GATAGCTCAG GTTATTGCTA ATGGGTTAAT GCACCAGCAA 1620 

85 GCAAAATATT TTATGTTTCG GGGGTTTTGA AAAATCAAAG ATAATTAACC AAGGATCTTA 1680 

ACTGTGTTCG CATTTTTTAT CCAAGCACTT AGAAAACCTA CAATCCTAAT TTTGATGTCC 1740 

ATTGTTAAGA GGTGGTGATA GATACTATTT TTTTTTCATA TTGTATAGCG GTTATTAGAA 1800 
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WO 02/086443 

AAGTTGGGGA TTTTCTTGAT CTTTATTGCT GCTTACCATT GAAACTTAAC 
TCCCCAACTC TGTTCTGCGC ACGAAACAGT ATCTGTTTGA GGCATAATCT 
CACACAATGT TTTCTCTTAT GTTATCTGGC AGTAACTGTA ACTTGAATTA 
TTCTGCTTAG CTAAAATTGT TAAAATAAAC TTTAATAAAC CCATGTAGCC 
ATTGACAGTA TTTTAGTTAT TTTTGGCATT CTTAAAGCTG GGCAATGTAA 
TTTGTTTGTC TGAACAGGTA TTTTTATACA TGCTTTTTGT AAACCAAAAA 
TCTTCAGGTT TTCTAACATG CTTACCACTG GGCTACTGTA AATGAGAAAA 
ATTTAATGTT TT 



Seq ZD NOz 144 Protein sequence: 
Protein Accession ft: NP 060601 



CCAGCTGTGT 1860 

TAAGTGGCCA 1920 

CATTAGCACA 1980 

CTCTCATTTG 2040 

TGATCAGATC 2100 

CTTTTAAATT 2160 

GAATAAAATT 2220 



PCT/US02/12476 



11 



21 31 41 51 

I I I 1 

MEIQL.KDALE KNQQVJLiVYDQ QREVYVKGLL AKIFELEKKT ETAAHSLPQQ TKKPESEGYL 
QEEKQKCYND LLASAKKDLE VERQTITQLS FELSEFRRKY EETQKEVHNL NQLLYSQRRA 
DVQHLEDDRH KTEKIQKLRE ENDIARGKLE EEKKRSEELL SQVQSLYTSL IaKQQEEQTRV 
ALItEQQMQAC TIiDFENEKLD RQHVQHQLHV ILKELRKARK NNTVGILETA S 

Seq ID NOt 145 DMA sequence 
Nucleic Acid Accession #: NM_001168 
Coding sequencer 50.. 478 



CCGCCAGATT 
GACGTTGCCC 

CCACTGCCCC 
GGAAGGCTGG 
GGCTTT CCTT 
GGACAGAGAA 
TGAGGAAACT 
CCTCTGGCCG 
GTGCCACCAG 
CAAATTAGAT 
TGCCTGTGCA 
GGGGGCTCAT 
AAGGCAGTGT 
GTGAATGTGT 
GGTGCCTGTT 
ACAGTTTTTT 
GTGATGAGAG 
TATTTTGTTT 
AGCCATTCTA 
AGTGATAGGA 
AGTGAGCCGC 
CTTTTTAAAT 
TCTGTCAGCC 
CCAGGTCCCC 
GATGGATTTG 
GCTGGAAACC 



11 
I 

TGAATCGCGG 
CCTGCCTGGC 
TTGGAGGGCT 
ACTGAGAACG 
GAGCCAGATG 
TCTGTCAAGA 
AGAGCCAAGA 
GCGAAGAAAG 
GAGCTGCCTG 
CCTTCCTGTG 
GTTTCAACTG 
GCGGGTGCTG 
TTTTGCTGTT 
CCCTTTTGCT 
CTGGACCTCA 
GAATCTGAGC • 
TGTTGTTGTG 
AATGGAGACA 
GAATTGTTAA 
AGTCATTGGG 
AGCGTCTGGC 
GGGGCACATG 
GACTTGGCTC 
CAACCTTCAC 
GCTTTCTTTG 
ATTCGCCCTC 
TCTGGAGGTC 



21 
I 

GACCCGTTGG 
AGCCCTTTCT 
GCGCCTGCAC 
AGCCAGACTT 
ACGACCCCAT 
AGCAGTTTGA 
ACAAAATTGC 
TGCGCCGTGC 
GTCCCAGAGT 
GGCCCCTTAG 
TGCTCCTGTT 
CTGGTAACAG 
TTGATTCCCG 
AGAGCTGACA 
TGTTGTTGAG 
TGCAGGTTCC 
TTTTTTTGTT 
GAGTCCCTGG 
TTCACAGAAT 
GAAACGGGGT 
AGATACTCCT 
CTGGCCGCTC 
GATGCTGTGG 
ATCTGTCACG 
GAGGCAGCAG 
CTCCCTGTCA 
ATCTCGGCTG 



31 
I 

CAGAGGTGGC 
CAAGGACCAC 
CCCGGAGCGG 
GGCCCAGTGT 
AGAGGAACAT 
AGAATTAACC 
AAAGGAAACC 
CAT CG AG CAG 
GGCTGCACCA 
CAATGTCTTA 
TTGTCTTGAA 
TGGCTGCTTC 
GGCTTACCAG 
GCTTTGTTCG 
GCTGTCACAG 
TTATCTGTCA 
TTTTTTTTTT 
CTCCTCTACT 
AGCACAAACT 
GAACTTCAGG 
TTTGCCACTG 
CTCCCTCAGA 
GGGACTGGCT 
TTCTCCACAC 
CTCCCGCAGG 
TAGAGCTGCA 
TTCCTGAGAA 



41 
I 

GGCGGCGGCA 
CGCATCTCTA 
ATGGCCGAGG 
TTCTTCTGCT 
AAAAAGCATT 
CTTGGTGAAT 
AACAATAAGA 
CTGGCTGCCA 
CTTCCAGGGT 
GGAAAGGAGA 
AGTGGCACCA 
TCTCTCTCTC 
GTGAGAAGTG 
CGTGGGCAGA 
TCCTGAGTGT 
CACCTGTGCC 
GGTAGATGCA 
GTTTAACAAC 
ACAATTAAAA 
TGGATGAGGA 
CTGTGTGATT 
AAAAGGCAGT 
GGGCTGCTGC 
GGGGGAGAGA 
GCTGAAGTCT 
GGGTGGATTG 
ATAAAAAGCC 



51 
I 

TGGGTGCCCC 
CATTCAAGAA 
CTGGCTTCAT 
TCAAGGAGCT 
CGTCCGGTTG 
TTTTGAAACT 
AGAAAGAATT 
TGGATTGAGG 
TTATTCCCTG 
TCAACATTTT 
GAGGTGCTTC 
TCTCTTTTTT 
AGGGAGGAAG 
GCCTTCCACA 
GGACTTGGCA 
TCCTCAGAGG 
TGACTTGTGT 
ATGGCTTTCT 
CTAAGCACAA 
GACAGAATAG 
AGACAGGCCC 
GG CCTAAATC 
AGGCCGTGTG 
CGCAGTCCGC 
GGCGTAAGAT 
TTACAGCTTC 
TGTCATTTC 



Seq ID NO: 146 Protein sequence: 
Protein Accession #j NP_001159 



1 11 21 31 41 51 

I I I I I I 

MGAPTLPPAW QPFLKDHRIS TFKNWPFLEG CACTPERMAE AGPIHCPTEN EPDLAQCPPC 
FKELEGWEPD DDPIEEHKKH SSGCAPLSVK RQFEELTLGE FLKLDRERAK NKIAKETNNK 
KKEFEETAKK VRRAIEQLAA MD 

Seq ID NO: 147 DNA sequence 

Nucleic Acid Accession #: NM_014176.1 

Coding sequence: 127-720 



1 
I 

GCGCGCAGCG 
AGTGCATCCC 
GGGATCATCC 
CCCCCAGGCA 
TTAGGTGGAG 
GAGAGGTACC 
ATTGATTCTG 
AGACCATCCC 
AACCCTGATG 
TTCCTCAAGA 
GAGGAAGAGA 
CAGAAAAGGA 
GGGACTTGTC 
ACCTTGAATT. 
GTACATATGT 



11 
I 

CTGGTACCCC 
AGGCAGCTCT 
AGAGAGCTTC 
TCACATGTTG 
CCAACACACC 
CATTTGAACC 
CTGGAAGGAT 
TCAACATCGC 
ACCCGCTCAT 
ATGCCAGACA 
TGCTTGATAA 
AGGCCAGTCA 
CTGGTTCATC 
TTTTTTTAAA 
ATTTTGAAAT 



21 
I 

GTTGGTCCGC 
TAGTGTGGAG 
ACGTCTGAAG 
GCAAGATAAA 
TTATGAGAAA 
TCCTCAGATC 
TTGTCTGGAT 
AACTGTGTTG 
GGCTGACATA 
GTGGACAGAG 
TCTACCAGAG 
GCTAGTAGGC 
TTAGTTAATG 
TATATTTGAT 
CTTTTAAACC 



31 

I 

GCGTTGCTGC 
CAGTGAACTG 
AGAGAGCTGC 
GACCAAATGG 
GGTGTTTTTA 
CGATTTCTCA 
GTTCTCAAAT 
ACCTCTATTC 
TCCTCAGAAT 
AAGCATGCAA 
GCTGGTGACT 
ATAGAAAAGA 
TGTTCTTTGC 
GACATAATTT 
TGAAAAATAA 



41 
I 

GTTGTGAGGG 
TGTGTGGTTC 
ACATGTTAGC 
ATGACCTGCG 
AGCTAGAAGT 
CTCCAATTTA 
TGCCACCAAA 
AGCTGCTCAT 
TTAAATATAA 
GACAGAAACA 
CCAGAGTACA 
AATTTCATCC 
CAAGGTGATC 
TTGTGTAGTT 
ATAGTCATTT 



51 * 
I 

GTGTCAGCTC 
CTTCTACTTG 
CACAGAGCCA 
AGCTCAAATA 
TATCATTCCT 
TCATCCAAAC 
AGGTGCTTGG 
GTCAGAACCC 
TAAGCCAGCC 
AAAGGCTGAT 
CAACTCAACA 
TGATGTTTAG 
TAAGTTGCCT 
TATTTATCTT 
AATGTTGAAA 



60 
120 
180 
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WO 02/086443 

AAAAAAAAAA AAAAAAAAAA AAAAAAAA 

Seq ID NO: 148 Protein sequence: 
Protein Accession #: NP_054895.l 

1 11 21 31 ' 41 SI 

I I I I I 1 

MQRASRLKRE LHMLATEPPP GITCWQDKDQ MDDLRAQILG GANTPYEKGV FKLEVIIPER 60 

YPPEPPQIRP LTPIYHPNID SAQRICLDVL KLPPKGAWRP SLiNIATVLTS I QLLMSEPNP 120 

DDPLMADISS EFKYNKPAFL KNARQWTEKH ARQKQKADEE EMLDNIiPEAG DSRVHNSTQK 180 
RKASQLVGIE KKFHPDV 

Seq ID NO: 149 DNA sequence 
Nucleic Acid Accession ft: NM_003812 
Coding sequence: 224-2722 

1 11 21 31 41 51 

I I I I I I 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCCGGGAA AGGGGGCGCC 60 

GCCCAGCCCC GAGCCCCGCG CCCCGTGCCC CGAGCCCGGA GCCCCCTGCC CGCGGCGGCA 120 

CCATGCGOGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCOGGC 180 

GCTCTCGCCG GCCACACGGA GCGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 240 

CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC GGCGCTTCCT GCGGCCCCCA 300 

ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 360 

GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCCGCCTCG TCCCGGCCCC GCGCCTGGGG 420 

GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 480 

GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 540 

AATGCAGAAA GAAATCACAC TGCCTTCAAG ACT CAT AT AT TACATCAACC AAGACTCGGA 600 

AAGCCCTTAT CACGTTCTTG ACACAAAGGC AAGACACCAG CAAAAACATA ATAAGGCTGT 660 

CCATCTGGCC CAGGCAAGCT TCCAGATTGA AGCCTTCGGC TCCAAATTCA TTCTTGACCT 720 

CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 780 

ACCACAGTAC TCTAAGGGTG GAGAGCACTG TT ACT AC CAT GGAAGCATCA GAGGCGTCAA 840 

AGACT CCAAG GTGGCTCTGT CAACCTGCAA TGGACTTCAT GGCATGTTTG AAGATGATAC 900 

CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA CAGGTCGACC 960 

ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 1020 

GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG TGGTTGAAAA GAAGGAAGAG 1080 

AG CAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 1140 

TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT GCACATACCA ACAACTTTGC 1200 

AAAGTCCGTG GTCAACCTTG TGGATTCTAT TTACAAGGAG CAGCTCAACA CCAGGGTTGT 1260 

CCTGGTGGCT GTAGAGACCT GGACTGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 1320 

GCAGATGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 1380 

GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 1440 

TGT CTGTTCT CGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA TGGCAGTGGC 1500 

ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 1560 

AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC ATCATGGAGG AAACAGGGGT 1620 

GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAGAGACT TTTTACAGAG 1680 

AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA TTTGAGCCCA CGGAATGTGG 1740 

AAATGGATAC GTGGAAGCTG GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG 1800 

ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 1860 

TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 1920 

GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA I960 

GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 2040 

CAGAGACAAC CAGTGTCAGT ACATCTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG 2100 

CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG 2160 

GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA CCAATCTTAC 2220 

TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT CCAACTTCCT TCTACCATCA 2280 

AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 2340 

CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG TGTTTAGATC GGAAGTGCCT 2400 

ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 2460 

GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT GATTTCACCT GGGCAGGGAC 2520 

AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 2580 

GGGTCCTAGT GCCACCAATC TCATAATAGG CTCCATCGCT GGTGCCATCC TGGTAGCAGC 2640 

TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 2700 

TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG ACACCGCCTT GCACTGTTGG 2760 

ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT ATTAAGTTTG TAAACAAAAC 2820 

CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 2880 

CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAACGGGGGA 2940 

GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA TCTTTTTTTT TCCCTAATGG 3000 
ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA GGAATCATTA AAAA 



Seq ID NO: 150 Protein sequence: 
Protein Accession #: NPJ>03803 

1 11 21 31 41 51 

I 1 1 I 1 1 

MKPPGSSSRQ PPLAGCSLAG ASCGPQRGPA GSVPASAPAR TPPCRLLLVL LLLPPLAASS 60 

RPRAWGAAAP SAPHWNETAE KNtiGVUVDED NTliQQNSSSN ISYSNAMQKE ITI»PSRLIYY 120 

INQDSESPYH VLDTKARHQQ KHNKAVHLAQ ASFQIEAFGS KFILDLILNN GLLSSDYVEI 180 

HYENGKPQYS KGGEHCYYHG SIRGVKDSKV ALSTCNGLHG MFEDDTFVYM IEPLELVHDE 240 

KSTGRPHIIQ KTIxAGQYSKQ MKNLTMERGD QWPFLSELQW LKRRKRAVNP SRGIFEEMKY 300 

LELMIVNDHK TYKKHRSSHA HTNNFAKSW NLVDSIYKEQ LNTRWLVAV BTWTEKDQID 360 

ITTNPVQMLH EFSKYRQRIK QHADAVHLIS RVTFHYKRSS LSYFGGVCSR TRGVGVNEYG 420 

LPMAVAQVLS QSLAQNLGIQ WEPSSRKPKC DCTESWGGCI MEETGVSHSR KFSKCSILEY 480 

RDFLQRGGGA CLFNRPTKLF EPTECGNGYV EAGEECDCGP HVECYGLCCX KCSLSNGAHC 540 

SDGPCCNNTS CLFQPRGYEC RDAVNECDIT EYCTGDSGQC PPNLHKQDGY ACNQNQGRCY 600 

NGECKTRDNQ CQYIWGTKAA GSDKFCYEKL NTEGTEKGNC GKDGDRWIQC SKHDVFCGFL 660 
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LCTNLTRAPR IGQLQGEIIP TSFYHQGRVI DCSGAHWLD DDTDVGYVED GTPCGPSMMC 
LDRKCLQIQA LNMSSCPLDS K6KVCS6HGV CSNEATCICD FTWAGTDCSI RDPVRNLHPP 
KDBGPKGPSA TNLIIGSIAQ AILVAAIVLQ 6TGWGFKNVK KRRFDPTQQG PI 

Seq ID NO: 151 DNA sequence 
Nucleic Acid Accession ft: NM__023915 
Coding sequence: 250-1326 



720 
780 



PCT/US02/12476 



i 
I 

GGCACGAGGG 
TCAAAGCTTA 
GTGAATGGAC 
CCCACGCCTC 
AACTGAAGAA 
CAAGAGAGTC 
AATGAATTTG 
TTGCTGAATG 
TTCTATCTCA 
ATAGTCCATG 
TCAGTTTTGT 
GATCGCTATC 
ACGAAGGTTT 
ATCCTGACAA 
CCTTTGGGGG 
GTGCTGGTGA 
AGGCAATTCA 
GTGGCTGTGT 
AGTCACTTAG 
ATTACACTTT 
TGTAGGTCAT 
AT CAGATCAC 
GTGTAGGCCT 
TTCATTATCC 



11 
I 

TTTCGTTTTC 
TTCTTAATTA 
AGCCAGCCAC 
AATCGTCCCC 
TGGGGTT CAA 
ACAATTCAGG 
ACACAATTGT 
GTTTAGCAGT 
AAAACATAGT 
ATGCAGGATT 
TTTATGCAAA 
TGAAGGTGGT 
TATCTGTTTG 
ATGGTCAGCC 
TCAAATGGCA 
TTCTGATCGG 
TAAGTCAGTC 
TTTTTACCTG 
ACAGGCTTTT 
TCTTGTCTGC 
TTTCAAGAAG 
TGCAAAGTGT 
TTTATTGTTT 
TTAAAAAAAA 



21 
I 

ATGCTTTACC 
GAGACAAGAA 
CACAATGAAA 
AAGTGTTTCC 
CTTGACGCTT 
CAACAGGAGC 
CTTGCCGGTG 
GTGGATCTTC 
GGTTGCAGAC 
TGGACCTTGG 
CATGTATACT 
CAAGCCATTT 
TGTTTGGGTG 
AACAGAGGAC 
TACGGCAGTC 
ATGTTACATA 
AAGCCGAAAG 
CTTTCTACCA 
AGATGAATCT 
GTGTAATGTT 
GCTGTTCAAA 
GAGAAGATCG 
GTTGGAATOG 
AA 



31 
I 

AGAAAATCCA 
ACCTGTTTCA 
GAAATCAAAC 
TGACACGCAT 
GCAAAATTAC 
GACGGGCCAG 
CTTTATCTCA 
TT CCACATT A 
CTCATAATGA 
TACTTCAAGT 
TCCATCGTGT 
GGGGACTCTC 
ATCATGGCTG 
AATATCCATG 
ACCTATGTGA 
GCCATATCCA 
CGAAAACATA 
TATCACTTGT 
GCACAAAAAA 
TGCCTGGATC 
AAATCAAATA 
GAAGTTCGCA 
ATATGTACAA 




Seq ID NOi 152 Protein sequence: 
Protein Accession #: NP_076404 



1 
I 

MGFNLTLAKL 
GLAVWIFFHI 
FYANMYTSIV 
NGQPTEDNIH 
ISQSSRKHKH 
FLSACNVCLD 



11 
I 

PNNELHGQES 
RNKTSFIFYL 
FLGLISIDRY 
DCSKLKSPLG 
NQSIRVWAV 
PIIYFFMCRS 



21 

i 

HNSGNR5DGP 
KNIWADLIM 
LKWKPFGDS 
VKWHTAVTYV 
FFTCFLPYHL 
FSRRLFKKSN 



31 
I 

GKNTTLHNEF 
TI*TFPFRIVH 
RMYSITFTKV 
NSCLFVAVLV 
CRIPFTFSHIi 
IRTRSESIRS 



Seq ID NO i 153 DNA sequence 
Nucleic Acid Accession #: D80008.1 
Coding sequence: 149-739 



GTTCGGCGCC 
CGAAAGGAGT 
AAGGCCGCGG 
CGAGCTGCAT 
AGTTCTGGAG 
GTCAGGTGGA 
AAATCGACGC 
ATGGGAATAT 
GGAGTGGTTT 
TGAAGGTTTG 
GTGTCTAAAA 
AAATAGCCAG 
GGAGCACATC 

TAGACATTGT 
AGGACTTTCT 
GTTTTGTAGA 
AGTCCTCCCA 
CCCCTACTCC 
GTGTGTTTTT 
TTGGCTGGAC 
CAAGCTAGAG 
TGGTCTGTAG 
TATTTGGGAA 
CTTGTGGCTA 
CTAGAGAAGG 
AGAGTTGATT 
TCCAGTTTAT 
TCCCAAGATC 
TACTTTGGTC 
GTTTTGAGAT 
CACTGCAATC 
GGGATTACAG 



11 

I 

AAAGCGCGGA 
GAGGCGCCGA 
GAGTGGGAAG 
CGCGCGCCCG 
GAGATGAAAG 
CGAAGTGATT 
TGCACTGTAG 
GGTAGCGTCT 
AATAATTATA 
GACATTACAC 
GACTATGGAG 
CACTTTTTAC 
CTGTCATGAC 
CTCACTCTCT 
TTAAGATAAC 
TTTTTTAATG 
GACTGTCTCA 
CCTTAGCTTC 
TTTTTCTAAT 
TAAATGAAAG 
AGGAAGAAGG 
AGCTGAATTT 
AAATTTTCAG 
GGAAGGACAC 
TGGGGTGATC 
AACTTTGTAC 
GTCTTTTAAT 
TCGTTTGTTC 
ACAATTTTTT 
TATGACCCGT 
GGAGTCTTGT 
TCTATCCCCT 
GCACAGGCCG 



21 

1 

GCGGAGGCCG 
GAGCCCAGAT 
CGTCCGCCAT 
AAGGGCAACT 
CTTTGTATGA 
TGATACCAAC 
CATACCTGTA 
TGCCAAATGC 
AAAGATCTCT 
AGGATATGAA 
AATTTGAAGT 
CTCGATGGAA 
CATGCGCCGA 
CCACCACTCC 
TAAGAATACT 
TTGTACACTA 
CTATGTTGCC 
TCAAAGTGTT 
AAGCTGTATC 
TAAACATGGT 
TAGATCCTGT 
CTGAGATACA 
TATATATAAT 
ACATGGATTT 
ACCAGTATCA 
AGTTTTCCCT 
GGTATGTTTT 
TTTTATGCTT 
TTCCTTTTTA 
TTTTTTTTTT 
TCTGTCACCC 
GGGTTCAAGT 
CCACGCCTGG 



31 

I 

AGGCGAGAGC 
ACCATTTTGG 
GTTCTGCGAA 
GCCTGCCTTC 
ACAAAACCAG 
TATCAAATTT 
TGACCGCTTG 
ATTACGATTT 
TGCTACTTAT 
ACCACCAAAA 
TGATGATGGC 
ATGTGAGCAG 
GGCACTTCCA 
CTTCACCTCC 
TGGCTAAGAA 
TTCTTCCTAC 
CAAGCTGGTC 
GAGATCACAG 
TGTAATCACA 
TACATTTGAA 
GTGTCTTGTT 
CATTTTCAAA 
GTTTAATGAC 
TGCACATTTC 
CCACTTTGGA 
GAGATTCAGA 
AAACAGCTGA 
TGGGTGTTGC 
CTTCTAGAAG 
GTTTTGTTTT 
AGGCTGGGGT 
GATTCTCTTG 
CTAATTTTTG 



ACTGCTCAAA 
ACAGCTGCTT 
GGTACATCCA 
ACCAGAGCAT 
GCAGAATTCC 
TCCTATATTA 
CAATAATTTA 
TCAGAACCAG 
TATATTATGA 
AGTGTAAATA 



41 

I 

DTIVLPVLYL 
DAGFGPWYFK 
LSVCVWVIMA 
ILIGCYIAIS 
DRLLDESAQK 
LQSVRRSEVR 



41 
I 

CTGGCGCTGT 
CGTGAGAGCT 
AAAGCCATGG 
AACGAGGATG 
TCTGATGTGA 
CGACACTGTT 
CTTCGGATCA 
CACATGGCTG 
ATGAGGTCAC 
AGCCTATATA 
ACTTCAGTCC 
CTGATCAGAC 
GGCTTCACTC 
CTCTTTGATT 
GTATAATTTG 
TCTTTTTTGG 
TCAAACTCCT 
GCGTGAGCCA 
GCATTCCTAC 
TCTCTTAAAT 
TTCTGGTCAT 
TCACATGCAA 
ATACTAATTT 
CACCATGGTG 
AGGGGACAGT 
TTGACTGAAA 
CATTTTAAAT 
ATCCGAGAAA 
TGTTATAATT 
GTTTTTTCGT 
GCAGTGGCGT 
TCTCAGCCTC 
TATTTTTAGT 



51 

I 

GACCTTAGTT 
ACCGTATGAG 
CTATGCTGAA 
AGTGCATCAC 
GCTGCACGGC 
CACCCTTCAC 
GGCAAGCATC 
CAGCTTCATA 
TCCATTTCGA 
CAGATACACT 
GATAAGCATT 
CATAACCTTC 
GCCAAACATC 
ACTTAAAAGT 
GTTTGTGGCC 
CAAATCCAGC 
CAGGGT TGTT 
TTTTACTTTT 
CTGCAAAGAA 
CTTTTTCATG 
GAGTGAAAGC 
TTACACTGAT 
AATGTTTCTT 



51 
I 

IIFVASILIiN 
FILCRYTSVL 
VLSLPNIIIiT 
RYIHKSSRQF 
IIjYYCKEITL 
IYYDYTDV 



51 
i 

AGGACTAGAA 
GGTGGTTGGC 
AACTGATCCG 
GACTCAGACA 
ATGAAGCAAA 
CTCTGTTAAG 
GAGCACTCAG 
CTGAAGAAAT 
TGGGAGGAGA 
TTGAAGTCCG 
TATTAAAAAA 
AAGGAGTCCT 
AACTCATGGA 
TTAGAAGCTA 
CTAACTATTA 
TTTTGGTTTT 
GGCCTCAAGC 
CTGCACCCGG 
AGTTGTTACA 
AAGCAGTCAC 
GTGTATTGTA 
GTGAAGATGA 
ATCATCTGGC 
GCTGGTGTGG 
GAAATTGGGG 
AGTCACATGA 
TTTGATGAAA 
TCTTTTCCCA 
TTAAGCTTTA 
TTGTTTCTTT 
GATCTTGGCT 
CCAAGTAGCT 
AGAGACAGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 
300 



60 
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1740 
1800 
1860 
1920 
1980 



244 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

TTTTACCATG TTGGCCAGQC TGGTTTCAAA CTCCTGACCT CAAGTGACCC ACCTTGGCCT 
CCCAAAGTTT TGGGATTACA AGTGTGGGCC ACCGCGGCCA GCCTATGATC CATTTTGAAT 
GAATTTTTTA TATGGTGCAA GGTGTCAATC CACCTTCACT TTTTCTTGGG AATATAGATA 
TCCAGCTGTT TCACTACCAT TTTTTGAAAG GACTGCCCTT TGCTCTATCA CCTTTGCATT 
TTTGTTAAAA AGTAGTTGTC AATGTATATG TGGGTTTATT TCAGGACTCT GTTTTGTTCC 
ATTGACCTGT TTTTCTCTCC TGAATGCCAA TACCATATTT GTATGTAGTG TATGTAATTT 
TCTAATAATT CTTGAAACAG ATAGTATTAA TGTGT CAT AT TTTTGCTGTT GTTTGTATTT 
TTTGTAGAGA TGGGGTTTCA CCGTGTTGGC CAGGCTGTGT TGAACTCCTG AGCTAAAGCA 
ATACACTTGC CTCGTCCTCC CCATGTGCTG GGATTACAGG CGTGAGCCTT GGTGCTGGCC 
CAGTGTACCA CATTTCTTTT TGAGATTTGT TTTGGCTATG TTAAGTCCTT TGCTTTTGAT 
GTGAAATTTG GGAACAGGCA GGGTGTGGTG GCTTATGCCT GTAATCCTAG AACTTTGGGA 
GGCCTAGATG GGTGGATCAC TTGAGCTCAG GAGTTCCAGA CCAGCCCGGG CCTATGGCAA 
AACTCCGTCT CTACAAAAAA TAGAAAAAAT TAGCCAGGTG TGGTGGTGCA TGCCTGTAGT 
CACAGTTACA CGGCAGGCTG AGGTGGGAGG ATCACTTGAA CCCCAGAGGT CAAGACTGCA 
GTGAGCTGAG ATCACACCAC TGTACTCCAG CCTGGGTGAC AAAGTGAGAC TCTATCTCAA 
AAAGAAATTA GGATCAATTT GTCAATTTCT ACAACAACAA CAACAAAAAC CCCTGTTGGG 
CACCTTGATT GAGATTGCAT TGAATTTATA TAAAACTGTT GGGAGAATTG ACATCTTAAT 
AATATTGAGT CTTCTGGCCT ATAAACAAGG TCTGTCTTCC TAGGTATTAA TGTTTTGTCT 
TCTATTTCTC TTAATAATCT TTTGTAGTTT TCAGTGTACA GGTCTACCAT GTCAGCATTT 
CATAGTTTTG ATGCTAAATG GTATTTTAAA ATTTCAAATT CTAACCACTT GTTGCTAGTA 
AATAGAAATA CAATTGATGT TGAACTTGTA TCCTTCAGCC TTGCTAAACT GTGAGTTCTC 
ATGGTGTTTT TGTAAATTAC ATCAACAGTC ATGTGTTCTA TGAATAAAGA GTTTTACTCC 
TTC 

Seq ID NO: 154 Protein sequence: 
Protein Accession #: BAA11503.1 

1 11 21 31 41 51 

I I I I I 1 

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 
TIKFRHCSLL RNRRCTVAYI> YDRL LR I HAL RWEYGSVLPN AL.RFHMAAEE MEWFNNYKRS 
LATYMRSLGG DEGLDITQDM KPPKSLYIEV RCLKDYGEFE VDDGTSVLLK KNSQHPLPRW 
KCEQLIRQGV LEHILS 

Seq ID NO: 155 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 149-709 



1 

! 

GTTCGGCGCC 
CGAAAGGAGT 
AAGGCCGCGG 
CGAGCTGCAT 
AGTTCTGGAG 
GTCAGGTGGA 
AAATCGACGC 
ATGGGAATAT 
GGAGTGGTTT 
TGAAGGTTTG 
ATGCAGTGGC 
CAACCTCCAC 
GCACTTCAGT 
AGCTGATCAG 
CAGGCTTCAC 
CCCTCTTTGA 
AAGTATAATT 
ACTCTTTTTT 
TCTCAAACTC 
AGGCQTGAGC 
CAGCATTCCT 
AATCTCTTAA 
TTTTCTGGTC 
AATCACATGC 
ACATACTAAT 
TCCACCATGG 
GAAGGGGACA 
GATTGACTGA 
GACATTTTAA 
GCATCCGAGA 
AGTGTTATAA 
TTGTTTTTTC 
GTGCAGTGGC 
TGTCTCAGCC 
TGTATTTTTA 
CTCAAGTGAC 
CAGCCTATGA 
CTTTTTCTTG 
TTTGCTCTAT 
TTTCAGGACT 
TTGTATGTAG 
ATTTTTGCTG 
GTTGAACTCC 
GGCGTGAGCC 
TGTTAAGTCC 
CTGTAATCCT 
GACCAGCCCG 



11 
I 

AAAGCGCGGA 
GAGGCGCCGA 
GAGTGGGAAG 
CGCGCGCCCG 
GAGATGAAAG 
CGAAGTGATT 
TGCACTGTAG 
GGTAGCGTCT 
AATAATTATA 
GACATTACAC 
GCGATCTCGG 
CTCCCAGGTC 
CCTATTAAAA 
ACAAGGAGTC 
TCAACTCATG 
TTTTAGAAGC 
TGCTAACTAT 
GGTTTTGGTT 
CTGGCCTCAA 
CACTGCACCC 
ACAGTTGTTA 
ATAAGCAGTC 
ATGTGTATTG 
AAGTGAAGAT 
TTATCATCTG 
TGGCTGGTGT 
GTGAAATTGG 
AAAGTCACAT 
ATTTTGATGA 
AATCTTTTCC 
TTTTAAGCTT 
GTTTGTTTCT 
GTGATCTTGG 
TCCCAAGTAG 
GTAGAGACAG 
CCACCTTGGC 
TCCATTTTGA 
GGAATATAGA 
CACCTTTGCA 
CTGTTTTGTT 
TGTATGTAAT 
TTGTTTGTAT 
TGAGCTAAAG 
TTGGTGCTGG 
TTTGCTTTTG 
AGAACTTTGG 
GGCCTATGGC 



21 
I 

GCGGAGGCCG 
GAGCCCAGAT 
CGTCCGCCAT 
AAGGGCAACT 
CTTTGTATGA 
TGATACCAAC 
CATACCTGTA 
TGCCAAATGC 
AAAGATCTCT 
AGGATATGAA 
CTCAACCTGC 
CGGTGTCTAA 
AAAAATAGCC 
CTGGAG CACA 
GACTCCTCTG 
TATAGACATT 
TAAGGACTTT 
TTGTTTTGTA 
GCAGTCCTCC 
GGCCCCTACT 
CAGTGTGTTT 
ACTTGGCTGG 
TACAAGCTAG 
GATGGTCTGT 
GCTATTTGGG 
GGCTTGTGGC 
GGCTAGAGAA 
GAAGAGTTGA 
AATCCAGTTT 
CATCCCAAGA 
TATACTTTGG 
TTGTTTTGAG 
CTCACTGCAA 
CTGGGATTAC 
AGTTTTACCA 
CTCCCAAAGT 
ATGAATTTTT 
TATCCAGCTG 
TTTTTGTTAA 
CCATTGACCT 
TTT CT AATAA 
XTTTTGTAGA 
CAATACACTT 
CCCAGTGTAC 
ATGTGAAATT 
GAGGCCTAGA 
AAAACTCCGT 



31 
I 

AGGCGAGAGC 
ACCATTTTGG 
GTTCTGCGAA 
GCCTGCCTTC 
ACAAAACCAG 
TATCAAATTT 
TGACCGCTTG 
ATTACGATTT 
TGCTACTTAT 
ACCACCAAAA 
AACCTCCACC 
AAGACTATGG 
AGCACTTTTT 
TCCTGTCATG 
TACTCACTCT 
GTTTAAGATA 
CTTTTTTTAA 
GAGACTGTCT 
CACCTTAGCT 
CCTTTTTCTA 
TTTAAATGAA 
ACAGGAAGAA 
AGAGCTGAAT 
AGAAATTTTC 
AAGGAAGGAC 
TATGGGGTGA 
GGAACTTTGT 
TTGTCTTTTA 
ATTCGTTTGT 
TCACAATTTT 
TCTATGACCC 
ATGGAGTCTT 
TCTCTATCCC 
AGGCACAGGC 
TGTTGGCCAG 
TTTGGGATTA 
TATATGGTGC 
TTTCACTACC 
AAAGTAGTTG 
GTTTTTCTCT 
TTCTTGAAAC 
GATGGGGTTT 
GCCTCGTCCT 
CACATTTCTT 
TGGGAACAGG 
TGGGTGGATC 
CTCTACAAAA 



41 
I 

CTGGCGCTGT 
CGTGAGAGCT 
AAAGCCATGG 
AACGAGGATG 
TCTGATGTGA 
OGACACTGTT 
CTTCGGATCA 
CACATGGCTG 
ATGAGGTCAC 
AG CCTATAT A 
TCCCAGGTTC 
AGAATTTGAA 
ACCTCGATGG 
ACCATGCGCC 
CTCCACCACT 
ACTAAGAATA 
TGTTGTACAC 
CACTATGTTG 
TCTCAAAGTG 
ATAAGCTGTA 
AGTAAACATG 
GGTAGATCCT 
TTCTGAGATA 
AGTATATATA 
ACACATGGAT 
TCACCAGTAT 
ACAGTTTTCC 
ATGGTATGTT 
TCTTTTATGC 
TTTTCCTTTT 
GTTTTTTTTT 
GTTCTGTCAC 
CTGGGTTCAA 
CGCCACGCCT 
GCTGGTTTCA 
CAAGTGTGGG 
AAGGTGTCAA 
ATTTTTTGAA 
TCAATGTATA 
CCTGAATGCC 
AGATAGTATT 
CACCGTGTTG 
CCCCATGTGC 
TTTGAGATTT 
CAGGGTGTGG 
ACTTGAGCTC 
AATAGAAAAA 



CACATTTTCA 
ATGTTTAATG 
TTTGCACATT 
CACCACTTTG 
CTGAGATTCA 
TTAAACAGCT 
TTTGGGTGTT 
TACTTCTAGA 
TTGTTTTGTT 
CCAGGCTGGG 
GTGATTCTCT 
GGCTAATTTT 
AACTCCTGAC 
CCACCGCGGC 
TCCACCTTCA 
AGGACTGCCC 
TGTGGGTTTA 
AATACCATAT 
AATGTGTCAT 
GCCAGGCTGT 
TGGGATTACA 
GTTTTGGCTA 
TGGCTTATGC 
AGGAGTTCCA 
ATTAGCCAGG 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
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60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



245 
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TGTGGTGGTG CATGCCTGTA GTCACAGTTA CACGQCAGQC TGAGGTGGGA GGATCACTTG 2880 

AACCCCAGAG GTCAAGACTG CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG 2940 

ACAAAGTGAG ACTCTATCTC AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC 3000 

AACAACAAAA ACCCCTGTTG GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG 3060 

TTGGGAGAAT TGACATCTTA ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT 3120 

CCTAGGTATT AATGTTTTGT CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA 3180 

CAGGTCTACC ATGTCAGCAT TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA 3240. 

TTCTAACCAC TTGTTGCTAG TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG 3300 

CCTTGCTAAA CTGTGAGTTC TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC 3360 
TATGAATAAA GAGTTTTACT CCTTC 



Seq ID NO: 1S6 Protein sequence: 
Protein Accession #: Bos sequence 

1 li 21 31 41 51 

I I I I I I 

MFCEKAMEL I RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KfiGGRSDLIP 60 
TIKFRHCSLL RNRRCTVAYIi YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE MEWFNNYKRS 120 
LATYMRSLGG DEGLDITQDM KPPKSLYIEA GCSGAISAQP ATSTSQVHLN CNLHLPGPVS 180 
KRLWRI 



Seq ID NO: 157 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-621 

1 11 21 31 41 51 

I ! I I I I 

TTCGGCGCCA AAGCGCGGAG CGGAGGCCGA GGCGAGAGCC TGGCGCTGTA GGACTAGAAC ' 60 
GAAAGGAGTG AGGCGCCGAG AGCCCAGATA CCATTTTGGC GTGAGAGCTG GTGGTTGGCA 120 
AGGCCGCGGG AGTGGGAAGC GTCCGCCATG TTCTGCGAAA AAGCCATGGA ACTGATCCGC 180 
GAGCTGCATC GCGCGCCCGA AGGGCAACTG CCTGCCTTCA ACGAGGATGG ACTCAGACAA 240 
GTTCTGGAGG AGATGAAAGC TTTGTATGAA CAAAACCAGT CTGATGTGAA TGAAGCAAAG 300 
TCAGGTGGAC GAAGTGATTT GATACCAACT ATCAAATTTC GACACTGTTC TCTGTTAAGA 3 60 
AATCGACGCT GCACTGTAGC ATACCTGTAT GACCGCTTGC TTCGGATCAG AGCACTCAGA 420 
TGGGAATATG GTAGCGTCTT GCCAAATGCA TTACGATTTC ACATGGCTGC TGAAGAAGTC 480 
CGGTGTCTAA AAGACTATGG AGAATTTGAA GTTGATGATG GCACTTCAGT CCTATTAAAA 540 
AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC AGCTGATCAG ACAAGGAGTC 600 
CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC CAGGCTTCAC TCAACTCATG 660 
GACTCCTCTG TACTCACTCT CTCCACCACT CCCTTCACCT CCCTCTTTGA TTTTAGAAGC 720 
TATAGACATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG AAGTATAATT TGCTAACTAT 780 
TAAGGACTTT CTTTTTTTAA TGTTGTACAC TATTCTTCCT ACTCTTTTTT GGTTTTGGTT 840 
TTGTTTTGTA GAGACTGTCT CACTATGTTG CCCAAGCTGG TCTCAAACTC CTGGCCTCAA 900 
GCAGTCCTCC CACCTTAGCT TCTCAAAGTG TTGAGATCAC AGGCGTGAGC CACTGCACCC 960 

GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA CAGCATTCCT ACAGTTGTTA 1020 

CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG AATCTCTTAA ATAAGCAGTC 1080 

ACTTGGCTGG ACAGGAAGAA GGTAGATCCT GTGTGTCTTG TTTTCTGGTC ATGTGTATTG 1140 

TACAAGCTAG AGAGCTGAAT TTCTGAGATA CACATTTTCA AATCACATGC AAGTGAAGAT 1200 

GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG ACATACTAAT TTATCATCTG 1260 

GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT TCCACCATGG TGGCTGGTGT 1320 

GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACTTTG GAAGGGGACA GTGAAATTGG 1380 

GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATTCA GATTGACTGA AAAGTCACAT 1440 

GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT GACATTTTAA ATTTTGATGA 1500 

AATCCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT GCATCCGAGA AATCTTTTCC 1560 

CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA AGTGTTATAA TTTTAAGCTT 1620 

TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT TTGTTTTTTC GTTTGTTTCT 16 B0 

TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG GTGCAGTGGC GTGATCTTGG 1740 

CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT TGTCTCAGCC TCCCAAGTAG 1800 

CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT TGTATTTTTA GTAGAGACAG 1860 

AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC CTCAAGTGAC CCACCTTGGC 1920 

CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC CAGCCTATGA TCCATTTTGA 1980 

ATGAATTTTT TATATGGTGC AAGGTGTCAA TCCACCTTCA CTTTTT CTTG GGAATATAGA 2040 

TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC TTTGCTCTAT CACCTTTGCA 2100 

TTTTTGTTAA AAAGTAGTTG TCAATGTATA TGTGGGTTTA TTTCAGGACT CTGTTTTGTT 2160 

CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT TTGTATGTAG TGTATGTAAT 2220 

TTTCTAATAA TTCTTGAAAC AGATAGTATT AATGTGTCAT ATTTTTGCTG TTGTTTGTAT 2280 

TTTTTGTAGA GATGGGGTTT CACCGTGTTG GCCAGGCTGT GTTGAACTCC TGAGCTAAAG 2340 

CAATACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA GGCGTGAGCC TTGGTGCTGG 2400 

CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA TGTTAAGTCC TTTGCTTTTG 2460 

ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC CTGTAATCCT AGAACTTTGG 2520 

GAGGCCTAGA TGGGTGGATC ACTTGAGCTC AGGAGTTCCA GACCAGCCCG GGCCTATGGC 2580 

AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG TGTGGTGGTG CATGCCTGTA 2640 

GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG AACCCCAGAG GTCAAGACTG 2700 

CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG ACAAAGTGAG ACTCTATCTC 2760 

AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC AACAACAAAA ACCCCTGTTG 2820 

GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG TTGGGAGAAT TGACATCTTA 2880 

ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT CCTAGGTATT AATGTTTTGT 2940 

CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA CAGGTCTACC ATGTCAGCAT 3000 

TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA TTCTAACCAC TTGTTGCTAG 3 060 

TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG CCTTGCTAAA CTGTGAGTTC 3120 

TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC TATGAATAAA GAGTTTTACT 3180 
CCTTC 



Seq ID NO: 158 Protein sequence: 
Protein Accession tti Eob sequence 



1 ll 21 31 41 51 

I I I.I I I 
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MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 60 
TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVfcPN ALRFHMAAEE VRCLKDYGEP 120 
EVDDGTSVLL KKNSQHFLPR WKCEQLIRQG VLEHILS 



Seq ID NOt 159 DNA sequence 

Nucleic Acid Accession &t Eos sequence 

Coding sequence i 149-229 

1 11 21 31 41 51 

I I I I I I 

GTTCGGCGCC AAAGCGCGGA QCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 
CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 
AAGGCCGCOG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 
CGAGCTGCAT CGCGCGCCCG AAGGG CAACT GCCTGCCTTC AACAATTAGC TGGGTGTGGT 240 
GGCACACACC TGTAGTCCCA GCAACTTAGG AGGCTGAAGT GAGAGGATTG CATGGCTCCA 300 
GGAAGTTGAA ACTGCAGTGA ACTGTGGTCA CGCTATTACA CTCCAGCCTG GGTGACAGAC 360 
TGAATCCCTG TCTCAAAAAG GAAAAGGAGG ATGGACTCAG ACAAGTTCTG GAGGAGATGA 420 
AAGCTTTGTA TGAACAAAAC CAGTCTGATG TGTTCTCTGT TAAGAAATCG ACGCTGCACT 480 
GTAGCATACC TGTATGACCG CTTGCTTCGG ATCAGAGCAC TCAGATGG 



Seq ID NO: 160 Protein sequence: 
Protein Accession #r Eos sequence 



1 11 21 31 41 51 

I I I I I I 

ATGTTCTGCG AAAAAGCCAT GGAACTGATC CGCGAGCTGC ATCGCGCGCC CGAAGGGCAA 60 
CTGCCTGCCT TCAACAATTA G 



Seq ID NO t 161 DNA sequence 
Nucleic Acid Accession #: U10694 
Coding sequence: 1333-2280 



1 11 21 31 41 51 

| I I I I I 

GGATCCGGCC GGATCTCAGG GAGGTGAGGA CTTTGTTCTC AGAGGGTGTG TGTGGACAAA 60 

ACAGGGAGGC CCTGTGTTCG ACAGACACAG TGGTCCCAGG ATTGGAGAGC AGTCCAGGTG 120 

AGGAACCTAA GGGAGGATCG AGGGTACCTC CAGGCCAGAQ AAACTCTCAG ATCAAGAGAG 180 

TTTGCCCTGC CCCTACTGTC ACCCCAGAGA GCCCGGGCAG GGCTGTCTGC TGAGGTCCCT 240 

CCTTTATCCT GGGATCACTG GTGTCGGGGA GGGCTGGCCT TGGTCTGAGG GGGCTGCACT 300 

CACGTCAGCA GAGGGAGGGT CCCAGGCCCT GCCAGGAGTC CAGGTGCAGA CTGAGGGGAC 360 

CCCACTCACC AAACACAGAQ GACCTAGCCC CACCCTGCCC CTTGTGTCAG CTGAGGGAAG 420 

CCGCTGGGTG GATGGACTCC CCTCACTTCC TCTTCAGGTG TCTCCTGGAG ATAGGGCCTC 480 

AGGTCAACAG AGGGAGGGTT CCAGACCCTG CAGGCATCAA GATGAGGACC AGGCAGTATC 540 

CTCACCCCAG GACACATGGA CCCCATTGAA TTTAGACATC TCTTACTGTA CTTCCGAGGA 600 

AACCCTGGGC AGGTGTGGGC AGATGTTGGT TGGGGCATGT CCTTCTGTTC CATATCAGGG 660 

ATGTGAGCTC CTGATCTGAG AGACTCTCAG GCAAGTAGAG GAGTAGAGTC CAGTCCCTGC 720 

CAGGAGAAAG GTCAGGGCCC TGAGTGAGCG CAGAGGGGAC CATCCACCCC AAAAGTGTGT 780 

AGAACTCAAG AGTGTCCAGC CCGCCCTCTT GACAGCACTG AGGGACCGGG GCTCTGCCTG 840 

CAGTCTGCAG CCTAAGGGCC CCTCGATTCC TCTTCCAGGA GCTCCAGGAA GCAGGCAGGC 900 

CTTGGTCTGA GACAGTGTCC TCAGGTCGCA GAGCAGAGGA GACCCAGGCA GTGTCAGCAG 960 

TGAAGGTGAA GTGTTCACCC TGAATGTGCA CCAAGGGCCC CACCTGCCCC AG CACACATG 1020 

GGACCCCATA GCACCTGGCC CCATTCCCCC TACTGTCACT CATAGAGCCT TGATCTCTGC 1080 

AGGCTAGCTG CACGCTGAGT AGCCCTCTCA CTTCCTCCCT CAGGTTCTCG GGACAGGCTA 1140 

ACCAGGAGGA CAGGAGCCCC AAGAGGCCCC AGAGCAGCAC TGACGAAGAC GTGTAAGTCA 1200 

GCCTTTGTTA GAACCTCCAA GGTTCGGTTC TCAGCTGAAG TCTCTCACAC ACTCCCTCTC 1260 

TCCCCAGGCC TGTGGGTCTC CATCGCCCAG CTCCTGCCCA CGCTCCTGAC TGCTGCCCTG 132 0 

ACCAGAGTCA TCATGTCTCT CGAGCAGAGG AGTCCGCACT GCAAGCCTGA TGAAGACCTT 1380 

GAAGCCCAAG GAGAGGACTT GGGCCTGATG GGTGCACAGG AACCCACAGG CGAGGAGGAG 1440 

GAGACTACCT CCTCCTCTGA CAGCAAGGAG GAGGAGGTGT CTGCTGCTGG GT CAT CAAGT 1500 

CCTCCCCAGA GTCCTCAGGG AGGCGCTTCC TCCTCCATTT CCGTCTACTA CACTTTATGG 1560 

AGCCAATTCG ATGAGGG CTC CAGCAGTCAA GAAGAGGAAG AGCCAAGCTC CTCGGTCGAC 1620 

CCAGCTCAGC TGGAGTTCAT GTTCCAAGAA GCACTGAAAT TGAAGGTGGC TGAGTTGGTT 1680 

CATTTCCTGC TCCACAAATA TCGAGTCAAG GAGCCGGTCA CAAAGGCAGA AATGCTGGAG 1740 

AGCGTCATCA AAAATTACAA GCGCTACTTT CCTGTGATCT TCGGCAAAGC CTCCGAGTTC 1800 

ATGCAGGTGA TCTTTGGCAC TGATGTGAAG GAGGTGGACC CCGCCGGCCA CTCCTACATC 1860 

CTTGTCACTG CTCTTGGCCT CTCGTGCGAT AGCATGCTGG GTGATGGTCA TAGCATGCCC 1920 

AAGGCCGCCC TCCTGATCAT TGTCCTGGGT GTGATCCTAA CCAAAGACAA CTGCGCCCCT 198 0 

GAAGAGGTTA TCTGGGAAGC GTTGAGTGTG ATGGGGGTGT ATGTTGGGAA GGAGCACATG 2040 

TTCTACGGGG AGCCCAGGAA GCTGCTCACC CAAGATTGGG TGCAGGAAAA CTACCTGGAG 2100 

TACCGGCAGG TGCCCGGCAG TGATCCTGCG CACTACGAGT TCCTGTGGGG TTCCAAGGCC 2160 

CACGCTGAAA CCAGCTATGA GAAGGTCATA AATTATTTGG TCATGCTCAA TGCAAGAGAG 2220 

CCCATCTGCT ACCCATCCCT TTATGAAGAG GTTTTGGGAG AGGAGCAAGA GGGAGTCTGA 2280 

GCACCAGCCG CAGCCGGGGC CAAAGTTTGT GGGGTCAGGG CCCCATCCAG CAGCTGCCCT 2340 

GCCCCATGTG ACATGAGGCC CATTCTTCGC TCTGTGTTTG AAGAGAGCAA TCAGTGTTCT 2400 

CAGTGGCAGT GGGTGGAAGT GAGCACACTG TATGTCATCT CTGGGTTCCT TGTCTATTGG 2460 

GTGATTTGGA GATTTATCCT TGCTCCCTTT TGGAATTGTT CAAATGTTCT TTTAATGGTC 2520 

AGTTTAATGA ACTTCACCAT CGAAGTTAAT GAATGACAGT AGTCACACAT ATTGCTGTTT 25 B0 

ATGTTATTTA GGAGTAAGAT TCTTGCTTTT GAGTCACATG GGGAAATCCC TGTTATTTTG 2640 

TGAATTGGGA CAAGATAACA TAGCAGAGGA ATTAATAATT TTTTTGAAAC TTGAACTTAG 2700 

CAGCAAAATA GAGCTCATAA AGAAATAGTG AAATGAAAAT GTAGTTAATT CTTGCCTTAT 2760 

ACCTCTTTCT CTCT CCTGTA AAATTAAAAC ATATACATGT ATACCTGGAT TTGCTTGGCT 2820 

TCTTTGAGCA TGTAAGAGAA ATAAAAATTG AAAGAATAAT TTTTCCTGTT CACTGGCTCA 2880 
TTTTTTCTTC AGACACGCAC TGAACATCTG TTATTCGGAA CACCCTGGGT T 



Seq ID NO: 162 Protein sequence: 
Protein Accession ft: AAA68877.1 
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1 11 21 31 41 51 

I I I I I I 

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 60 

PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEAIjKL KVAELVHFLL 120 

HKYRVKEPVT KAEMLrESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 180 

LGLS CDSMLG D GHSMPKAAL HIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE 240 

PRKLLTQDWV QENYIiEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY 300 
PSLYEEVLGE EQEGV 

Seq ID NO: 163 DNA sequence 
Nucleic Acid Accession ft: AF292100 
Coding sequence: 30-809 

1 11 21 31 41 51 

I I I I I I 

GGGGGGGGAG AGGCCTGGAG GACACCAACA TGAACAAGTT GAAATCATCG CAGAAGGATA 60 

AAGTTCGTCA GTTTATGATC TTCACACAAT CTAGTGAAAA AACAGCAGTA AGTTGTCTTT 120 

CTCAAAATGA CTGGAAGTTA GATGTTGCAA CAGATAATTT TTTCCAAAAT CCTGAACTTT 180 

ATATACGAGA GAGTGTAAAA GGATCATTGG ACAGGAAGAA GTTAGAACAG CTGTACAATA 240 

GATACAAAGA CCCTCAAGAT GAGAATAAAA TTGGAATAGA TGGCATACAG CAGTTCTGTG 300 

ATGACCTGGC ACTCGATCCA GCCAGCATTA GTGTGTTGAT TATTGCGTGG AAGTTCAGAG 3 60 

CAGCAACACA GTGCGAGTTC TCCAAACAGG AGTTCATGGA TGGCATGACA GAATTAGGAT 420 

GTGACAGCAT AGAACAACTA AAGGCCCAGA TACCCAAGAT GGAACAAGAA TTGAAAGAAC 480 

CAGGACGATT TAAGGATTTT TACCAGTTTA CTTTTAATTT TGCAAAGAAT CCAGGACAAA 540 

AAGGATTAGA TCTAGAAATG GCCATTGCCT ACTGGAACTT AGTGCTTAAT GGAAGATTTA 600 

AATTCTTAGA CTTATGGAAT AAATTTTTGT TGGAACATCA TAAACGATCA ATACCAAAAG 660 

ACACTTGGAA TCTTCTTTTA GACTTCAGTA CGATGATTGC AGATGACATG TCTAATTATG 720 

ATGAAGAAGG AGCATGGCCT GTTCTTATTG ATGACTTTGT GGAATTTGCA CGCCCTCAAA 780 

TTGCTGGGAC AAAAAGTACA ACAGTGTAGC ACTAAAGGAA CCTTTTAGAA TGTACATAGT 840 

CTGTACAATA AATACAACAG AAAATTGCAC AGTCAATTTC TGCTGGCTGG ACTGAACTGA 900 

AGATCAAT CC TCACAATTCA GACTGAGGGT TGAGACAAAA CTTTAAGGAT ACATCTTGGA 960 

CCATATCGTA" TTTCATTCTT CTAATGGTGG TTTGGGCTTG TCTTCTAGTC TGGGCCGCTC 1020 

TAAACATTTA TAATTCCAAC ATTGTGGATT TCATCTTATA TCTGTGGACC ATCCTAGTTT 10 80 

ATTCTCCCAT AAGTCTTAGA AGCTTTATGQ TGATTATTTT GAGGTTTTCA TTCTCGCATA 1140 

AAGCACAATG CTGTCTTCAT CAGAAAACAG TTGGCATAAG AATTAAACAT ATGAACATCA 1200 

CAAAACAATT TATAAAAACT TCTTAAATAT ACGCTTTGGG CTAGTTGCAA AGACTATGCT 1260 

AATAGCACTT CCAGTGAGAG TGATATATTT AAGTGTACTG GATCTGGAAT GGTGTTTTGG 1320 

TTTGGGGGGA ATTTTTTTTT TTTCCTGGCA AATCACATAT GTTGTTGATG TGAGTATCTG 13 80 

ATGAAAAAAC AATGTCAGAA TAACCGACAT GAAAATTTTT TAGGATAACT TGGTGCCTAC 1440 

CTGAAAAATG TATTGTGTTT TAGACTCTTG ATTTCAAAAG GTTCCACAGA ACTAGTCTGC 1500 

GCTTACCTTA CCCATGTTTA TATATAGCTG TCCTACAGGG AGCTTTTATT TAGAAAATGT 1560 

CTGCATAATG TTAGATTCTT CTCCTGTCTA CATTATGCAC TACATAATTG GACTTCATTA 1620 

TGCTTTTGAA ATGCTTATCT GCCTGTCACA TAAGTTAAAC TATTTAATTT GTTTTGAATG 1680 

TTTTGGATTG CTACACAATA CAATATTCTA AATTTAGGCA TGAGGGTTTT TTTGTTTTAT 1740 

TTTTACTTTT TTTTTGTCAT TGCACTATGG AACACAAATG AAATTCTCTT AATTTATAAG 1800 

AAGATAGTAG GAGTTAAATT TTGAAAATGG TTGTGATGAG CCACGAAATT CAATCTTTAT 1860 

AATATAGGTA CTGCTCTTTC AGACAAACAG TCCATTTTTA ATGACTTCTT ATTTTGTTGA 1920 

AATTACTTTA ACTGCTAATC ACTGTGGTTG CCAAATATTT ACTTCAGAAG CAAAGATTTT 1980 

CAAACAAGCA TACACGATGC AAAATACCAG TCTGGCTTCT AGTCTATTTA CTGTTTTGTT 2040 

TCACTCAGAT TAG CTCAGTT TTCTCATCAA AGCAGAATGC TATCTTGCGT GTGTGTGTGT 2100 

GTGTGTGTGT GTGTGTGTGT GTATGTGTGT ATATATATAT ATATATATAT ATATATATTT 2160 

TTTTTTTTTT TTTTTTTTAA ATTACAAAAG CCATGAGCTG CTTTTATGCT GAAAATGGTC 2220 

ATTTCCCTGT TCACTTACTG ACATGTGAAG AAGGGTTTCT TGCTTTCTTA AACATTTCCG 2280 

TAAGGCAGGC TAGAAATGTA ATACTTCAAA TGTTTGATGA TTATGGTCTT TTGATAGGAA 2340 

TAGATTCTGC TTGGGATATA TATCCAGGCA CTCTCTAAGG TCTAGGGTTG ATATTAACAA 24 00 

AGGAATGTAC TTAGAATAGC AGTACATTTT ATGCAAATAT GGAAATTATT TTAAGAAACA 24 60 

ATGACATATC AAAACTGCTT TTTACATGAT TTTGAAATAG ACTAGAAAGC TTTCCCTATA 2520 

GACATATTAA TATTCCAATC ATAACTTTAA TTCAAGAATG CAGTTTTACC AAAAGAAAAA 2580 

TTTGAAAATT TCTATTCAGG CTACTGGAAT TGGTTATTAA AAGAAAAAGG AAAAAGAAGA 2640 

ATCTTGCTGC TTTCAGTATT TCCTGATTTT TTTGTAAATA TAAAGAGGAA CTTCAATTAT 2700 

GAAAAATTTT TAAAAGATAT ATATATCTAT ATATCTATAT ATATGTACTG TTTTGTTTCC 27 60 

TGTCTTGAAG ATTTTGAGTT ATGGTTATTG GTTTCAGATT GATTAATTCA CATATGCTGT 2 820 

GTTTTCTTTA AAAGT CATAT GGGTTCGTGG CCTAATGCCT TGGATTTTAC ATATTTTTCT 2880 

TTTTAAATGC AAAACCTTTT CAACAAAATA GTGTTTGTCA TCAGGTTGGT ACTAAACATT 2 940 

TATAATTACT GTGTAATTAT AAACAAAAAT ACATAAAGCT TTGAATATAA TTATGTAGCA 3000 

TAAAAGTTAA GGTTGTTCAC TATGATGGCA TCTTAGAATT AAACAAAACT TTTACTAGGG 3060 

CTGAAAAGAG AAGACTGATT TAATGTGGTG TGATTATTCT GAAGATAAAT GTCTGGCTAC 3120 

AGGGAATATT TTGTACTAAA AAATGATTAC ACATATGGCT GTGTGTGTTT GAGTCTGTGT 3180 

CTGTGAGAGA GCCAGAGAGA GTGAGAGAGA TTGACAGAGA AAGGGAGAGA CACACACACG 3240 

CCCCTTGAAT TGCTTTAACT CCTAAGTGTT TCAGTCCTCA TTCCGGTAAA CTCCCCATGC 3300 

TGATTCTTTG TTTTAAACTG AACCATAGGT ACAGTTTCCT TTTTGCCAAA TGTCAAAACA 3360 

GGTACAAATT TTAAAATGTA ATGCTTTTTA AATAGAAAAA TGTATAAAAT TAGAAGTGCC 3420 

CACATATAAA AAATACTTGA GATGAAGATT ATCTTTAGTG AATATCATCT GCATATCTCT 3480 

GTAAGTTCAA TTGTGTTTCT TACAGTCCCT GTCATATTAC CAACAGAGGC AATAAAAGCT 3540 
GCAGTGAAAT TG 

Seq ID NOt 164 Protein sequence: 
Protein Acceasion ft: AAG00606 

1 11 21 31 41 51 

I I I 1 I I 

MNKLKSSQKD KVRQFMIFTQ SSEKTAVSCL SQNDWKLDVA TDNFFQNPEL YIRESVKGSL 60 

DRKKLEQLYN RYKDPQDENK IGIDGIQQFC DDLALDPASI SVLIIAWKFR AATQCEFSKQ 12 0 

EFMDGMTELG CDSIEQLKAQ IPKMEQELKE PGRFKDFYQF TFNFAKNPGQ KGLDLEMAIA 180 

YWNLVLNGRF KFLDLWNKFL LEHHKRSIPK DTWNLLLDFS TMIADDMSNY DEEGAWPVLI 240 
DDFVEFARPQ IAGTKSTTV 
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Seq ID MOt 165 DNA sequence 
Nucleic Acid Accession #: AF256215 
coding sequencet 220-2028 

1 11 21 31 41 51 

I ! I I 1 ! 

CTCCAGTCCG CATGCTCAGT AGCTGCTGCC GGCCGGGCTG CGGGGCGGCG TCCGCTGCGC 60 

GCCTACGGGC TGCGGTGGCG GCCGCCGCGG CACCCGGCAG GGCCCGCCAG TCCCCGCTTC 120 

CCTGCTCCAG AGCCGCCGCC TGGGCCGGGG CAGGGCGGGC CCGGGGCTCC TCCATGCTGC 180 

CAGCCGCCGG GCTGCGGAGC CGACCAAGTG GCTCCTGCGA TGGCGGCGGA AGAGGAGGCT 240 

GCGGCGGGAG GTAAAGTGTT GAGAGAGGAG AACCAGTGCA TTGCTCCTGT GGTTTCCAGC 300 

CGCGTGAGTC CAGGGACAAG ACCAACAGCT ATGGGGTCTT TCAGCTCACA CATGACAGAG 360 

TTTCCACGAA AACGCAAAGG AAGTGATTCA GACCCATCCC AAGTGGAAGA TGGTGAACAC 420 

CAAGTTAAAA TGAAGGCCTT CAGAGAAGCT CATAGCCAAA CTGAAAAGCG GAGGAGAGAT 480 

AAAATGAATA ACCTGATTGA AGAACTGTCT GCAATGATCC CTCAGTGCAA CCCCATGGCG 540 

CGTAAACTGG ACAAACTTAC AGTTTTAAGA ATGGCTGTTC AACACTTGAG ATCTTTAAAA 600 

GGCTTGACAA ATTCTTATGT GGGAAGTAAT TATAGACCAT CATTTCTTCA GGATAATGAG 660 

CTCAGACATT TAATCCTTAA GACTGCAGAA GGCTTCTTAT TTGTGGTTGG ATGTGAAAGA 720 

GGAAAAATTC TCTTCGTTTC TAAGTCAGTC TCCAAAATAC TTAATTATGA TCAGGCTAGT 780 

TTGACTGGAC AAAGCTTATT TGACTTCTTA CATCCAAAAG ATGTTGCCAA AGTAAAGGAA 840 

CAACTTTCTT CTTTTGATAT TTCACCAAGA GAAAAGCTAA TAGATGCCAA AACTGGTTTG 900 

CAAGTTCACA GTAATCTCCA CGCTGGAAGG ACACGTGTGT ATTCTGGCTC AAGACGATCT 960 

TTTTTCTGTC GGATAAAGAG TTGTAAAATC TCTGTCAAAG AAGAGCATGG ATGCTTACCC 1020 

AACTCAAAGA AGAAAGAGCA CAGAAAATTC TATACTATCC ATTGCACTGG TTACTTGAGA 1080 

AGCTGGCCTC CAAATATTGT TGGAATGGAA GAAGAAAGGA ACAGTAAGAA AGACAACAGT 1140 

AATTTTACCT GCCTTGTGGC CATTGGAAGA TTACAGCCAT ATATTGTTCC ACAGAACAGT 1200 

GGAGAGATTA ATGTGAAACC AACTGAATTT ATAACCCGGT TTGCAGTGAA TGGAAAATTT 1260 

GTCTATGTAG ATCAAAGGGC AACAGCGATT TTAGGATATC TGCCTCAGGA ACTTTTGGGA 1320 

ACTTCTTGTT ATGAATATTT TCATCAAGAT GACCACAATA ATTTGACTGA CAAGCACAAA 1380 

GCAGTTCTAC AGAGTAAGGA GAAAATACTT ACAGATTCCT ACAAATTCAG AGCAAAAGAT 1440 

GGCTCTTTTG TAACTTTAAA AAGCCAATGG TTTAGTTTCA CAAATCCTTG GACAAAAGAA 1500 

CTGGAATATA TTGTATCTGT CAACACTTTA GTTTTGGGAC ATAGTGAGCC TGGAGAAGCA 1560 

TCATTTTTAC CTTGTAGCTC TCAATCATCA GAAGAATCCT CTAGACAGTC CTGTATGAGT 1620 

GTACCTGGAA TGTCTACTGG AACAGTACTT GGTGCTGGTA GTATTGGAAC AGATATTGCA 1680 

AATGAAATTC TGGATTTACA GAGGTTACAG TCTTCTTCAT ACCTTGATGA TTCGAGTCCA 1740 

ACAGGTTTAA TGAAAGATAC TCATACTGTA AACTGCAGGA GTATGTCAAA TAAGGAGTTG 1800 

TTTCGACCAA GTCCTTCTGA AATGGGGGAG CTAGAGGCTA CCAGGCAAAA CCAGAGTACT 1860 

GTTGCTGTCC ACAGCCATGA GCCACTCCTC AGTGATGGTG CACAGTTGGA TTTCGATGCC 1920 

CTATGTGACA ATGATGACAC AGCCATGGCT GCATTTATGA ATTACTTAGA AGCAGAGGGG 1980 

GGCCTGGGAG ACCCTGGGGA CTTCAGTGAC ATCCAGTGGA CCCTCTAGCC TTTGATTTTT 2040 

AACTCCAAAA ATGAGAAACA TTTTAAAGCA TTATTTACGA AAAAACTGTC TCAACTATTC 2100 

TTAAGTACTG TATTGATATT GTTTGTATCT TTTATTAATG TTCTACCACT TTTTATAGAT 2160 

TTGCATCTTC CTGTCACAGG GATGTGGGGA AATACGTTTT CCTCCCAAGA GAACCAAGTT 2220 

TATTATAGAC TCCTTT ATTC AGTGAAATGG CTTATAATCC ACTAGTTGCC ATATTTTTGC 2280 

TAAAATATTT CTAACCAAGA ATACTACTTA CATATTGTTT TGGCTTTGTT TTATTTTTGA 2340 

TGCAGTTTTT TTTAGTTGAG GTAATGTAAT ATATTGATGT TTTCCTTTGT GTCTAAGATT 2400 

GATTTATAAT AGTAGGTTTG TATAATTTGG AACATTTTCC ATGCCTTGCG AATTTCCTTA 2460 

ATTGAGGATA GGGCTTACAC ACTTTAAGAA AACAGTGAGT ACTTGAACAT TTAAAGGGAC 2520 

AGTGCAATTT ATAGTCATAA TCACATTGAA TACTGTATTT GATCTTTGGA GACTTAGGCA 2580 

AGCACAGAGC TGGGATATTT ATGCTCAGTT GAGCACTTTA AGATGAATTT TAAGTGAGAT 2640 

GATTTCTTGC TTAAAACTCA GAAAGTCAAA AGAGTTTCAG CTTTCCTTAC AGAAAAGGAA 2700 

GGATCTTGGG CCCTAGATCT TGGGGATTAA CCTCTGCATA TAAGATTTAC TCTTAATAGG 2760 

CCAGACGTGG TGCTCACGCC TGTAATCCCA GTACTTTGGG AGGCTGAGAC GGGCAGATCA 2820 

CTTGAGGTCA GGAGTTCAAG ACCAGCCTGG CCAATATGGT GAAACCCCGT TTCTACTAAA 2880 

AATACAAAAA AAATTACCCA GGCACTCACT CTTGAGGTAA CTAACCAACT CCCACGATAA 2940 

TGACAGTCCA TTCATGAGCG CAAAGGCCTC ATGACCTAAT GGCACACACC TGTAATCCCA 3000 

ACTGCTTGGG AGGCTGAGGC GAGAGGATTG CTTGAACCTG GGAGGCAGAG GTTGCAGTGA 3060 

GCCGAGATCG CACCACTGCA CTCCAGTCTG GGCAACAGAG TGAGACTTCA TCTCAAAAAA 3120 

AGTAAAAAAA AAGATTTAAT ATAATCACTG AAGATCTCTA TTATAGATAG ATTAGGTTTT 31 BO 

TGACATTGGA AACATACTTA GGGATAGATT TGTCCTAAAG GAAAAAAGTA GGCCCGGGCA 3240 

GATTAAATGT CTTGTGTAAA GTCACACATT AAATTCAGTC ACACATTAAA TTCATAGAGT 3300 

TTTAAATGTT TAATGTATAT AAACCAGTTT CTTTATACAC ATTTGGGAAA ACATTGGTCT 3360 

CACAGATTAA ATGATTAACT AACTGACCCA GGAACTAGTT GTAGCTTTCT AAGTAATTAG 3420 

GCAATTACAG TTATTGCCTG TAACCAAAGG TAATAAAACA AAATGACAAG TACATGTTTA 3480 

AAATTATGAG GCAATGAGAA ATAATTTAAA AACCAATTTT CTAGTTATAA TTTAAAATTT 3540 

GGAGAGCATT TTTAACAGTA ATTAATCCAG AGGTGGCTCA AATTGAGTAT AAGAATTAAG 3600 

ATTATTTAAA ATACTGCATG TCTACCTTCT CGGGGATCAT ACTTTATAAC ACTTTCTGCT 3660 

TCAGTAGCTC TTCATAGCTT GCCAAGTATG CTCCCATATT TTCTCTCTCG TGCCTCGCAA 3720 

ATGAAAGTCA GATAGGCTGG GAACTCATGG GGCAGCCCTC AGACTTCAAT GTGGGCTTCA 3780 

AATCCAGTTT CCTGTTCTAT ATGGTGCTAC ATCTTTCCAG AAAATTTCCC TCAGAGCCCC 3840 

TCGCCAAAAC AAAGCATTAT TTTGACCCTG CATGCTATTT CTTTAGCTGT AGGTGATAGA 3900 

TTAGAACTTC TGTCAGACAT GTTAATGACA AACATACCAA CAGACAATAA CCAAAGCAAA 3960 

TGTTTCCTTC AAGTGTGAAA TGTGCAGGGG CTCGTGGGCA AGGATGTATT GGCACACTGT 4020 

CCTCTTGAAC TGATAGTGTC CCAGCAATGT TGGAGGTTGG CACCATTCCT GGTCCGACAC 4080 

TTGAGGACCT GAGAGACATC AGGTTTAGAA TGAGCCAAAG AAATCCTACA AGATGGGGAG 4140 

AATTGGTGTG CAGCAGCCTA AGTGTTATAG TTAAGTCTAA AQAAGTATGA AAGATCCCCT 4200 

GTGTTCTCTA AATTGAGCAG AGGGGCCTGC CTACCAATAT CACTTTTTAG GGGACTGAAC 4260 

CATTGCAGGT TAGACTTGGC TTCCAAAGAG TCTGCCTAAG CCAGGGGTGG CAGGGTAGGC 4320 

CAT CATAGCT GGATGGCCTC AAAAGCAGAT GGGGGCAGAC TTGCCCTCGT GATGCCAGGA 4380 

TTTGAGAGGC AGAGTTTCTA GAGGGAGACC AGTGCTGCCT CTCACAGTGG CAGTTTTTTC 4440 

TCTTTGCAAG AGGAGGGGCT GTTCAATTCC ATAGACCAGT GGGCAGATAG CCAGTTGAAT 4500 

ACTCTGTGCA TGGTTTGATC CTTTATTAGT TCGCTCTAAT ATTTTTCTGT AGATCCTTTT 4560 

GTCCTGGACT CAAAATCTAA TCCATGCATT GTATGATACC GTAGCTCTCC TAAGGTTTGT 4620 

GTTTCCTTCA AAATGTTTTA GTTTTCTTCA ACTAAATTTG ATTTTTGCTG TTAGAAGTGA 4680 

CAT ATTT TT A TGGTATACAC TATGTTCCTT TTTTCTACTG CGAGTCAATT TTTTGAATTT 4740 

TCGTGAGAAA GAATATATCT ACAAATTGCA CGAAAGTATC ATAAAAACAG TACTCTAGAG 4800 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



CAGCGCTGTC 
ACATTAAAGA 
CAAAATATCA 
TTGGTAATAC 
GTACTAGCCA 
AGCACAGTTC 
AGCTGGAAGT 
GTTTAAATGG 
CCAGGCACTC 
TGTAGAGTAG 
GTGGGCGGAT 
CTCTACTGAA 
CTGCTCTGGA 
AGATCGCGCC 
AAAAAAAAAA 
GAAGTAGACC 
AATTATTTAT 
AATTTAACCT 
CTTGGAATCA 
CAAAGACAAA 
AAGTGTTTAT 
GATCTCCGTA 
TGTTTAGTAG 
CATCACTGAG 
GGCCGACATC 
CCACTTTTCT 
GCCTATGAGA 
TGTTTTATGA 
GGCTTAGAGA 
CAGCACGGAA 
TATTCTTGAG 
CTCGTCTGTA 
ATTTTATAAT 
GCTCCCTTAA 



CAATAGAAAT 
AGTAAAAAGA 
TTTGAACATG 
TAGTCTTCAA 
CATTGCAAGT 
TAGGTTCCAC 
AAACATAGAG 
CTTGCATGAG 
TTCCCACTCC 
GCCGGGCGCA 
CACGAGGTCA 
AATACAAAGC 
GGCTGAGGCA 
ACTGCACCCC 
AAGAAAAGAA 
AAAGTTTATA 
TGATCCTTGA 
TTTGAAAATA 
AGTGAAATGA 
CAGGAAAGTA 
TTACAGTGTC 
AAATCTGGGG 
TGTCCAGTTT 
CCCTGGAATT 
TATGATTAGC 
GTTTTTGTAA 
GGCATTTATG 
GAAATGCTTT 
GCTTTCCAGG 
CATGCTTTCT 
TTTAGATTTG 
TATTGGTATT 
TACT CAT TTG 
AA 



ATAATCTGAG 
TACAAGTAGA 
TAATTAATAT 
AAT CTGGTAT 
GCTCAGTAGC 
CCTAACACCC 
ATCAAACCTC 
GTCATACAGC 
ACTACATTAC 
GTGGCTCATG 
GGAGATCGAG 
AAAATTAGCC 
GAATGGCGTG 
AGCCTGGGCG 
AAGAAAAGTC 
CCATAAGGAT 
ATCTGTAAGA 
ATAAACTTTA 
GTTATATGGT 
CAGAATAGAG 
ACGACAGAAA 
CACAGGTACA 
CAGATCATGC 
GGAGACTCAT 
TAGAAGCCAT 
TGCTTTCATT 
ATTTTTGTGC 
CCAAGGGAGG 
TGTAGTGCCA 
GAACTCACTT 
TCTTTTATAC 
TTTAAATTTT 
TAGTTTTTTT 



CCACATGTAT 
ACTAATTTTA 
AAAATTATTA 
GTATCTTACA 
CACATGTGGC 
AAGTCCTGTG 
CTTTTAAAAA 
TAAATTCAGC 
TGTAGTGGTA 
CCTGTAATCC 
ACCATCCTGG 
AGGTGTGGTG 
AACCCAGGAG 
ACAGAGCGAG 
TAGAGAACAT 
ATTTTTCCTT 
TCAAATAACA 
AAATATCAGA 
CATCACTAAA 
ACTTTTAGTA 
AGGATGTCTT 
AGAAATAGCC 
TGCCAAGAGG 
ACTTGCCCAG 
AAAGAAAAGC 
AGCAGATCTT 
CTACAATAAG 
TCTAGGAAGA 
ATAAAAACTG 
GAGAGTGTAT 
AATTTTTAGC 
TGTGGTAAAT 
.TTTTAATTTA 



seg ID NO: 166 Protein sequence: 
protein Accession #; AAG34652 



1 

I 

MAAEEEAAAG 
QVEDGEHQVK 
QHXiRSLKGLT 
LNYDQASLTG 
YSGSRRSFFC 
NSKKDNSNFT 
LPQELLGTSC 
TNPWTKELEY 
SIGTDIANEI 
TRQNQSTVAV 
Th 



11 

I 

GKVLREENQC 
MKAFREAHSQ 
NSYVGSNYRP 
QSLFDFLHPK 
RIKSCKISVK 
CIiVAIGRIiQP 
YEYFHQDDHN 
IVSVNTLVLG 
LDLQRLQSSS 
HSKBPIxLSDG 



21 

I 

IAPWSSRVS 
TEKRRRDKMN 
SFLQDNELRH 
DVAKVKBQLS 



YIVPQNSGEI 
NLTD KHKAVL 
HSEPGEASFL 
YLDDSSPTGL 
AQIjDFDALCD 



31 

I 

PGTRPTAMGS 
NLIEELSAMI 
LILKTAEGFL 
SFDISPREKL 
KKEHRKFYTI 
NVKPTEFITR 
QSKEKILTDS 



MKDTHTVNCR 
NDDTAMAAFM 



Seq ID NO j 167 DNA sequence 
Nucleic Acid Accession #« NM_014400 
Coding sequence* 86-1126 



1 

I 

GGTTACTCAT 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 
CGGCCTGGAT 
CTGCAACGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTGTGTCCAG 
TGGCTCCTGT 
CCCTCGAATC 
CACATCTGTC 
GCCAGCGCCA 
GGAGCCCAGG 
TCCTGCAAAA 
ATTGGCAGCC 
AAATTTCCCT 
CCCACCACTG 
CTTCTGCTGC 
GGGTGTTCTA 
TCCTCTTGTG 
AGGATGCTAA 
GGTGGGACAA 
ATCGGTTCCC 
CTTATGTCTG 



11 
I 

CCTGGGCTCA 
GAGCAGGACG 
GCAGGCTGGC 
TGCGTGCAGA 
CCGGGCGTGG 
TCGCTGGCAG 
CTTCACGGGC 
AAGCTCAACC 
AACGGCGTGG 
CCGCCGGTCG 
AACGTCACCT 
GATGAATTCT 
TGCCAGGGGT 
CCACCCCTTG 
ACCACTTCTA 
ACCAGTCAGA 
TTGACTGGAG 
GGGGGGCCCC 
CTTCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC 
GCTTTTTGAG 
ATGTTAGGAC 
GCTTCCTACT 
TGGCTCCCCA 
CATATGTCTT 
TGTGTGATCA 



21 
I 

GGTAAGAGGG 
GAGCCATGGA 
TGCTGCTGCT 
AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT 
TCACCTCGCG 
AGTGCTACAG 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCCGCTGTAA 
TCCGGCTGCC 
CCTCGGCCCC 
CTCCGAGACA 
GCGCCGCTGG 
AGCAGCCCCA 
CCGTGGCTGC 
TCTCTGGCCC 
GCCCAGCCCC 
GGCTTTGGGA 
GACAGCTCCT 
AGAGTGAGAG 
CACTTTCTCC 
CTCTAAGCAC 
CCTTACTAGA 
GTTTCTGGCA 



31 

I 

CCCGAGCTCG 
CCCCGCCAGG 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CATCCAGCTG 
GGCGCTCGAC 
CTGTGTGGGC 
CAACGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGACCTC 
CCCTCCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGTCCTA 
TGGGTACCCC 
TGTTTTTCCA 
AATAAAATAC 
GTATCCTTCT 
AAGTCAGCTG 
TAGCCAGCCT 
TGCCTCCCCT 
CTGTGAGCTC 
CATAAATGCC 



AATTTTATTT 
ATGTTTTAAT 
ATGTGATATT 
TTGATAGCAC 
TAGTGGCTAC 
GATTAGAATC 
TGAGGACGCT 
CTCAACAGGG 
ATTCTTAGGG 
CAGCACTTTG 
CCAACATGGT 
GCGGGCGCCT 
GCAGAGATGG 
ACTCCATCTC 
TATATTAAGT 
AAATACCATG 
AGTCTCTATC 
TGTGTTATTA 
TTTAGAAATC 
AATAAATGGA 
TGTTGTCATA 
AATATTTAGT 
TATCTCCCCC 
CACAATGTTA 
TGCTAAGTGG 
TTTTTTCCAA 
TCAGCCTGTC 
TCCTGACACA 
ACCTGGAAAG 
GGTGTATGTC 
TCTTTTCCAG 
AATGAAAAGA 
ATAAACTTCC 



41 

I 

FSSHMTEFPR 
PQCNFMARKL 
FWGCERGKI 
IDAKTGIiQVH 
HCTGYLRSWP 
FAVNGKFVYV 
YKFRAKDGSF 
8RQSCMSVPG 
SMSNKELPPP 
NYLEAEGGLG 



41 
I 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGCGGTGG 
CTCCCCGGCA 
CAGCAATGCG 
CCGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCCAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATCCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CTGTGAGCTT 
TCTTCTCATC 
ACATTCCCCA 
CGTTGTATAT 
CATCCTTGTC 
TCACGGGGAA 
GGACTTTGGA 
ACTCCCCGCA 
CTCGAGGGCA 
TCAATAAAGA 



TCTTCTAGCC 
TCAGTATATC 
TTACATTCTT 
ATCTCACTTT 
TGCACTGGAC 
CCAGAATCAG 
GAGGCACAGA 
TCTTCTGATT 
TTAAAAAAAG 
GGAGGCCGAA 
GAAACCCCGT 
GTGGTCCCAG 
CAGTGAGCCA 
AAAAAAAAAA 
GGTTATTATT 
TTTGAAGAAC 
CATGTTACCA 
CAGGATGATA 
TATTGTGAAA 
ATTTAAAAGA 
GTCTTTGAGG 
TCCCAGACCA 
TCAGGTGGGT 
CGGGCAGACA 
CCACTAGGTG 
GCTCCATGGG 
TGGTGTGAGT 
TAAGAACTTT 
AAAACCTGCC 
ACTTCTCATA 
TTCACTTGTG 
GTGAAATTAT 
TCCAAAAAGT 



4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
S700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 



51 
I 

KRKGSDSDPS 
DKLTVLRMAV 
LFVSKSVSKI 
SNLHAGRTRV 
PNIVGMEEER 
DQRATAILGY 
VTLKSQWFSF 
MSTGTVLGAG 
SPSEMGELEA 
DPGDFSDIQW 



51 

I 

ACCCAGGGGG 
CCCAGGCCAT 
AGGCCCTGGA 
TGAAGACAGT 
AGACCAT CCA 
AGAATGACCG 
CTCAGGATCG 
ATGAGAGTGC 
AGGCGTGCCA 
ACAAGGGCTG 
CTGTCCGGGG 
TCACGCTCAG 
CCTACTTCTC 
TGGCCTCAAC 
CCAAACCCAT 
CCCGGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG 
TCTCCGCTTG 
GGTGAGAGAG 
GCGTGGGGTG 
TCTTTGGGGA 
GGGACCGTGC 
TTTAATTACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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Seq ID NO: 166 Protein sequence: 
^ Protein Accession #: NP_055215 

I 11 21 31 41 Si 

II I I I.I 

MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60 

CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGIiDLHGLL AFIQLQQCAQ DRCNAKLNLT 120 

10 SRALDPAGNE SAYPPNGVEC V SCVGLSREA CQGTSPPWS CYNASDHVYK GCFDGNVTLT 180 

AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240 

LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300 
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL 

15 Seg ID NO: 169 DNA sequence 

Nucleic Acid Accession #t NM_00687S 
Coding sequence: 186-1190 

^ 1 11 21 • 31 41 51 

20 | | II I I 

GAATTCGGCA CGAGCGCGCG GCGAATCTCA ACGCTGCGCC GTCTGCGGGC GCTTCCGGGC 60 

CACCAGTTTC TCTGCTTTCC ACCCTGGCGC CCCCCAGCCC TGGCTCCCCA GCTGCGCTGC 120 

CCCGGGCGTC CACGCCCTGC GGGCTTAGCG GGTTCAGTGG GCTCAATCTG CGCAGCGCCA 180 

CCTCCATGTT GACCAAGCCT CTACAGGGGC CTCCCGCGCC CCCCGGGACC CCCACGCCGC 240 

25 CGCCAGGAGG CAAGGATCGG GAAGCGTTCG AGGCCGAGTA TCGACTCGGC CCCCTCCTGG 300 

GTAAGGGGGG CTTTGGCACC GTCTTCGCAG GACACCGCCT CACAGATCGA CTCCAGGTGG 360 

CCATCAAAGT GATTCCCCGG AATCGTGTGC TGGGCTGGTC CCCCTTGTCA GACTCAGTCA 420 

CATGCCCACT CGAAGTCGCA CTGCTATGGA AAGTGGGTGC AGGTGGTGGG CACCCTGGCG 480 

TGATCCGCCT GCTTGACTGG TTTGAGACAC AGGAAGGCTT CATGCTGGTC CTCGAGCGGC 540 

30 CTTTGCCCGC CCAGGATCTC TTTGACTATA TCACAGAGAA GGGCCCACTG GGTGAAGGCC 600 

CAAGCCGCTG CTTCTTTGGC CAAGTAGTGG CAGCCATCCA GCACTGCCAT TCCCGTGGAG 660 

TTGTCCATCG TGACATCAAG GATGAGAACA TCCTGATAGA CCTACGCCGT GGCTGTGCCA 720 

AACTCATTGA TTTTGGTTCT GGTGCCCTGC TTCATGATGA ACCCTACACT GACTTTGATG 780 

GGACAAGGGT GTACAGCCCC CCAGAGTGGA TCTCTCGACA CCAGTACCAT GCACTCCCGG 840 

35 CCACTGTCTG GTCACTGGGC ATCCTCCTCT ATGACATGGT GTGTGGGGAC ATTCCCTTTG 900 

AGAGGGACCA GGAGATTCTG GAAGCTGAGC TCCACTTCCC AGCCCATGTC TCCCCAGACT 960 

GCTGTGCCCT AATCCGCCGG TGCCTGGCCC CCAAACCTTC TTCCCGACCC TCACTGGAAG 1020 

AGAfCCTGCT GGACCCCTGG ATGCAAACAC CAGCCGAGGA TGTTACCCCT CAACCCCTCC 1080 

AAAGGAGGCC CTGCCCCTTT GGCCTGGTCC TTGCTACCCT AAGCCTGGCC TGGCCTGGCC 1140 

40 TGGCCCCCAA TGGTCAGAAG AGCCATCCCA TGGCCATGTC ACAGGGATAG ATGGACATTT 1200 

GTTGACTTGG TTTTACAGGT CATTACCAGT CATTAAAGTC CAGTATTACT AAGGTAAGGG 1260 

ATTGAGGATC AGGGGTTAGA AGACATAAAC CAAGTTTGCC CAGTTCCCTT CCCAATCCTA 132 0 

CAAAGGAGCC TTCCTCCCAG AACCTGTGGT CCCTGATTTT GGAGGGGGAA CTTCTTGCTT 1380 

CTCATTTTGC TAAGGAAGTT TATTTTGGTG AAGTTGTTCC CATTTTGAGC CCCGGGACTC 144 0 

45 TTATTTTGAT GATGTGTCAC CCCACATTGG CACCTCCTAC TACCACCACA CAAACTTAGT IS 00 

TCATATGCTT TTACTTGGGC AAGGGTGCTT TCCTTCCAAT ACCCCAGTAG CTTTTATTTT 1560 

AGTAAAGGGA CCCTTTCCCC TAGCCTAGGG TCCCATATTG GGTCAAGCTG CTTACCTGCC 1620 

TCAGCCCAGG ATTTTTTATT TTGGGGGAGG TAATGCCCTG TTGTTACCCC AAGGCTTCTT 1680 

TTTTTTTTTT TTTTTTTTTG GGTGAGGGGA CCCTACTTTG TTATCCCAAG TGCTCTTATT 1740 

50 CTGGTGAGAA GAACCTTAAT TCCATAATTT GGGAAGGAAT GGAAGATGGA CACCACCGGA 1800 

CACCACCAGA CAATAGGATG GGATGGATGG TTTTTTGGGG GATGGGCTAG GGGAAATAAG I860 

GCTTGCTGTT TGTTTTCCTG GGGCGCTCCC TCCAATTTTG CAGATTTTTG CAACCTCCTC 1920 

CTGAGCCGGG ATTGTCCAAT TACTAAAATG TAAATAATCA CGTATTGTGG GGAGGGGAGT 1960 

TCCAAGTGTG CCCTCCTTTT TTTTCCTGCC TGGATTATTT AAAAAGCCAT GTGTGGAAAC 2040 

55 CCACTATTTA ATAAAAGTAA TAGAATCAGA AAAAAAAAAA AAAAAAAA 



60 



Seq ID NO: 170 Protein sequence: 
Protein Accession #t NP_006866 



11 21 31 41 51 

I I I I I I 

MLTKPLQGPP APPGTPTPPP GGKDREAFEA EYRLGPLLGK GGFGTVFAGH RLTDRLQVAI 60 

KVIPRNRVLG WSPLSDSVTC PLEVALLWKV GAGGGHPGVI RLLDWFETQE GFMLVLERPL 120 

65 PAQDLFDYIT EKGPLGEGPS RCFFGQWAA IQHCHSRGW HRDIKDENIL IDLRRGCAKL 160 

IDFGSGALLH DEPYTDFDGT RVYSPPEWIS RHQYHALPAT VWSLGIliLYD MVCGDIPFER 240 

DQEILEAELH FPAHVSPDCC AIiIRRCLAPK PSSRPSLEEI LLDPWMQTPA EDVTPQPLQR 300 
RPCPFGLVLA TLSLAWPGLA PNGQKSHPMA MSQG 

70 Seq ID NO i 171 DNA sequence 

Nucleic Acid Accession #i NM_003646 
Coding sequence: 89.. 2 87 5 

„ 1 11 21 31 41 51 

75 } | i | i I 

GCGGCGCGGA GCGGGCGTGC TGAGCCCCGG CCGCCGGCCC GGCATGGGCG TCTCCCGCGG 60 

GCCCTCCGCC GGCCGGGGCT AGGGCCGGAT GGAGCCGCGG GACGGTAGCC CCGAGGCCCG 120 

GAGCAGCGAC TCCGAGTCGG CTTCCGCCTC GTCCAGCGGC TCCGAGCGCG ACGCCGGTCC 180 

CGAGCCGGAC AAGGCGCCGC GGCGACTCAA CAAGCGGCGC TTCCCGGGGC TGCGGCTCTT 240 

80 CGGGCACAGG AAAGCCATCA CCAAGTCGGG CCTCCAGCAC CTGGCCCCCC CTCCGCCCAC 300 

CCCTGGGGCC CCGTGCAGCG AGTCAGAGCG GCAGATCCGG AGTACAGTGG ACTGGAGCGA 360 

GTCAGCGACA TATGGGGAGC ACATCTGGTT CGAGACCAAC GTGTCCGGGG ACTTCTGCTA 420 

CGTTGGGGAG CAGTACTGTG TAGCCAGGAT GCTGAAGTCA GTGTCTCGAA GAAAGTGCGC 480 

AGCCTGCAAG ATTGTGGTGC ACACGCCCTG CATCGAGCAG CTGGAGAAGA TAAATTTCCG 540 

85 CTGTAAGCCG TCCTTCCGTG AATCAGGCTC CAGGAATGTC CGCGAGCCAA CCTTTGTACG 600 

GCACCACTGG GTACACAGAC GACGCCAGGA CGGCAAGTGT CGGCACTGTG GGAAGGGATT 660 

CCAGCAGAAG TTCACCTTCC ACAGCAAGGA GATTGTGGCC ATCAGCTGCT CGTGGTGCAA 720 
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GCAGGCATAC CACAGCAAGG TGTCCTGCTT CATGCTGCAG CAGATCGAGG AG CCGTGCT C 780 

GCTGGGGGTC CACGCAGCCG TGGTCATCCC GCCCACCTGG ATCCTCCGCG CCCGGAGGCC 840 

CCAGAATACT CTGAAAGCAA GCAAGAAGAA GAAGAGGGCA TCCTTCAAGA GGAAGTCCAG 900 

CAAGAAAGGG CCTGAGGAGG GCCGCTGGAG ACCCTTCATC ATCAGGCCCA CCCCCTCCCC .960 

GCTCATGAAG CCCCTGCTGG TGTTTGTGAA CCCCAAGAGT GGGGGCAACC AGGGTGCAAA 1020 

GATCATCCAG TCTTTCCTCT GGTATCTCAA TCCCCGACAA GTCTTCGACC TGAGCCAGGG 1080 

AGGGCCCAAG GAGGCGCTGG AGATGTACCG CAAAGTGCAC AACCTGCGGA TCCTGGCGTG 1140 

CGGGGGCGAC GGCACGGTGG GCTGGATCCT CTCCACCCTG GACCAGCTAC GCCTGAAGCC 1200 

GCCACCCCCT GTTGCCATCC TGCCCCTGGG TACTGGCAAC GACTTGGCCC GAACCCTCAA 1260 

CTGGGGTGGG GGCTACACAG ATGAGCCTGT GTCCAAGATC CTCTCCCACG TGGAGGAGGG 1320 

GAACGTGGTA CAGCTGGACC GCTGGGACCT CCACGCTGAG CCCAACCCCG AGGCAGGGCC 1380 

TGAGGACCGA GATGAAGGCG CCACCGACCG GTTGCCCCTG GATGTCTTCA ACAACTACTT 1440 

CAGCCTGGGC TTTGACGCCC ACGTCACCCT GGAGTTCCAC GAGTCTCGAG AGGCCAACCC 1500 

AGAGAAATTC AACAGCCGCT TTCGGAATAA GATGTTCTAC GCCGGGACAG CTTTCTCTGA 1560 

CTTCCTGATG GGCAGCTCCA AGGACCTGGC CAAGCACATC CGAGTGGTGT GTGATGGAAT 1620 

GGACTTGACT CCCAAGATCC AGGACCTGAA ACCCCAGTGT GTTGTTTTCC TGAACATCCC 1680 

CAGGTACTGT GCGGGCACCA TGCCCTGGGG CCACCCTGGG GAGCACCACG ACTTTGAGCC 1740 

CCAGCGGCAT GACGACGGCT ACCTCGAGGT CATTGGCTTC ACCATGACGT CGTTGGCCGC 1800 

GCTGCAGGTG GGCGGACACG GCGAGCGGCT GACGCAGTGT CGCGAGGTGG TGCTCACCAC 1860 

ATCCAAGGCC ATCCCGGTGC AGGTGGATGG CGAQCCCTGC AAGCTTGCAG CCTCACGCAT 1920 

CCGCATCGCC CTGCGCAACC AGGCCACCAT GGTGCAGAAG GCCAAGCGGC GGAGCGCCGC 1980 

CCCCCTGCAC AGCGACCAGC AGCCGGTGCC AGAGCAGTTG CGCATCCAGG TGAGTCGCGT 2 040 

CAGCATGCAC GACTATGAGG CCCTGCACTA CGACAAGGAG CAGCTCAAGG AGGCCTCTGT 2100 

GCCGCTGGGC ACTGTGGTGG TCCCAGGAGA CAGTGACCTA GAGCTCTGCC GTGCCCACAT 2160 

TGAGAGACTC CAGCAGGAGC CCGATGGTGC TGGAGCCAAG TCCCCGACAT GCCAGAAACT 2220 

GTCCCCCAAG TGGTGCTTCC TGGACGCCAC CACTGCCAGC CGCTTCTACA GGATCGACCG 2280 

AGCCCAGGAG CACCTCAACT ATGTGACTGA GATCGCACAG GATGAGATTT ATATCCTGGA 2340 

CCCTGAGCTG CTGGGGGCAT CGGCCCGGCC TGACCTCCCA ACCCCCAGTT CCCCTCTCCC 2400 

CACCTCACCC TGCTCACCCA CGCCCCGGTC ACTGCAAGGG GATGCTGCAC CCCCTCAAGG 2460 

TGAAGAGCTG ATTGAGGCTG CCAAGAGGAA CGACTTCTGT AAGCTCCAGG AGCTGCACCG 2520 

AGCTGGGGGC GACCTCATGC ACCGAGACGA GCAGAGTCGC ACGCTCCTGC ACCACGCAGT 2580 

CAGCACTGGC AGCAAGGATG TGGTCCGCTA CCTGCTGGAC CACGCCCCCC CAGAGATCCT 2640 

TGATGCGGTG GAGGAAAACG GGGAGACCTG TTTGCACCAA GCAGCGGCCC TGGGCCAGCG 2700 

CACCATCTGC CACTACATCG TGGAGGCCGG GGCCTCGCTC ATGAAGACAG ACCAGCAGGG 2760 

CGACACTCCC CGGCAGCGGG CTGAGAAGGC TCAGGACACC GAGCTGGCCG CCTACCTGGA 2820 

GAACCGGCAG CACTACCAGA T GAT CCAGCG GGAGQACCAG GAGACGGCTG TGTAGCGGGC 2880 

Seq ID NOt 172 Protein sequence i 
Protein Accession #: NP_003637 

1 11 21 31 41 51 

1 I I 1 I I 

MEPRDGSPEA RSSDSESASA SSSGSERDAG PEPDKAPRRL NKRRFPGLRL FGHRKAITKS 60 

GLQHLAPPPP TPGAPCSESE RQIRSTVDWS ESATYGEHIW FETNVSGDFC YVGEQYCVAR 120 

MLKSVSRRKC AACKIWHTP CIEQLEKINF RCKPSFRESG SRNVREPTFV RHHWVHRRRQ 180 

DGKCRHCGKG FQQKFTFHSK EIVAISCSWC KQAYHSKVSC FMLQQIEEPC SLGVHAAWI 240 

PPTWILRARR PQNTLKASKK KKRASFKRXS SKKGPEEGRW RPFIIRPTPS PLMKPLIjVFV 300 

NPKSGGNQGA KIIQSFLWYL NPRQVFDLSQ GGPKEALEMY RKVHNLRILA CGGDGTVGWI 360 

LSTLDQLRUC PPPPVAILPL GTGNDLARTL NWGGGYTDEP VSKILSHVEE GNWQLDRWD 420 

LHAEPNPEAG PEDRO EGATD RLPLDVFNNY FSLGFDAHVT LEFHESREAN PEKFNSRFRN 480 

KMFYAGTAFS DFLMGSSKDL AKHIRWCDG MDLTPKIQDL KPQCWFLNI PRYCAGTMPW 540 

GHPGEHHDFE PQRHDDGYLE VIGFTMTSLA ALQVGGHGER LTQCREWLT TSKAIPVQVD 600 

GEPCKLAASR IRIALRNQAT MVQKAKRRSA APLHSDQQPV PEQLRIQVSR VSMHDYEALH 660 

YDKEQLKEAS VPLGTVWPG DSDLELCRAH IERLQQEPDG AGAKSPTCQK LSPKWCFU3A 720 

TTASRFYRID RAQEHLNYVT EIAQDEIYIL DPELLGASAR PDLPTPTSPI* PTSPCSPTPR 780 

SLQGDAAPPQ GEEIiIEAAKR NDFCKLQELH RAGGDLMHRD EQSRTLIiHHA VSTG8KDWR 840 

YLLDHAPPEI LDAVEENGET CLHQAAALGQ RTICHYIVEA GASLMKTDQQ GDTPRQRAEK 900 
AQDTELAAYL ENRQHYQMIQ REDQETAV 

Seq ID NO: 173 DNA sequence 
Nucleic Acid Accession ft: AF232772 
Coding sequence: 1-1662 

1 11 21 31 41 51 

I I I I I I 

ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60 

GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACGGGCTACC AGTTCATCCA CACGGAAAAG 120 

CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 180 

CTTTTTGCCT TCCTGGAGCA CCGGCGGATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240 

TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCAT ACCAGGAGGA CCCTGACTAC 300 

TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360 

GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG " 420 

GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480 

GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 540 

AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600 

TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGT CCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780 

GTGCGGTACT GGATGG CCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 
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CTCTACTCCC TCCTCTATAT GTCCAGCCTT 
ATCAACAAAT CTGGCTGGGG CACCTCTGGC 
CTCATTCCTG TGTCCATCTG GGTGGCAGTT 
TGCCAGGACC TGTTCAGTGA GACAGAGCTA 
GGCTGCTACT GGGTGGCCCT CCTCATGCTA 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT 
CGGGTAAAGT GCAATGGGTA AGGGAGGGAA 
GGGAGGAGGG AGTGCTGTGT TTTAGTCTCT 
AAGAACGGTG ATGTAGTATG GCCTGACAGC 
AGATGCAGGG CTGCAGGGGA TTCTGTGTTT 
TAGGCAGTAG CCTCCTCCTG GGCTCCAGAG 
AGTCCCATTC AGTGGCAACT TGTGATAGGT 
TTCTCCCAGC CCATCTGAAC ACAACCAGAG 
GGCOGGTTAG TGTATGTCAC CCCCACCCCA 
GCGCGTGAGA TACAAGGCCC AGAAGCCTGA 
ATGGTGCTTT ATGTGAGATA CCCCACTCCA 
CAGGGAGTTA GCACTGAACT GCTTTTAAAA 
GGAACAAAGA GATTGTGGTG GTGCTAAAGG 
TTCCACCTGG AAACTGCTCA GACGTCTAGA 
GGGGAGATAA AAAGATTAAG CCCCAACATG 
TAACCTGTAT ACTCTTGAAT TCCTCTCAAA 
CCCCACTTCA CTTTCTTCAA AGCCACATTT 
CCCTCATCAT CATAGGTAAG GTTTTCAAGG 
TAGAAGCTTC AGCAGGAGGC AAGCGTGTTC 
TGATCAAATT GGCTACAATC TTGGAGCTGC 
ATGTGTGACT TTCAGGCTAC TGTTCTTGAC 
TCCCAAAGTG AACTCTCAAA TCCAAAATGG 
GGCTTCTCCA GGGAATTCTT ACAGCCAAGT 
TCCAGAAACC AAACTAGGAG ATGAAACTGG 
CTCATGCCTC CTGAGGCTGT T TTTGGCTGT 
GAAGCCATTT TCCAAGTGAC TTGCAATCCA 
TGGGATCCTG ACTAAGCCTT TGACTTAAGG 
TCAAAGGGGC CAACTAACCC GTGCAGAACC 
AAGCCTCTAA TGTACCAAGT GCTTCCTACA 
GGGCAAACCC TGGTGCTTTC CTTCATCTCC 
AACAGTTGCC ACATCACACA GACCTCCAGT 
CCAACCCATT GAAGGCTGGA AGGCAGCAGG 
GTTCTGCTGG CCAAGAAGTT AAACTATTTT 
CAAGTGCAGA GTTCAGACTT CGCTAAGGGC 
AATGTAGGAT GACAGGCTCT CCTGGCTGTC 
GTCAACTTTC CTCAAATCAA AAACAGGCAG 
TCGACTGGTT TTTCTAAGTT ATTTTGTACA 
CAGCTTTATC CCCGTTTCTT GCAAGGGAAG 
TTTCCTCATT GAGAATTCAA ATCCTCTTTT 
TTTATTTTTA CCTGCTTTTT TTTTTTTTTT 
CTTATTTGTC AAAATAAAGA TTCTCACAT 

Seq ID NOx 174 Protein sequence! 
Protein Accession #: AAP36984 



PCT/US02/12476 



MPVQLTTALR 
LFAFLBHRJRM 
WDGNRQEDA 
STP6CXMQKW 
VGGDVQXLNK 
DWYHQKFLGS 
YPREWLYNSL 
IIKATYACPL 
LIPVSIWVAV 
KKPEQYSLAF 



11 
I 

WGTSLPALA 
RRAGQALKLP 
YMLDIFHEVL 
GGKREVMYTA 
YDSWISPL93 
KCSFGDDRHL 
WFHKHHLWMT 
RGNAEMIFMS 
LLEGLAYTAY 
ABV 



21 

I 

VLGGILAAYV 
SPRRGSVAXiC 
GGTEQAGFFV 
FKALGDSVDY 
VRYWMAFNVE 
TNRVLSLGYR 
YESWTGFFP 
LYSLLYMSSL 
CQDLFSETEL 



CTGCCGGCCA 
CGAAAAACCA 
CTCCTGGAGG 
GCCTTCCTTG 
TATCTGGCCA 
GCTGAGGTGT 
GGGGAATGGA 
TAATGGTCCA 
TCTGTTTAGA 
TCAGACTGCC 
GGCACTCAGA 
ACCTGAGTGA 
GTGGCAGGAG 
CCCATAAGTA 
TCTTTGGGCA 
CATCAACATT 
GTGCACATTA 
AGGCCATAAG 
TGGGTTCTTA 
TTCAGAAAAG 
TTCAGCTCTG 
TTTGAGGTAT 
TGGCAATTGG 
TCAGCACATA 
TTGGACGGAT 
AATCATCTCC 
TTATCTTTGA 
TGTGACAGTC 
TTCCTACATC 
TTTCCCTCTG 
GGCTGTTCTC 
GTTGCTTGCT 
AGCACTAAGG 
AAGACGCAAG 
CACGAACTCA 
TTCTGGTAAG 
CATTTGCTAA 
GAGCATTAGA 
TTGTTTTTCT 
CTACCATCAG 
GTACAGGTAG 
TTTTTCAGCA 
AGCCTTTATA 
GTATTGTTTC 
TAATTTTCAG 



31 
I 

TGYQFIHTBK 
IAAYQEDPDY 
WRSNFHEAGE 
IQVCDSDTVL 
RACQSYFGCV 
TKYTARSKCL 
FFLIATVIQL 
ItPAKIFAIAT 
AFLVSGAILY 



AGATCTTTGC 
TTGTGGTGAA 
GGCTGGCCTA 
TCTCTGGGGC 
TCATCGCCCG 
GACATGGCCC 
AGAGAAAAGA 
AAGGACAAAT 
GGAGGCAACA 
TGTCTGCTTG 
AGTTGTGCTA 
CGGCAACCTG 
AATTTCTACT 
GTCATCAATG 
TCAGAAAACA 
CCAGGGATGA 
AAAAGGAAAG 
CTACACAGAG 
GCTTGTCTGT 
AAGTGAAGTC 
ATCTGAGGCT 
CACTGCAGTC 
GGCGGAGCCC 
TGGGAACTAT 
TCCTTGGCAG 
AATGGAAAGC 
GACCATCCAT 
ACTGCATTTG 
CTAAGGTTCT 
CTGCTTTTGG 
AGCGTTTTGA 
TGCCCTCCAA 
TGGACAGCAG 
GTGTGCTCCG 
AGGGTTTTCC 
ACTGCTGGTT 
GGCAGCTGAT 
ATGGAGGAAA 
TCAGCATTTA 
CTCTGCCTTG 
TGGGCTCACA 
GCAAAACCAA 
CAATTGGACG 
TACAATAATT 
GTCAAGTTTT 



CATTGCTACC 
CTTCATTGGC 
CACAGCTTAT 
TATACTGTAT 
GCGATGTGGG 
CCAAGCAGAG 
CAGGGTGGGA 
CTAAAATGCA 
CTGATCCCCC 
CATCTGCACA 
AACCAAGTTA 
CGGAAGGAGG 
GAGCGAGGTG 
CAATAAGATT 
GGGTCCAGGA 
GCCAAACCAG 
TTTGCCAGGA 
GCCTTGGGTG 
GATCTCTGCT 
TTGGGTATTT 
AAGACACACT 
ACCTCTTCTA 
CGGCTTCTTA 
GAGGAGCCTC 
CCGGGTTAGC 
TTTTCAGTGT 
TCT CCTCAGT 
CCTGCTTCTT 
TGCTTTCTCT 
GGAATGAGGG 
GTTTAAAACC 
ATGTCCTTTC 
ACAAGAGGGC 
AACCACAGAT 
AAGTGTAGCT 
GACATCAGAC 
" CCAGGCAATC 
TCCGGTCAGC 
CTTGAAGATT 
CACTGTGGTC 
ACGTTTGACC 
ACTGGGTCTT 
CATTTTGGTT 
TGTAAACATA 
TTATACTGCA 



HYLSFGLYGA 
LRKCLRSAQR 



DPACTIEMLR 
QCISGPLGMY 
TETPTKYIxRW 
FYRGRIWNIL 
' INKSGWGTSG 
GCYWVALLML 



51 
I 

XLGLHLLIQS 
ISPPDUCWM 
MDRVRDWRA 
VLEEDPQVGG 
RNSI*LQQFI*E 
LNQQTRWSKS 
liFLLTVQIiVG 
RKTIWNPIG 
YIiAIIARRCG 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2680 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
378Q 
3840 
3900 
3960 
4020 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO» 175 DNA sequence 
Nucleic Acid Accession #» NM_00069l 
Coding sequence: 43.. 1404 



70 



75 



80 



85 



1 
1 

CCAGGAGCCC 
GCCGTGAAGC 
TTCCAGCAGC 
GCGCTGGCCG 
GTCCTAGAGG 
GTGGAGAAGA 
GTGGTCCTCG 
GGCGCCATCG 
GCGAGCCTGC 
AATGGGGGTG 
ACGGGCAGCA 
GTCACGCTGG 
GTGGCCTGCC 
CCAGACTACA 
TCACTGAAAG 
AGTGCCCGGC 
GGCACCGGGG 



11 
I 

CAGTTACCGG 
GCGCCCGCGC 
TGGAGGCGCT 
CAGACCTGCA 
AGATCGAGTA 
CGCCCCAGAC 
TCATTGGCAC 
CTGCAGGGAA 
TGGCTACCAT 
TCCCTGAGAC 
CGGGGGTGGG 
AGCTGGGAGG 
GACGCATCGC 
TCCTCTGTGA 
AGTTCTACGG 
ACTTCCAGAG 
ATGCCGCCAC 



21 
1 

GAGAGGCTGT 
CGCCTTCAGC 
GCAGCGCCTG 
CAAGAATGAA 
CATGATCCAG 
TCAGCAGGAC 
CTGGAACTAC 
CGCAGTGGTC 
CATCCCCCAG 
CACGGAGCTG 
GAAGATCATC 
GAAGAGTCCC 
CTGGGGGAAA 
CCCCTCGATC 
GGAAGATGCT 
GGTGATGGGC 
TCGCTACATA 



31 
) 

GTCAAAGGCG 
TCGGGCAGGA 
ATCCAGGAGC 
TGGAACGCCT 
AAGCTCCCTG 
GAGCTCTACA 
CCCTTCAACC 
CTCAAGCCCT 
TACCTGGACA 
CTCAAGGAGA 
ATGACGGCTG 
TGCTACGTGG 
TTCATGAACA 
GAGAACCAAA 
AAGAAATCCC 
CTGATTGAGG 
GCCCCCACCA 



41 

! 

CCATGAGCAA 
CCCGTCCGCT 
AGGAGCAGGA 
ACTATGAGGA 
AGTGGGCCGC 
TCCACTCGGA 
TCACCATCCA 
CGGAGCTGAG 
AGGATCTGTA 
GGTTCGACCA 
CTGCCAAGCA 
ACAAGAACTG 
GTGGCCAGAC 
TTGTGGAGAA 
GGGACTATGG 
GCCAGAAGGT 
TCCTCACGGA 



51 
I 

GATCAGCGAG 
GCAGTTCCGA 
GCTGGTGGGC 
GGTGGTGTAC 
GGATGAGCCC 
GCCACTGGGC 
GCCCATGGTG 
TGAGAACATG 
CCCAGTAATC 
TATCCTGTAC 
CCTGACCCCT 
TGACCTGGAC 
CTGCGTGGCC 
GCTCAAGAAG 
AAGAATCATT 
GGCTTATGGG 
CGTGGACCCC 



60 
120 
180 
' 240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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CAGTCCCCGG TGATGCAAGA GGAGAT CTTC GGGCCTGTGC TGCCCATCGT GTGCGTGCGC 1080 

AGCCTGGAGG AGGCCATCCA GTTCATCAAC CAGCGTGAGA AGCCCCTGGC CCTCTACATG 1140 

TTCTCCAGCA ACGACAAGGT GATTAAGAAG ATGATTGCAG AGACATCCAG TGGTGGGGTG 1200 

GCGGCCAACG ATGTCATCGT CCACATCACC TTGCACTCTC TGCCCTTCGG GGGCGTGGGG 1260 

AACAGCGGCA TGGGATCCTA CCATGGCAAG AAGAGCTTCG AGACTTTCTC TCACCGCCGC 1320 

TCTTGCCTGG TGAGGCCTCT GATGAATGAT GAAGGCCTGA AGGTCAGATA CCCCCCGAGC 1380 

CCGGCCAAGA TGACCCAGCA CTGAGGAGGG GTTGCTCCGC CTGGCCTGGC CATACTGTGT 1440 

CCCATCGGAG TGCGGACCAC CCTCACTGGC TCTCCTGGCC CTGGAGAATC GCTCCTGCAG 1500 

CCCCAGCCCA GCCCCACTCC TCTGCTGACC TGCTGACCTG TGCACACCCC ACTCCCACAT 1560 

GGGCCCAGGC CTCACCATTC CAAGTCTCCA CCCCTTTCTA GACCAATAAA GAGACAAATA 1620 
CAATTTTCTA ACTCGG 

Seq ID NO: 176 Protein sequence i 
Protein Accession ft: NP_0006B2 

1 11 21 31 41 51 

MSKISEAVKR ARAAFSSGRT RPLQFRFQQL EALQRLIQEQ EQELVGALAA DLHKNEWNAY 60 

YEEWYVLEE IEYMIQKLPE WAADEPVEKT PQTQQDELYI HSEPLGWLV IGTWNYPFNL 120 

TIQPMVGAIA AGNAWLKPS ELSENMASLL ATIIPQYLDK DLYPVINGGV PETTELLKER 180 

FDHILYTGST GVGKI IMTAA AKHLTPVTLE LGGKSPCYVD KNCDLDVACR RIAWGKFMNS 240 

GQTCVAPDYI LCDPSIQNQI VEKLKKSLKE FYGEDAKKSR DYGRIISARH FQRVMGLIEG 300 

QKVAYGGTGD AATRYIAPTI LTDVDPQSPV MQEEIFGPVL PIVCVRSLEE AIQFINQREK 360 

PLAI>YMPSSN DKVIKKMIAE TSSGGVAAND VIVHITLHSL PFGGVGNSGM GSYHGKKSFE 420 
TFSHRRSCLV RPLMNDEGLK VRYPPSPAKM TQH 

Seq ID NO: 177 DNA sequence 

Nucleic Acid Accession #» NM_001067.1 

Coding sequence: 108-4703 

1 11 21 31 41 51 

CTAACCGACG CGCGTCTGTG GAGAAGCGGC TTGGTCGGGG GTGGTCTCGT GGGGTCCTGC 60 

CTGTTTAGTC GCTTTCAGGG TTCTTGAGCC CCTTCACGAC CGTCACCATG GAAGTGTCAC 120 

CATTGCAGCC TGTAAATGAA AATATGCAAG TCAACAAAAT AAAGAAAAAT GAAGATGCTA 180 

AGAAAAGACT GTCTGTTGAA AGAATCTATC AAAAGAAAAC ACAATTGGAA CATATTTTGC 240 

TCCGCCCAGA CACCTACATT GGTTCTGTGG AATTAGTGAC CCAGCAAATG TGGGTTTACG 300 

ATGAAGATGT TGGCATTAAC TATAGGGAAG TCACTTTTGT TCCTGGTTTG TACAAAATCT 360 

TTGATGAGAT TCTAGTTAAT GCTGCGGACA ACAAACAAAG GGACCCAAAA ATGTCTTGTA 420 

TTAGAGTCAC AATTGATCCG GAAAACAATT TAATTAGTAT ATGGAATAAT GGAAAAGGTA 480 

TTCCTGTTGT TGAACACAAA GTTGAAAAGA TGTATGTCCC AGCTCTCATA TTTGGACAGC 540 

TCCTAACTTC TAGTAACTAT GATGATGATG AAAAGAAAGT GACAGGTGGT CGAAATGGCT 600 

ATGGAGCCAA ATTGTGTAAC ATATTCAGTA CCAAATTTAC TGTGGAAACA GCCAGTAGAG 660 

AATACAAGAA AATGTTCAAA CAGACATGGA TGGATAATAT GGGAAGAGCT GGTGAGATGG 720 

AACTCAAGCC CTTCAATGGA GAAGATTATA CATGTATCAC CTTTCAGCCT GATTTGTCTA 780 

AGTTTAAAAT GCAAAGCCTG GACAAAGATA TTGTTGCACT AATGGTCAGA AGAGCATATG 840 

ATATTGCTGG ATCCACCAAA GATGTCAAAG TCTTTCTTAA TGGAAATAAA CTGCCAGTAA 900 

AAGGATTTCG TAGTTATGTG GACATGTATT TGAAGGACAA GTTGGATGAA ACTGGTAACT 960 

CCTTGAAAGT AATACATGAA CAAGTAAACC ACAGGTGGGA AGTGTGTTTA ACTATGAGTG 1020 

AAAAAGGCTT TCAGCAAATT AGCTTTGTCA ACAGCATTGC TACATCCAAG GGTGGCAGAC 1080 

ATGTTGATTA TGTAGCTGAT CAGATTGTGA CTAAACTTGT TGATGTTGTG AAGAAGAAGA 1140 

ACAAGGGTGG TGTTGCAGTA AAAGCACATC AGGTGAAAAA TCACATGTGG ATTTTTGTAA 1200 

ATG CCTTAAT TGAAAACCCA ACCTTTGACT CTCAGACAAA AGAAAACATG ACTTTACAAC 1260 

CCAAGAGCTT TGGATCAACA TGCCAATTGA GTGAAAAATT TATCAAAGCT GCCATTGGCT 1320 

GTGGTATTGT AGAAAGCATA CTAAACTGGG TGAAGTTTAA GGCCCAAGTC CAGTTAAACA 1380 

AGAAGTGTTC AGCTGTAAAA CATAATAGAA TCAAGGGAAT TCCCAAACTC GATGATGCCA 1440 

ATGATGCAGG GGGCCGAAAC TCCACTGAGT GTACGCTTAT CCTGACTGAG GGAGATTCAG 1500 

CCAAAACTTT GGCTGTTTCA GGCCTTGGTG TGGTTGGGAG AGACAAATAT GGGGTTTTCC IS 60 

CTCTTAGAGG AAAAAT ACT C AATGTTCGAG AAGCTTCTCA TAAGCAGATC ATGGAAAATG 1620 

CTGAGATTAA CAA TAT CATC AAGATTGTGG GTCTTCAGTA CAAGAAAAAC TATGAAGATG 1680 

AAGATTCATT GAAGACGCTT CGTTATGGGA AGATAATGAT TATGACAGAT CAGGACCAAG 1740 

ATGGTTCCCA CATCAAAGGC TTGCTGATTA ATTTTATCCA TCACAACTGG CCCTCTCTTC 1800 

TGCGACATCG TTTTCTGGAG GAATTTATCA CTCCCATTGT AAAGGTATCT AAAAACAAGC 1860 

AAGAAATGGC ATTTTACAGC CTTCCTGAAT TTGAAGAGTG GAAGAGTTCT ACTCCAAATC 1920 

ATAAAAAATG GAAAGTCAAA TATTACAAAG GTTTGGGCAC CAGCACATCA AAGGAAGCTA 1980 

AAGAATACTT TGCAGATATG AAAAGACATC GTATCCAGTT CAAATATTCT GGTCCTGAAG 2040 

ATGATGCTGC TATCAGCCTG GCCTTTAGCA AAAAACAGAT AGATGATCGA AAGGAATGGT 2100 

TAACTAATTT CATGGAGGAT AGAAGACAAC GAAAGTTACT TGGGCTTCCT GAGGATTACT 2160 

TGTATGGACA AACTACCACA TATCTGACAT ATAATGACTT CATCAACAAG GAACTTATCT 2220 

TGTTCTCAAA TTCTGATAAC GAGAGATCTA TCCCTTCTAT GGTGGATGGT TTGAAACCAG 2280 

GTCAGAGAAA GGTTTTGTTT ACTTGCTTCA AACGGAATGA CAAGCGAGAA GTAAAGGTTG- 2340 

CCCAATTAGC TGGATCAGTG GCTGAAATGT CTTCTTATCA TCATGGTGAG ATGTCACTAA 2400 

TGATGACCAT TATCAATTTG GCTCAGAATT TTGTGGGTAG CAATAATCTA AACCTCTTGC 2460 

AG CCCATTGG TCAGTTTGGT ACCAGGCTAC ATGGTGGCAA GGATTCTGCT AGTCCACGAT 2520 

ACATCTTTAC AATGCTCAGC TCTTTGGCTC GATTGTTATT TCCACCAAAA GATGATCACA 2580 

CGTTGAAGTT TTTATATGAT GACAACCAGC GTGTTGAGCC TGAATGGTAC ATTCCTATTA 2640 

TTCCCATGGT GCTGATAAAT GGTGCTGAAG GAATCGGTAC TGGGTGGTCC TGCAAAATCC 2700 

CCAACTTTGA TGTGCGTGAA ATTGTAAATA ACATCAGGCG TTTGATGGAT GGAGAAGAAC 2760 

CTTTGCCAAT GCTTCCAAGT TACAAGAACT TCAAGGGTAC TATTGAAGAA CTGGCTCCAA 2820 

ATCAATATGT GATTAGTGGT GAAGTAGCTA TTCTTAATTC TACAACCATT GAAAT CTCA G 2880 

AGCTTCCCGT CAGAACATGG ACCCAGACAT ACAAAGAACA AGTTCTAGAA CCCATGTTGA 2940 

ATGGCACCGA GAAGACACCT CCTCTCATAA CAGACTATAG GGAATACCAT ACAGATACCA 3000 

CTGTGAAATT TGTTGTGAAG ATGACTGAAG AAAAACTGGC AGAGGCAGAG AGAGTTGGAC 3060 

TACACAAAGT CTTCAAACTC CAAACTAGTC TCACATGCAA CTCTATGGTG CTTTTTGACC 3120 

ACGTAGGCTG TTTAAAGAAA TATGACACGG TGTTGGATAT TCTAAGAGAC TTTTTTGAAC 3180 

TCAGACTTAA ATATTATGGA TTAAGAAAAG AATGGCTCCT AGGAATGCTT GGTGCTGAAT 3240 

CTGCTAAACT GAATAATCAG GCTCGCTTTA TCTTAGAGAA AATAGATGGC AAAATAATCA 3300 
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TTGAAAATAA GCCTAAOAAA GAATTAATTA AAGTTCTGAT TCAGAGGGGA TATGATTCGG 3360 

ATCCTGTGAA GGCCTGGAAA GAAGCCCAGC AAAAGGTTCC AGATGAAGAA GAAAATGAAG 3420 

AGAGTGACAA CGAAAAGGAA ACTGAAAAGA GTGACTCCGT AACAGATTCT GGACCAACCT 3480 

TCAACTATCT TCTTGATATG CCCCTTTGGT ATTTAACCAA GGAAAAGAAA GATGAACTCT 3540 

GCAGGCTAAG AAATGAAAAA GAACAAGAGC TGGACACATT AAAAAGAAAG AGTCCATCAG 3600 

ATTTGTGGAA AGAAGACTTG GCTACATTTA TTGAAGAATT GGAGGCTGTT GAAGCCAAGG 3660 

AAAAACAAGA TGAACAAGTC GGACTTCCTG GGAAAGGGGG GAAGGCCAAG GGGAAAAAAA 3720 

CACAAATGGC TGAAGTTTTG CCTTCTCCGC GTGGTCAAAG AGTCATTCCA CGAATAACCA 3780 

TAGAAATGAA AGCAGAGGCA GAAAAGAAAA ATAAAAAGAA AATTAAGAAT GAAAATACTG 3840 

AAGGAAGCCC TCAAGAAGAT GGTGTGGAAC TAGAAGGCCT AAAACAAAGA TTAGAAAAGA 3900 

AACAGAAAAG AGAACCAGGT ACAAAGACAA AGAAACAAAC TACATTGGCA TTTAAGCCAA 3960 

TCAAAAAAGG AAAGAAGAGA AATCCCTGGC CTGATTCAGA ATCAGATAGG AGCAGTGACG 4020 

AAAGTAATTT TGATGTCCCT CCACGAGAAA CAGAGCCACG GAGAGCAGCA ACAAAAACAA 4080 

AATTCACAAT GGATTTGGAT TCAGATGAAG ATTTCTCAGA TTTTGATGAA AAAACTGATG 4140 

ATGAAGATTT TGTCCCATCA GATGCTAGTC CACCTAAGAC CAAAACTTCC CCAAAACTTA 4200 

GTAACAAAGA ACTGAAACCA CAGAAAAGTG TCGTGTCAGA CCTTGAAGCT GATGATGTTA 4260 

AGGGCAGTGT ACCACTGTCT TCAAGCCCTC CTGCTACACA TTTCCCAGAT GAAACTGAAA 4320 

TTACAAACCC AGTTCCTAAA AAGAATGTGA CAGTGAAGAA GACAGCAGCA AAAAGTCAGT 4380 

CTTCCACCTC CACTACCGGT GCCAAAAAAA GGGCTGCCCC AAAAGGAACT AAAAGGGATC 4440 

CAGCTTTGAA TTCTGGTGTC TCTCAAAAGC CTGATCCTGC CAAAACCAAG AATCGCCGCA 4500 

AAAGGAAGCC ATCCACTTCT GATGATTCTG ACTCTAATTT TGAGAAAATT GTTTCGAAAG 4560 

CAGTCACAAG CAAGAAATCC AAGGGGGAGA GTGATGACTT CCATATGGAC TTTGACTCAG 4620 

CTGTGGCTCC TCGGGCAAAA TCTGTACGGG CAAAGAAACC TATAAAGTAC CTGGAAGAGT 4660 

CAGATGAAGA TGATCTGTTT TAAAATGTGA GGCGATTATT TTAAGTAATT ATCTTACCAA 4740 

GCCCAAGACT GGTTTTAAAG TTACCTGAAG CTCTTAACTT CCTCCCCTCT GAATTTAGTT 4800 

TGGGGAAGGT GTTTTTAGTA CAAGACATCA AAGTGAAGTA AAGCCCAAGT GTTCTTTAGC 4860 

TTTTTATAAT ACTGTCTAAA TAGTGACCAT CTCATGGGCA TTGTTTTCTT CTCTGCTTTG 4920 

TCTGTQTTTT GAGTCTGCTT TCTTTTGTCT TTAAAACCTG ATTTTTAAGT TCTTCTGAAC 4980 

TGTAGAAATA GCTATCTGAT CACTTCAGCG TAAAGCAGTG TGTTTATTAA CCATCCACTA 5040 

AGCTAAAACT AGAGCAGTTT GATTTAAAAG TGTCACTCTT CCTCCTTTTC TACTTTCAGT 5100 

AGATATGAGA T AGAG CAT AA TTATCTGTTT TATCTTAGTT TTATACATAA TTTACCATCA 5160 

GATAGAACTT TATGGTTCTA GTACAGATAC TCTACTACAC TCAGCCTCTT ATGTGCCAAG 5220 

TTTTTCTTTA AGCAATGAGA AATTG CT CAT GTTCTTCATC TTCTCAAATC ATCAGAGGCC 5280 

AAAGAAAAAC ACTTTGGCTG TGTCTATAAC TTGACACAGT CAATAGAATG AAGAAAATTA 5340 

GAGTAGTTAT GTGATTATTT CAGCTCTTGA CCTGTCCCCT CTGGCTGCCT CTGAGTCTGA 5400 

ATCTCCCAAA GAGAGAAACC AATTTCTAAG AGGACTGGAT TGCAGAAGAC TCGGGGACAA 5460 

CATTTGATCC AAGATCTTAA ATGTTATATT GATAACCATG CTCAGCAATG AGCTATTAGA 5520 

TTCATTTTGG GAAATCTCCA TAATTTCAAT TTGTAAACTT TGTTAAGACC TGTCTACATT 5580 

GTTATATGTG TGTGACTTGA GTAATGTTAT CAACGTTTTT GTAAATATTT ACTATGTTTT 5640 
TCTATTAGCT AAATTCCAAC AATTTTGTAC TTTAATAAAA TGTTCTAAAC ATTGC 

Seq ID NO: 178 Protein sequence: 
Protein Accession #: NP_001058.1 

1 11 21 31 41 51 

I I I I I I 

MEVSPLQPVN ENMQVNKIKK NEDAKKRLSV ERIYQKKTQL EHILLRPDTY IGSVELVTQQ 60 

MWVYDEDVGI NYREVTFVPG LYKIFDEIfcV NAADNKQRDP KMSCIRVTID PENNIjISIWN 120 

NGKGIPWEH KVEKMYVPAL IFGQLLTSSN YDDDEKKVTG GRNGYGAKLC NIFSTKFTVE 180 

TASREYKKMF KQTWMDNMGR AGEMELKPFN GEDYTCITFQ PDLSKFKMQS LDKDIVALMV 240 

RRAYDIAGST KDVKVFLNGN KLPVKGFRSY VDMYLKDKLD ETGNSLKVIH EQVNHRWEVC 300 

LTMSEKGFQQ ISFVNSIATS KGGRHVDYVA DQIVTKLVDV VKKKNKGGVA VKAHQVKNHM 360 

WIFVNALIEN PTPDSQTKEN MTLQPKSFGS TCQLSEKFIK AAIGCGIVES ILNWVKFKAQ 420 

VQLNKKCSAV KHNRIKGIPK LDDANDAGGR NSTECTLILT EGDSAKTLAV SGLGWGRDK 480 

YGVFPLRGKI LNVREASHKQ IMENAEINNI IKIVGLQYKK* NYEDEDSLKT LRYGKIMIMT 540 

DQDQDGSHIK GLLINFIHHN WPSIiLRliRFT* EEPITPIVKV SKNKQEMAFY SLPEPEEWKS . 600 

STPNHKKWKV KYYKGLGTST SKEAKEYFAD MKRHRIQFKY SGPEDD AAI S LAFSKKQIDD 660 

RKEWLTNFME DRRQRKLLGL PEDYLYGQTT TYLTYNDFIN KELILFSNSD NERSIPSMVD 720 

GLKPGQRKVL FTCFKRNDKR EVKVAQLAGS VAEMSSYHHG EMSLMMTIIN LAQNFVGSNN 780 

LNLLQPIGQP GTRLHGGKDS ASPRYIFTML SSLARLLFPP KDDHTLKFLY DDNQRVEPEW 840 

YIPIIPMVLI NGAEG1GTGW SCKIPNFDVR EIVNNIRRLM DGEEPLPMLP SYKNFKGTIE 900 

ELAPNQYVIS GEVAILNSTT IEISELPVRT WTQTYKEQVL EPMLNGTEKT PPL ITDYREY 960 

HTDTTVKFW KMTEEKIiAEA ER VGLHKVF K LQTSLTCNSM VLFDHVGCLK KYDTVLDILR 1020 

DFFELRLKYY GLRKEWLLGM LGAESAKLNN QARFILEKID GKIIIENKPK KELIKVLIQR 1080 

GYDSDPVKAW KEAQQKVPDE EENEESDNEK ETEKSDSVTD SGPTFNYIiLD MP L WYLTKEK 1140 

KDELCRLRNE KEQELDTLKR KSPSDLWKED LATFIEELEA VEAKEKQDEQ VGLPGKGGKA 1200 

KGKKTQMAEV LPSPRGQRVI PRITIEMKAE AEECKNKKKIK NENTEGSPQE DGVELEGLKQ 1260 

RLEKKQKREP GTKTKKQTTL AFKPIKKGKK RNPWPDSESD RSSDESNFDV PPRETEPRRA 1320 

ATKTKFTMDL DSDEDFSDFD EKTDDEDFVP SDASPPKTKT SPKLSNKELK PQKSWSDLE 1380 

ADDVKGSVPL SSSPPATHFP DBTEITNPVP KKNVTVKKTA AKSQSSTSTT GAKKRAAPKG 1440 

TKRDPAIiNSG VSQKPDPAKT KNRRKRKPST SDDSDSNFEK IVSKAVTSKK SKGESDDFHM 1500 
DFDSAVAPRA KSVRAKKPIK YLEESDEDDL P 



Seq ID NO: 179 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 148-7095 

1 11 21 31 41 

I I I I I 

CACACATACG CACGCACGAT CTCACTTOGA TCTATACACT GGAGGATTAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC 



51 

I 

AACAAACAAA 60 

AGGAGCCGCA 120 

CGCTTGCATT 180 

ACAGAGAAAA 240 

TTGGGGAAAG 300 

AGATCTTACA 360 

ATCATTGGAA 420 
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AACACATTCA TTCATAACAC TQGQAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA AT CCTT AAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2S20 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2 640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTG GTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3 240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CT CTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3 600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3 720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3B40 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3 900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACT CACT AT C TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4 800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4 860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGC ACA CTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TAT AT CAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 
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AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGT CAT A TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAQ GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 708 0 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 180 Protein sequence: 
Protein Accession tf: Eos sequence 

1 11 21 31 41 51 

1 1 I I 1 I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEBIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTPIHNTGK TVEINLTNDY RVSGGVSBMV 120 

FKAEKITFHW GKCNMSBDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALPPPILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HBFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDI SLTS QTVTEIjPPHT VEGTSASLND 540 

GSKTVLRSPH MNIiSGTAESti NTVSITEYEE ESLLTSPKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHY ST FAY FP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ PSYSSEVPPL VTPIjLLDNQI 780 

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE IiFRHLHTVSQ 840 

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETI* EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

8LFSGPSHIP IPKSSL1TPT ASLLQPTHAIj SGDGEWSGAS SDSEFIiLPDT DGLTALNISS 1020 

PVSVAEFTYT TSVPGDDNKA LSKSEI IYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

LNASIiQETSV SISSTKGMFP GSLiAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLIiKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEXYE PVLLKSESSH QWPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVIjSID 1320 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLIjFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440 

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560 

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620 

HESRIGLAEG LESEKKAVIP LVIVSALTFI CLWLVGII*! YWRKCFQTAH FYLEDSTSPR 1680 

VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETIjKEFYQE VQSCTVDLGI 1740 

TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPRAYIAA 1800 

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860 

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920 

AKRHAVGPW VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980 

QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQt»I* SQSNIQQSDY 2040 

SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLIi 2100 

HTIKDFWRMI WDHNAQLWM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160 

NEEKL1IQPF ILEATQDDYV liEVRHPQCPK WPNPDSPISK TFEtil SVIKE EAANRDGPMI 2220 

VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINIj MRPGVFADIE QYQFLYKVIL 2280 
SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 

Seq ID NO j 181 DNA sequence 

Nucleic Acid Accession ttt Eos sequence 
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Coding sequence: 148-4518 

1 11 21 31 41 51 

I 1 1 1 1 1 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGQATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG * 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG I860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3 0 00 

TTCTGGAGAA TGATATG GGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3 060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3 900 

TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3 960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGC CTTTTT G CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 
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AQTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG 
ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTT CTA 
ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT 
GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT 
ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC 
TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 



Seq ID NOi 182 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MR ILiKRFLAC IQLLCVCRIiD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TOVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRPSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDPKA1IDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTV9ISESQL AVPCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEE IHEAV CSSEPENVQA DPENYTELLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VltlPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSPSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780 

PLVIVSALTP ICLWLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI PPISDDVGAI 840 

PIKHPPKHVA DLHASSGPTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900 

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 

GSQKGRPSGR WTQYHYTQW PDMGVPEYSL PVLTPVRKAA YAKRHAVGPV WHCSAGVGR 1080 

TGTYIVLDSM LQQIQHEGTV NIPGFLKHIR SQRNYLVQTE EQYVPIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLBKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEPIITQHPL LHTIKDFWRM IWDHNAQLW 1260 

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLI IQD FIIiEATQDDY 1320 

VLEVRHPQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 
AALPDGNIAE 9LESLV 

Seq ID NO: 183 dna sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequencet 148-4494 

1 11 21 31 41 51 

I 1 I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTGAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATT CATG A AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACT CG CAG AC AACCCAACCG 2400 

GTATACAATG AGG CCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 



GACCAAATTT 5040 

GTTCTGTGTA 5100 

TTCTGACATT 5160 

TTTGTGGAAA 5220 

CAAACATTGT 5280 

TTAAAAAAAA 5340 
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GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 

GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 

ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 

CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 

GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 

AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 

CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 

GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 

AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 

TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 

GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 33 60 

AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 

GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 

GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 

GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 

GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 

GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 

CCTGTGGAAA GATCAAGGGT TGGCAT TTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 

AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3 840 

CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3 900 

GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3 960 

GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 

CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 

TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 

AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 

ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 

CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320 

AATCTGATGA GGCCAGGAGT CTTTG CTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 

GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 

AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 

ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 

GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 

TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 

GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 

AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860 

AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920 

GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 

GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 

ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT S100 

CATTAGCTGG TCTTACTCTA CCAG TTTT CT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 

CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 

AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 52 80 

TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 184 Protein sequence : 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

1111)1 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYY I YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFIiTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEI IKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTBLPPHT VEGTSASLND 540 

QSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPBTITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP .720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLWLVGILI YWRKCFQTAH F YLEDSTS PR VISTPPTPIP PISDDVGAIP 840 

IKHFPKHVAD LBASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900 

KIiAQIiAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960 

NLVBKGRRKC DQYWPADGSE EYGNFIiVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020 

GRWTQYHYT QWPDMGVPBY SLPVLTFVRK AAYAKRHAVG PVWHCSAGV GRTGTYIVLD 1080 

SMLOQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TBVLDSHIHA 1140 

YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200 

SSLSGEGTDY INASYIMGYY QSNEFIITQH PLIiHTIKDFW RMIWDHNAQL WMIPDGQNM 12 60 

AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320 

CPKWPNPDSP ISKTFEJjISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 13 80 

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VIliSLVSTRQ EENPSTSLDS NGAALPDGNI 1440 
AESLESLV 



Seq ID NOt 185 DNA sequence 

Nucleic Acid Accession it* EOS sequence 

Coding sequence: 501-4514 

1 11 21 31 41 SI 
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CACACATACG CACGCACGAT 
CAAAAAAAAC ATTTCCTTCG 
CGGCGAGGGG CCGCAGACCG 
CAGCTCCTCT GTGTTTGCCG 
CTTGTTGAAG AGATTGGCTG 
AATATCCAAC ATGTAATAGC 
AAGTAAATGT GAATCTTAAG 
ACACATTCAT TCATAACACT 
TCAGCGGAGG AGTTTCAGAA 
AATGCAATAT GTCATCTGAT 
AGATGCAAAT CTACTGCTTT 
GAAAAGGGAA GTTAAGAGCT 
ATTTCAAAGC GATTATTGAT 
TAGATCCATT CATACTGTTG 
ATGGCTCATT GACATCTCCT 
CAGTTAGCAT CTCTGAAAGC 
CTGGTTATGT CATGCTGATG 
TCTCTAGACA GGTGTTTTCC 
GTTCAGAACC AGAAAATGTT 
GGGAAAGACC TCGAGTCGTT 
AGTTGGATGG AGAGGACCAA 
GTGCTATTCT CAATAATTTG 
GCACTAATGG CTTATATGGA 
ATCCTGAACT TGATCTTTTC 
AAGAGGGAAA AGACATTGAA 
ACCAAATCAG GAAAAAGGAA 
CGAAATACAA TGAAG CCAAG 
AGGGTGATGT TCCCAATACA 
CAGAAAAAGA TATTTCCTTG 
AAGGTACTTC AGCCTCTTTA 
ACTTGTCGGG GACTGCAGAA 
GTTTATTGAC CAGTTTCAAG 
CAACTTCTGC TATCCCATTC 
AAAACCCAGA GACAATAACA 
AAGATTCAAC TTCATCAGGT 
TGTGGTTTCC TAGCTCTACA 
GCTTTCTCCA GACTAATTAC 
CCTTTTCTGC AGGCCCAGTG 
ATT ATT CT AC CTTTGCCTAC 
CCAGACAACA GGATTTGGTC 
TATACAATGA GGCCAGTAAT 
AATCCGAGAA GAAGGCAGTT 
TAGTGGTTCT TGTGGGTATT 
ACTTAGAGGA CAGTACATCC 
.TTTCAGATGA TGTCGGAGCA 
ATGCAAGTAG TGGGTTTACT 
AGAGCTGTAC TGTTGACTTA 
ACAAGAATCG ATACATAAAT 
TTGCTGAAAA GGATGGCAAA 
ACAGACCAAA AGCTTATATT 
GGAGAATGAT ATGGGAACAT 
AAGGAAGGAG AAAATGTGAT 
TTCTGGTCAC TCAGAAGAGT 
TAAGAAACAC AAAAATAAAA 
CACAGTATCA CTACACGCAG 
TGACCTTTGT GAGAAAGGCA 
ACTGCAGTGC TGGAGTTGGA 
AGATTCAACA CGAAGGAACT 
GAAATTATTT GGTACAAACT 
CCATACTTAG TAAAGAAACT 
TCCTCATTCC TGGACCAGCA 
AGTCAAATAT ACAGCAGAGT 
ATCGAACTTC TTCTATCATC 
GAGAAGGCAC AGACTACATC 
TCATCATTAC CCAGCACCCT 
ACCATAATGC CCAACTGGTG 
TTGTTTACTG GCCAAATAAA 
TGGCTGAAGA ACACAAATGT 
TAGAAGCTAC ACAGGATGAT 
CAAATCCAGA TAGCCCCATT 
CTGCCAATAG GGATGGGCCT 
CTTTCTGTGC TCTGACAACC 
ACCAGGTAGC CAAGATGATC 
ATCAGTTTCT CTACAAAGTG 
CCACCTCTCT GGACAGTAAT 
AGTCTTTAGT TTAACACAGA 
TCCTAAAATT AGGCAGGAAA 
ACAGTAACTT TCATGACATA 
TTTTTGCAAG ACTTGTAATT 
ATTTCTAAGA ATGGAATTGT 
TTATAGAGGT TAGGAATTCC 
CTGTATTTGT AGCAATTATC 
AATAAAACAC T CTTCCAT AT 
GAAATAATCT GTTACTTATT 
TTATAATTGT AGATTTTTAT 
TTTAGTTTAA TGACGTAGTT 



CTCACTTCGA 
CTCCCCCTCC 

tctggaaatg 
cctggattgg 
gtcctataca 
ccaaaacaat 

AAACTTAAAT 
GGGAAAACAG 
ATGGTGTTTA 
GGATCAGAGC 
GATGCGGACC 
TTAT CCATTT 
GGAGTCGAAA 
AACCTTCTGC 
CCCTGCACAG 
CAGTTGGCTG 
GACTACTTAC 
TCATACACTG 
CAGGCTGACC 
TATGATACCA 
ACCAAGCATG 
CTACCCAATA 
AAATACAGCG 
CCTGAATTAA 
GAAGGCGCTA 
CCCCAGATTT 
ACTAACCGAT 
TCTTTAAATT 
ACTTCTCAGA 
AATGATGGCT 
TCCTTAAATA 
CTTGATACTG 
ATCTCTGAGA 
TATGATGTCC 
TCAGAAGAAT 
GACATAACAG 
ACTGAGATAC 
ATGTCACAGG 
TTCCCAACTG 
TCCACGGTCA 
AGTAGCCATG 
ATACCCCTTG 
CTCATCTACT 
CCTAGAGTTA 
ATTCCAATAA 
GAAGAATTTG 
GGTATTACAG 
ATCGTTGCCT 
CTGACTGATT 
GCTGCCCAAG 
AATGTGGAAG 
CAGTACTGGC 
GTGCAAGTGC 
AAGGGCTCCC 
TGGCCTGACA 
GCCTATGCCA 
AGAACAGGCA 
GTCAACATAT 
GAGGAGCAAT 
GAGGTGCTGG 
GGCAAAACAA 
GACTATTCTG 
CCTGTGGAAA 
AATGCCTCCT 
CT CCTT CAT A 
GTTATGATTC 
GATGAGCCTA 
CTATCTAATG 
TATGTACTTG 
AGTAAAACTT 
ATGATTGTTC 
CTTATGCACC 
AATCTGATGA 
ATCCTCAGCC 
GGTGCAGCAT 
AAGGGGTGGG 
ATCAGTCTAG 
GGATTCTGCC 
TACTTATTAT 
GGTATTTTTT 
AAACTACAGA 
AGGTTTGCTA 
GATATTCAAC 
GTAAATACTG 
ATTTTACTAC 
CATTAGCTGG 



TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
CTCCTATCAA 
TTCAGGGTTG 
TGGAAATTAA 
AAGCAAGCAA 
ATAGTTTAGA 
GATTTTCAAG 
TGTTTGAGGT 
GTGTTAGTCG 
CAAACTCAAC 
ACACAGTTGA 
TTTTTTGTGA 
AAAACAATTT 
GAAAGGAAGA 
CAGAGAATTA 
TGATTGAGAA 
AATTTTTGAC 
TGAGTTATGT 
ACCAACTGAT 
TTGGAACTGA 
TTGTGAATCC 
CTACCACAAC 
CCCCAACAAG 
CCACTTCCCA 
CTGTGACTGA 
CTAAAACTGT 
CAGTTTCTAT 
GAG C TGAAG A 
ACATATCCCA 
TTATACCAGA 
CACTAAAGGA 
CACAGCCCGA 
GTGTTGATGA 
GTCCCTCAGT 
AGGTAACACC 
ACGTGGTATA 
AGTCTCGTAT 
TGATCGTGTC 
GGAGGAAATG 
TATCCACACC 
AGCACTTTCC 
AGACACTGAA 
CAGACAGCTC 
ATGATCATAG 
ATATCAATGC 
GCCCACTGAA 
TTATTGTCAT 
CTGCCGATGG 
TTGCCTATTA 
AGAAAGGAAG 
TGGGAGTACC 
AGCGCCATGC 
CATATATTGT 
TTGGCTTCTT 
ATGTCTTCAT 
ACAGTCATAT 
AGCTAGAGAA 
CAGCCCTAAA 
GATCAAGGGT 
ATATCATGGG 
CCATCAAGGA 
CTGATGGCCA 
TAAATTGTGA 
AGGAAAAACT 
AAGTGAGGCA 
TTGAACTTAT 
ATGATGAGCA 
AACTAGAAAA 
GGCCAGGAGT 
TTGTGAGCAC 
TGCCTGATGG 
GGGACTCACA 
TTCTGTTATC 
GCCAAATTTA 
GTTTGAACTA 
TCTGTATTGA 
AAATGTTTGT 
GAAATATAAC 
ATTTTACAAC 
CCCTAGTGTC 
TGAGTCAAGT 
TCTTACTCTA 
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GGAGGATTAA 
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAT 
TATTGATGAA 
GGATAAAACA 
TCTCACTAAT 
GATAACTTTT 
AGGACAAAAA 
TTTTGAGGAA 
TGGGACAGAA 
TTTTGGGAAG 
TGACAAGTAT 
CTGGATTGTT 
AGTTCTTACA 
TCGAGAGCAA 
GATTCATGAA 
TACCAGCCTT 
GTTTGCAGTT 
AGATGGCTAT 
TCTTCAGATA 
TGTCGACATG 
AGAAATAATC 
TGGTAGAGAC 
ACACTACAAT 
AGGAAGTGAA 
ACCAGTCACT 
ACTGCCACCT 
TCTTAGATCT 
AACAGAATAT 
TTCTT CAGGC 
AGGGTATATA 
ATCTGCTAGA 
TCCTTCTATG 
TGTTGGATCA 
ATCTGAGAAG 
TACAGATCTG 
TCATGCTTTT 
CTCGCAGACA 
TGGTCTAGCT 
AGCCCTGACT 
CTTCCAGACT 
TCCAACACCT 
AAAGCATGTT 
AGAGTTTTAC 
CAACCACCCA 
CAGGGTTAAG 
CAATTATGTT 
ATCCACAGCT 
GATAACAAAC 
GAGTGAGGAG 
TACTGTGAGG 
ACCCAGTGGA 
AGAGTACTCC 
AGTGGGGCCT 
GCTAGACAGT 
AAAACACATC 
TCATGATACA 
TCATGCCTAT 
ACAATTCCAG 
GCAATGCAAC 
TGGCATTTCA 
CTATTACCAG 
TTTCTGGAGG 
AAACATGGCA 
GAGCTTTAAG 
TATAATTCAG 
CTTTCAGTGT 
AAGTGTTATA 
TGGAGGAGTG 
AGAAAATTCC 
CTTTGCTGAC 
AAGGCAGGAA 
AAATATAGCT 
TCTGAG CATT 
TGTTGATTTC 
TATCATTAAC 
AAATGATTGA 
TTTTAACAGA 
TTTTAGTGTC 
TTTTAATACA 
TGCAGTATTC 
TCCATGGACC 
TTTCTAGTTC 
CCAGTTTTCT 



AACAAACAAA 
AGGAGCCGCA 
CGCTTGCATT 
ACAGAGAAAA 
TGGGGAAAGA 
GATCTTACAC 
TCATTGGAAA 
GACTACCGTG 
CACTGGGGAA 
TTTCCACTTG 
GCAGTCAAAG 
GAAAATTTGG 
CAGGCTGCTT 
TACATTTACA 
TTTAAAGATA 
ATGCAACAAT 
CAGTACAAGT 
GCAGTTTGTA 
CTTGTTACAT 
TTGTACCAGC 
CAAGACTTGG 
GTAGCCATAT 
CCTACTGATA 
AAGGAGGAGG 
AGTGCTACAA 
CGCATAGGGA 
TTCTCTGGAA 
AAATTAGCCA 
CACACTGTGG 
CCACATATGA 
GAGGAGGAGA 
TCCAGTCCCG 
TTTTCCTCCG 
AATGCTTCCG 
GAGGGAAATG 
GGCAGAGAGA 
ACAACCAAGT 
GAAATGCCAC 
ACCCCATCCT 
ACCCAACCGG 
GAGGGGTTGG 
TTTATCTGTC 
GCACACTTTT 
ATCTTTCCAA 
GCAGATTTAC 
CAGGAAGTGC 
GACAACAAGC 
CTAGCACAGC 
GATGGCTACA 
GAAGATTTCT 
CTCGTGGAGA 
TACGGGAACT 
AATTTTACTC 
CGTGTGGTCA 
CTGCCAGTGC 
GTTGTCGTCC 
ATGTTGCAGC 
CGTTCACAAA 
CTGGTTGAGG 
GTTAATGCAC 
CTCCTGAGCC 
AGGGAAAAGA 
TCCCTGAGTG 
AGCAATGAAT 
ATGATATGGG 
GAAGATGAAT 
GTCACTCTTA 
GACTTTATCT 
CCTAAATGGC 
AAAGAAGAAG 
ACGGCAGGAA 
GTGGATGTTT 
ATTGAGCAGT 
GAGAATCCAT 
GAGAGCTTAG 
GTTTTCCTCT 
CCATCACCTG 
AATGTGTGCC 
ATTTT ACAGT 
AAATTTCAAT 
AAATTTTTAG 
GTAGCCTGTA 
ACCTAAAGTA 
AAATTTATAT 
TGTGTAATTG 
GACATTGTAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
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TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATQTAATT TTAACTTTTG TGGAAAATAG 5220 

AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280 

ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NOi 186 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

I I I 1 I I 

MVPKASKITP HWGKCNMSSD GSEHSLEGQK FPLEMQIYCP DADRFSSFEE AVKGKGKLRA 60 

LSILFEVGTE ENLDFKAI ID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 120 

PCTDTVDWIV FKDTVSISES QLAVFCEVLT MQQSGYVMLM DYLQNNFREQ QYKPSRQVFS 180 

SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRW YDTMI EKPAV LYQQLDGEDQ 240 

TKHEPLTDGY QDLGAILNNL LPNMSYVLQI VAICTNGLYG KYSDQLIVDM PTDNPELDLF 300 

PEIiIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK 360 

TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISL TSQTVTELPP HTVEGTSASL 420 

NDGSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPP 480 

ISENISQGYI FSSENPETIT YDVLIPESAR NASEDSTSSG SEESLKDPSM EGNVWFPSST 540 

DITAQPDVGS GRESFLQTNY TEIRVDESEK TTKSPSAGPV MSQGPSVTDL EMPHYSTFAY 600 

FPTEVTPHAF TPSSRQQDLV STVNWYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 660 

IPLVIVSALT FICLWLVGI IilYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 720 

IPIKHFPKHV ADLHASSGPT EEFETLKEFY QEVQSCTVDL GITADSSNHP DNKHKNRYIN 780 

IVAYDHSRVK LAQXiAEXDGK LTD Y INANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH 840 

NVEVIVMITN LVEKGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 900 

KGSQKGRPSG RWTQYHYTQ WPDMGVPEYS LPVLTFVRKA AYAKRHAVGP VWHCSAGVG 960 

RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT EEQYVFIHDT LVEAIIjSKET 1020 

EVLDSHIHAY VNAbLIPGPA GKTKLEKQFQ LLSQSNIQQS DYSAALKQCN REKNRTSSII 1080 

PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LLHTIKDFWR MIWDHNAQLV 1140 

VMIPDGQNMA EDEPVYWPNK DEPINCESFK VTliMAEEHKC LSNEBKLIIQ DFILEATQDD 1200 

YVLEVRHFQC PKWPNPDSPI SKTFELISVI KEEAANRDGP MIVHDBHGGV TAGTFCALTT 1260 

LMHQLEKENS VDVYQVAKMI NLMRPGVFAD lEQYQFTiYKV ILSLVSTRQE ENPSTSLDSN 1320 
GAALPDGNIA ESLESLV 

Seq ID NO: 187 DNA sequence 

Nucleic' Acid Accession ft: EOS sequence 

Coding sequence: 148-4632 

1 11 21 31 41 51 

I t I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC AT CATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA S40 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCT CAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGT ACA AG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGT T ATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

.GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGQ GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG I860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAG ATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 
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CACAAGAATC GATACATAAA TATCOTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2 940 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTG GAAAG A TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACT CACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA T ATT CAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 188 Protein sequence: 
Protein Accession #j EOS sequence 

1 ll 21 31 41 51 

| | I i I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNIjKKL KPQGWDKTSL ENTPIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKTTFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCPDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDPKAIIDGV ESVSRFGKQA ALDPPIIjLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWrVFK DTVSISESQIi AVPCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSIiI»V TWERPRWYD TMIEKFAVIiY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKB EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SEN PET IT YD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFFSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CliWLVGILI YWRKCFQTAH FYLBDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900 

AYDHSRVKLA QLAEKDGKLT DYINANYVPG YNRPKAYIAA QGPLKSTAED FWRMIWBHNV 960 

EVIVMITNIiV EKGRRKCDQY WPADGSBEYG NFLVTQKSVQ VLAYYTVRNF TIiRNTKI KKG 1020 

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPW VHCSAGVGRT 1080 

GTYIVLDSML QQIQHEGTVN IFGFUKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKBTEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

YIMGYYOSNE FIITQHPLLH TIKDFWRMIW DHNAQLWMI PDGQNMAEDB FVYWPNKDEP 1320 

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380 

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDQNIAESL ESLV 



Seq ID NO: 189 DNA sequence 
Nucleic Acid Accession 8: NM_002820 
Coding sequence: 304.. 831 

1 11 21 31 41 51 

! I I I I I 

CCGGTTCGCA AAGAAGCTGA CTTCAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG 60 
CCCTGTTCCA CGAACCCAGG AGAACTGCTG GCCAGATTAA TTAGACATTG CTATGGGAGA 120 
CGTGTAAACA CACTACTTAT CATTGATGCA TATATAAAAC CATTTTATTT TCGCTATTAT 180 



263 
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TTCAGAGOAA GCGCCTCTGA 
GTTTGGAGAA AGCACAGTTG 
ACGATGCAGC GGAGACTGGT 
GTGCCCTCCT GCGGGCGCTC 
GAACATCAGC TCCTCCATGA 
CTTCACCATC TGATCGCAGA 
CCTAACTCCA AGCCCTCTCC 
GAGGGCAGAT ACCTAACTCA 
AAGACACCTG GGAAGAAAAA 
AAACGGCGAA CTCGCTCTGC 
GACCACCTGT CTGACACCTC 
CTGGCCCGTA GCCTCAGCGG 
GCTTGGACAA ACCTAGAATT 
CAGAGAATAA CTCAGAATAT 
TGTCCTCCAG CACCATAGAG 
CATCAATCCT TTACCACTCT 
ATCTTCATAA TTTGCTGGAG 
TTCTTCAGTG TTTTTCATTT 
GATATTATCT ACAAACACTG 
ACTTTTTATT TAATTAAATG 
TAAATTATGT TTTAAACACA 
CCAGCTCATA CAAAATAAAT 
GGTTTTTCTC ATGTATCTTT 
CCGTAGGAAA AATAAAACTT 



PCT/US02/12476 



TTTGTTTCTT 
GAGTAGCCGG 
TCAGCAGTGO 
GGTGGAGGGT 
CAAGGGGAAG 
AATCCACACA 
CAACACAAAG 
GGAAACTAAC 
GAAAGGCAAG 
CTGGTTAGAC 
CACAACGTCG 
GGTGCTCTCA 
TTCTCCCTTT 



AGGCGCTAGA 
ACCAAATAAT 
AAGTGTATTT 
CTTACGTTCT 
CAGAACAGCA 
TATTTAATTA 
TGCCTTAAAT 
GGTTTCTGAA 
TTGTTCATTG 
CACATTTAAA 



TTTTCCCTTT 
TTGCTAAATA 
AGCGTCGCGG 
CTCAQCCQCC 
TCCATCCAAG 
GCTGAAATCA 
AACCACCCCG 
AAGGTGGAGA 
CCCGGGAAAC 
TCTGGAGTGA 
CTGGAGCTCG 
GCTGGGTTTT 
ATGTATCTCT 
AAAGCAGTAC 
GCCCATTCCT 
TTCATATTCA 
CTTCCCCTTA 
TTCACTTCAA 
TCATGTCATA 
AATCTCAAAT 
TTGTTTAATT 
AATGTTTAAG 
GCAAGATGAA 



TTGCTCTTTC 
AGTCCCGAGC 
TGTTCCTGCT 
GCCTCAAAAG 
ATTTACGGCG 
GAGCTACCTC 
TCCQATTTGG 
CGTACAAAGA 
GCAAGGAGCA 
CTGGGAGTGG 
ATTCACGGTA 
GGAGCCTCCC 
ATOGATTGTG 
CCCCCTACCA 
CTTTCTCCAC 
AGCTTCAGAA 
CTCTCACACC 
GGGAGAATAT 
AACGATTCTG 
TTATTTTAAT 
AAATTTAACT 
TATTAACTTA 
ATAATTTTTC 



TGGCTGTGTG 
GCGAGCGGAG 
GAGCTACGCG 
AGCTGTGTCT 
ACGATTCTTC 
GGAGGTGTCC 
GTCTGATGAT 
GCAGCCGCTC 
GGAAAAGAAA 
GCTAGAAGGG 
ACAGGCTTCT 
TTCTGCCTTG 
TAGCAATTQA 
CACACACCCC 
CGTCACCCAA 
GCTAGTGACC 
TGGGCAAACT 
AGAAGCATTT 
AGCCATTCAC 
GTAAAGAACT 
CTGGTTTCTA 
CAAGGATATA 
TAGGGTAATG 



Seg ID NO i 190 Protein sequence: 
Protein Accession #t NP_002811 



1 11 21 31 41 51 

I I I I I I 

MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 
HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VSTYKEQPLK 
TPGKKKKQKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 

Seq ID no i 191 DNA sequence 
Nucleic Acid Accession #: XM_0S9328 
Coding sequence: 52.. 1023 



1 

I 

GGGCTGTCCG 
CCTCGCATGC 
GGTATCGTGG 
GCGGCCACGG 
GCCAACCTGT 
GGCCCGGAAG 
GTGGATTTGC 
CTGGGCAGGG 
TGCCAGGTGT 
GAGCGCGGTG 
GTGGAGCGCG 
GACGCCTTCG 
GCCCTGGCGC 
GCGCACCCCG 
TTCTCTTGCT 
GCCCAGCTTG 
CCAGGGGAGG 
TGACCCCCTA 
TTAGTCCTGG 
GGACACTGCC 
AGCCTTCTTG 
TGGTGCCCCT 
CTATATTAAT 



11 
I 

GCCCACTCCC 
GCCTGGTGGT 
AGGCCTTTCT 
AGAGCGCGGC 
CCGAGGGCCG 
GCTTCTTCCT 
CTCAGGTGCG 
CCCCCACGCA 
TCGCCGAGGC 
TGGGTGGCTG 
ACGCCCGGGC 
TGGGCCTGAG 
GGGTCCTGGA 
GCTACCCCAG 
CTTGGGAGCG 
CCCAGGATGG 
AGGTCCCCTG 
CAGACAACCA 
CCCAGCCCAG 
ACCTCTGGGC 
GCTGCAGGCA 
CCATGTTGCA 
AAAATAACGT 



21 
I 

CTGGGAGCGC 
CACCGCGGAC 
GGCCGGGGCT 
GGAGCTGGCC 
CCCCGTGGGT 
TGGCAAGATG 
GGAGGAGCTC 
CGCGGACGGG 
GCTGCAGGCC 
CACTTGGCTG 
CGCCGTGGGC 
CACTTGCGGC 
AGGTACCCTA 
TGTGCCTCCC 
GCTGCATGAG 
CGTG CAG CTT 
TGAGCCCACT 
AGCACTAATC 
AGCTGGGACC 
TCAGGTCCTC 
GGCCTAGCCT 
ATGCAAACAC 
GTGTCTTTC 



31 
I 

GAGCGGTGGA 
GACTTTGGTT 
GTGACCAGCG 
CGCAGGCACA 
CCGGCCCGCC 
GGATTCCGGG 
GAGGCCCAAC 
CACCAGCACG 
TATGGGGTGC 
GAGGCCCCCG 
CCCTTCTCCC 
CGGCACATGT 
GCGGGCCACA 
ACCGGCGGCT 
CTGCGCGTCC 
TGCGCCCTCG 
CTGGAACCCT 
CCCTTAGTAC 
TGGAGCACGA 
ATGCCTCCAA 
GTGGCAGCGG 
CTTCACCACT 



41 

1 

CCCAGGCGGC 
ACTGCCCGCG 
TGTCCCTGCT 
GCATCCCCAC 
GTGGCGCCTC 
AGGCGGTGGC 
TAAGCTGCTT 
TGCACGTGCT 
GCTTTAOGCG 
CGCGTGCCTT 
GCCACGGCCT 
CCGCTCACCG 
CCCTGACAGC 
GCGGTGAAGG 
TCACCGCGCC 
ACGACCTGGA 
TCCTGGAACC 
CAAGAAAGGG 
TCTGTTGACT 
ATGGCATCTA 
GCTAGGGCCC 
GGGGCAGTGG 



51 
I 

CATGTCCCGC 
ACGCGATGAG 
GGTCAACGGT 
GGGCCTCCAC 
ATCGCTGCTC 
GGCCGGAGAC 
CCGGGAGCTG 
CCCAGGCGTG 
ACTGCCGCTG 
CGCCTGCGCC 
GCGGTGGACA 
CGTGTCCGGG 
CGAGCTGATG 
CCCCGACGCT 
CACGCTGCGG 
CTCCAAGAGG 
CTCCCTACTC 
GAGCCAGGAT 
TCCCTGGGTA 
GAGTTTGAGC 
GCAGAGCATT 
GGAGAGATGG 



Seq ID NOt 192 Protein sequence: 
Protein Accession #: xp 059328 



1 
I 

MSRPRMRLW 
GLHANLSEGR 
RELLGRAPTH 
ACAVERDARA 
ELMAHPGYPS 
SKRPGEEVPC 



11 
I 

TADDFGYCPR 
PVGPARRGAS 
ADGHQHVHVL 
AVGPPSRHGL 
VPPTGGCGEG 
EPTLEPFLEP 



21 

I 

RDEGIVEAFL 
SLLGPEGFFL 
PGVCQVFAEA 
RWTDAFVGLS 
PDAFSCSWER 
SLL 



31 
I 

AGAVTSVSLL 
GKMGFREAVA 
LQAYGVRFTR 
TCGRHMSAHR 
LHELRVLTAP 



41 51 

I I 
VNGAATESAA ELARRHSIPT 
AGDVDLPQVR EELEAQLSCF 
LPLERGVGGC TWLEAPARAP 
VSGAIiARVLE GTLAGHTI/TA 
TLRAQLAQDG VQLCALDDLD 



85 | 



Seq ID NOt 193 DNA sequence 

Nucleic Acid Accession #: NM_0056BB.l 

Coding sequence: 126.. 4439 

11 21 31 41 51 

I I I I 1 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 



240 
300 
360 
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480 
540 
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720 
780 
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900 
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1020 
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1140 
1200 
1260 
1320. 
1380 
1440 
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1560 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



60 
120 
180 
240 
300 



60 
120 



264 



WO 02/086443 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT QGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG GOO 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAQTTCT TACTTACATT AAATT T AT CA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 210 0 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCOACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820 

CCTGGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG AOGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 34 B0 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC " 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATT CCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTAT CATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 



265 



WO 02/086443 

GTTGGTTCCA AGCCCTGGAG 
ATTCCCACAC CTCCACAGTT 
CTCACCGCAG TCGTCGCACA 
CAGCTCTTGC TAATCAGTGT 
ACCTCAGGTT GCTGGTTGCT 
GGGGCTGGTA GCTCAGGTGG 
ATGTCGTGAC CAACTAGACA 
CAAAAATCTG AAAATGTGAA 
AAAAAAAAAA AAAAAAAA 



Seq ID NO » 194 Protein sequence: 
Protein Accession #i NP_005679.1 



CCAACTGCTG 
CAGTGGCAGG 
GTCTCTCTCT 
CTCACACTGG 
GTGTGGTTTG 
GCGTGGTCAC 
TTCTGTCX3CC 
TAAAATTATT 



CTTTTTGAGG 
GCTCAGGATT 
CTCTCTCCCC 
CGTAGAAGTT 
GTGTGTTCCC 
TGCTGTCATC 
TTAGCAT GTT 
TTGGATTTTG 



TGGCACTTTT 
TCGTGGGTCT 
TCAAAGTCTG 
TTTGTACTGT 
GCAAACCCCC 
AGTTGAATGG 
TGCTGAACAC 
TAAAAAAAAA 



TCATTTGCCT 
GTTTTCCTTT 
CAACTTTAAG 
AAAGAGACCT 
TTTGTGCTGT 
TCAGCGTTGC 
CTTGTGGAAG 
AAAAAAAAAA 



5400 
5460 
5520 
5580 
5640 
5700 
S760 
5820 



I 

MKDIDIGKEY 
LDASMHSQLR 
VAHKKGELSM 
LSIVCLMITQ 
ALNYRTGVRL 
PWAILGMIY 
VLTYIKFIKM 
HMTIiGPDLTA 
NKPASPHIKI 
VLAEQKGHUIi 
KTSLI SAILG 
CCLRPDLAII* 
NHIFNSAIRK 
FNNLLLGETP 
VPWSVYGVYI 
VSDSMKDNPH 
KFFDTTPTGR 
LVILFSVLHI 
tiDDNQAPPFL 
GLFQFTVRLA 
RYRENLPLVL 
IGLADI<RSKL 
SEVMENGDNF 
MLTIAHRliHT 



11 
I 

IIPSPGYRSV 
ILDEEHPKGK 
EDVW8LSKHE 
LAGFSGPAFM 
RGAILTMAFK 
NV1 ILGPTGF 
YAWVKAFSQS 
AQAPTWTVF 
EMKNATLAWD 
LDSDERPSPE 
QMTLLEGSIA 
PSSDLTEIGE 
HLKSKTVLFV 
PVEINSKKET 
QAAGGPLAFI* 
MQYYASIYAI* 
ILNRFSKDMD 
VSRVLIRELK 
FTCAMRWLAV 
SBTEARFTSV 
KKVSFTIKPK 
SIIPQEPVIiF 
SVGERQLLCI 
VLiGSDRIMVL 



21 

1 

RERTSTSGTH 
YHHGLSALKP 
SSDVNCRRLE 
VKHLLEYTQA 
KILKLKNIKE 
LGSAVFILFY 
VQKIREEERR 
NSMTFALKVT 
SSHSSIQNSP 
EEEGKHIHLG 
ISGTFAYVAQ 
RGANLSGGQR 
THQLQYLVDC 
SGSQKKSQDK 
VIMALFMLNV 
SMAVMLILKA 
EVDVRLPFQA 
RLDNITQSPF 
RLDLISIALZ 
ERINHYIKTL 
EKIGIVGRTG 
SGTVRSNLDP 
ARALLRHCKI 
AQGQWEFDT 



31 
1 

RDREDSKFRR 
IRTTSKHQHP 
RLWQEELNEV 
TESNLQYSLL 
XSLGELINIC 
PAMMFASRbT 
I LEKAGYFQG 
PFSVKSLSEA 
KliTPKMKKDK 
HLRLQRTLH5 
QAWILNATLR 
QRISLARALY 
DEVIFMKEGC 
GPKTGSVKKE 
GSTAFSTWWL 
IRGWFVKGT 
EMFIQNVILV 
LSHITSSIQG 
TTTGLMIVLM 
SLEAPARIKN 
SGKSSLGMAL 
FNQYTBDQIW 
LILDEATAAM 
PSVliLSNDSS 



41 

I 

TRPLECQDAL 
VDNAGLFSCM 
GPDAASLRRV 
LVLGLLLTEI 
SNDGQRMFEA 
AYFRRKCVAA 
ITVGVAPIW 
SVAVDRFKSLt 
RASRGKKEKV 
IDLEIQEGKIi 
DNILFGKEYD 
SDRSIYILDD 
ITERGTHEEL 
KAVKPEEGQIj 
SYWIKQGSGN 
LRAS3RLHDE 
FFCVGMIAGV 
LATIHAYNKG 
HGQIPPAYAG 
KAPSPDWPQE 
FRLVEIiSGGC 
DALERTHMKB 
DTETDLLIQB 
RPYAMFAAAB 



51 
1 

ETAARAEGLS 
TFSWLS8LAR 
WJIFCRTRLI 
VRSWSLALTW 
AAVGSLLAGG 
TDERVQKMNE 
VIASWTFSV 
FLMEEVHMIK 
RQLQRTEHQA 
VGICGSVGSG 
EERYNSVLNS 
PLSALDAHVG 
MNLNGDYATI 



TTVTRGNETS 
LFRRILRSPM 
FPWFIjVAVGP 
QEFLHRYQEL 
LAISYAVQXjT 

gevtfenaem 
ikidgvrisd 
ciaqlplkle 
tireafadct 

NKVAVKG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 195 DNA sequence 
Nucleic Acid Accession #: NMJJ06470 
Coding sequence: 228.. 1922 



1 
I 

GCTGTCCTGA 

CGCCAGCACA 

TTGCAGCAGC 

TGGGCCAAGG 

ATCTAATGGC 

CAGACTCTGG 

TGGGCTCCTC 

AGGGGGATOC 

GAAGAGTGAA 

TGCAGCCGCA 

ACCACAACTG 

ATCAGCAGTG 

TGGATGCAGC 

GGAAACTCAA 

TGGTGTCGGT 

CTGTGAGGAA 

TGAGCCAGGC 

GCAAGCAGGA 

ACTGCAAGTT 

ATAAACTCTC 

TGGAGAACTA 

CTCAAGTGTC 

GGGAACAGTT 

ATCICCGGCT 

ACCCGGACCT 

ACCTGCACAG 

CCTGCAAAGG 

TCTCCTGGAG 

CCCCACTCAA 

TCCTTTCCTT 

AATTTTCAGA 

TTGTAGATCT 

AGACTCCAGG 

GGTGATTTGT 

TCTGAATGAA 

GTTTGCAGTA 

AGAGCAGTGG 

CTGCCTCAGC 

TTTGTATTTT 



11 
I 

GCCTGAGTAC 
CAGTAATGAG 
TGCAATCATC 
GACAGAAGAA 
TCCAGGGCCA 
GTCACCCAGC 
GGAGAAGCTT 
TGCTGGTGAG 
GGCAGTGAAG 
TCAGGTGAAC 
GCGATACTGC 
CATCTGCCAG 
CCGCAGGGAC 
GTTGAATGAA 
GTCAGAGGTC 
GGCCCAGGCC 
CAACGGTATC 
GCTGGAGAGG 
TAAGAACACT 
GGGCATCCGC 
TAAGAAAAAG 
TGCCGTTGTT 
CCTCCAATAT 
GCAGGAGGAG 
CCCCAGCAGG 
GTACTATTTT 
CATCGACCGG 
CCTCCAATGG 
AGCTGGCCCT 
CTATGGCGTA 
ACCAGTCTAT 
GGGAGAGGAA 
AGCCATATCC 
GGGCAGAAAT 
AACATTCTCC 
ATTCTTTTTT 
CGCGATCTTG 
CTCCCGAGTA 
TAGTAGAGAT 



21 
I 

TCTAGCTGCC 

TGGCCGAGCT 

TAGGCGTGGT 

AGACAGCCTA 

CTGCCCAGGG 

CCAGATTCTG 

GGCAGGGAGA 

GGGAAAGAGG 

TCCTGTCTAA 

ATCAAACTGC 

CCTGCCCACC 

GACTGTTGCC 

AAGGAGGCTG 

AATGCCATCT 

AAAGCGGTGG 

AATGTGATGC 

AAGGCCCACC 

ATGGCGGCCA 

GAAGACATCA 

AAAGTTATCA 

CTCCAGGAGT 

CAGCGCAAAT 

GCGTATGACA 

AACCGCAAGG 

TTCCTGCACT 

GAGGTGGAGA 

AAAGGGGAGG 

AACGGGAAGG 

TTCCGGAGGC 

GAGTATGATA 

GCTGCCTTCT 

CCCGAGAAGC 

CAGACCTTTG 

AACTGCTGAT 

AGCTGCTCTC 

TT TTTTTTGA 

GCTCACTGCA 

GCTGGGATTA 

GGGGTTTCAC 



31 
1 

TTGTCGCCAT 
TCCTCTGGGA 
TCTCTTGTCT 
GGAGCAGAGC 
CCACTGCTCA 
GGTCAGCCAG 
CGGAGGAACA 
TCCTGTGTGA 
CCTGCATGGT 
AAAGCCACCT 
ACAGCCCACT 
AGGAGCACAG 
AACTCCAGTG 
CCAGGCTCCA 
CTGAAATGCA 
TCTTCTTAGA 
TGGAGTACAG 
TCAGCAACAC 
CCTTCCCTAG 
CGGAATCCAC 
TTTCCAAGGA 
ATTGGACTTC 
TCACGTTTGA 
TCACCAACAC 
GGCGGCAGGT 
TCTTCGGGGC 
AGCGCAACAG 
AGTTCACGGC 
TCGGGGTCTA 
CCATGACTCT 
GGCTTTCCAA 
CAGCACCGTC 
CCAGCTACAG 
GGTAGCTGGC 
TTTTGCTCCA 
GACGGAGTCT 
AGCTCCGCCT 
CAGGTGCCTG 
CATGTTGGCC 



41 

I 

CGCATCTGGC 
GGGAGGAAAC 
GACTTGGGCT 
CTCCCAGATG 
GCCCCCAGCC 
CCCAGTGGAA 
GGACAGCX3AC 
CTTCTGCCTT 
GAATTACTGT 
GCTGACCGAG 
GTCTGCTTTC 
TGGCCACACC 
CACCCAGTTA 
GGCTAACCAA 
GTTTGGGGAA 
GGAGAAGGAG 
GAGTGCCGAG 
TGTCCAGTTC 
TGTTTACGTA 
TGTACACTTA 
AGAGGAGTAT 
CAAACCTGAG 
CCOGGACACA 
CACGCCCTGG 
GCTGTCCCAG 
AGGCACCTAT 
TTGCATTTCC 
CTGGTACAGT 
TATCGACTTC 
GGTTCACAAG 
GAAGGAAAAC 
CTTGGGGGTG 
TGATGGGATT 
TTTTGAAATC 
TATGGTGCTG 
CGCACTGTTG 
CCCGAGTTCA 
CCACCACACC 
AGGCAGATCT 



51 
I 

TGCCATCCAG 
AGTTAAAATC 
GCACAGATCC 
GCTGAGTTGG 
CCTCTCAGCC 
GAAGAGGACG 
TCTGCAGAGC 
GATGACACCA 
GAAGAGCACT 
CCAGTGAAGG 
TGCTGCCCTG 
ATAGTCTCCC 
GACTTGGAGC 
AAGTCTGTTC 
CTCCTTGCTG 
CAAGCTGCGC 
ATGGAGAAGA 
TTGGAGGAGT 
GGGCTGAAGG 
ATCCAGTTGC 
GACATCAGAA 
CCCAGCACCA 
GCACACAAGT 
GAGCATCCCT 
CAGAGTCTGT 
GTTGGCCTGA 
GGAAACAACT 
GACATGGAGA 
CCGGGAGGGA 
TTTGCCTGCA 
GCCATCCGGA 
ACTGCTCCCT 
TGCATTTTAG 
CTATGGGGTC 
TTCTCTATGT 
CCCAGGCTGG 
AGCAATTCTC 
CAGCTAATGT 
CAAACTCCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
90 0 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



266 



5 

10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

ACCTCGTGAT GCACCCACCT CGGCCTCCCA AAGTGCTGGG ATTACATGCG TGAGCCACTG 
CGCCCTGCCT GTTTGTAGTA ATTTTTAGGC ACCAAATCTC CCTCATCTTC TAGTGCCATT 
CTCCTCTCTG TTCAGGTAAA TGTCACACTG TGCCCAGAAT GGATGACCAG GAACCTTAAA 
GAGTGGCTGA AAAGATTGCA GAGTTATCAT AATAAATTGC TAACTTGCGT 



PCT/US02/12476 



2400 
2460 
2520 



Seq ID NOt 196 Protein sequence* 
Protein Accession #: NP_00646l 



MAELDLMAPG 
DSAEQGDPAG 
EPVKDHNWRY 
LDLERKLKLN 
EQAALSQANG 
VGLKDKLSGI 
EPSTREQPLQ 
QQSLYLHRYY 
SDMETPLKAG 
NAIRIVDLGE 



11 
I 

PLPRATAQPP 
EGKEVLCDFC 
CPAHHSPLSA 
ENAISRLQAN 
I KAHLE YRS A 
RKVITESTVH 
YAYDITPDPD 
PEVEIPGAGT 
PPRRLGVYID 
EPEKPAPSLG 



21 
I 

APLSPDSGSP 
LDDTRRVKAV 
PCCPDQQCIC 
QKSVLVSVSE 
BMEKSKQELE 
LIQLLENYKK 
TAHKYLRLQE 
YVGLTCKGID 
FPGGILSFYG 
VTAP 



31 
I 

SPDSGSASPV 
KSCLTCMVNY 
QDCCQEHSGH 
VKAVAEMQPG 
RMAAISNTVQ 



41 



ENRKVTNTTP 
RKGEERNSCI 
VEYDTMTLVH 



Seq ID NO: 197 DNA sequence 
Nucleic Acid Accession #j NM_004316 
Coding sequence: 433-1149 



1 
I 

CCCGAGACCC 
GCGCGTTCAG 
GCGCCAGCGG 
GGAGGAGGGG 
TTGCTCCCAC 
GCTCCCGCTT 
GTCCCCCTCG 
CCGCCGCTGC 
• CAGCCGCAGC 
GCCGCGGCGG 
CAGCAGCAGC 
TCAGGGGGCG 
GAACTGATGC 
CAGCAGCCGG 
AACCTGGGCT 
AGTAAGGTGG 
GACGAGCATG 
CCCAACTACT 
GACGAGGGCT 
TGGTTCTGAG 
ATCGCACAAC 
AAAAGAAAAA 
CCAACCCCAT 
AGCGCTCAGA 
ACCTGAGTCA 
GAG CAGCACA 
GCTCGGGTCC 
GAGTTGG TGT 



11 
I 

GGCGCAAGAG 
CACTGACTTT 
CAGCCTCACA 
AGGGAGGAGG 
TCTAAGAAGT 
CATATTTCCT 
CGGGCCCCGC 
GCATGGAAAG 
CCCAGCAGCC 
CCGCAGCCGC 
AGCAGCAGCA 
GTCACAAGTC 
GCTGCAAACG 
CCGCCGTGGC 
TTGCCACCCT 
AGACACTGCG 
ACGCGGTGAG 
CCAACGACTT 
CTTACGACCC 
GGGCTCGGCC 
CTGCATCTTT 
AAAAGAAGAA 
CGCCAACTAA 
ACAGTATCTT 
ATGCGCAAAA 
CGCGTTATAG 
CTTCACCTCC 
CTTTC 



21 
I 

AGCGCAGCCT 
TGCTGCTGCT 
CGCGAGCGCC 
AGGCGGCGTG 
CTCCCGGGGA 
TTTCTTTCCC 
ACCTCGCGTC 
CTCTGCCAAG 
CTTCCTGCCG 
CGCAGCGGCA 
GCAGG CGCCG 
AGCGCCCAAG 
CCGGCTCAAC 
GCGCCGCAAC 
TCGGGAGCAC 
CTCGGCGGTC 
CGCCGCCTTC 
GAACT CCATG 
GCTCAGCCCC 
TGGTCAGGCC 
AGTGCTTTCT 
GAAGAAGAAA 
GCGAGGCATG 
TGCACTCCAA 
TGCAGCTTGT 
TAACTCCCAT 
CCGCCCTTTC 



31 

I 

TAGTAGGAGA 
TCTGCTTTTT 
ACGCGAGGCT 
CAGGGAGGAG 
TTTTGTATAT 
TCTCTGTTCC 
CCGGATCGCT 
ATGGAGAGCG 
CCCGCAGCCT 
GCGCAGAGCG 
CAGCTGAGAC 
CAAGTCAAGC 
TTCAGCGGCT 
GAGCGCGAGC 
GTCCCCAACG 
GAGTACATCC 
CAGGCAGGCG 
GCCGGCTCGC 
GAGGAGCAGG 
CTGGTGCGAA 
TGTCAGTGGC 
AGAGAAGAAG 
CCTGAGAGAC 
TCATTCACGG 
GTGCAAAAGC 
CACCTCTAAC 
TTAGAGTGCA 



CEEHIiQPHQV 
TIVSLDAARR 
ELLAAVRKAQ 
FLEEYCKFKN 
YDIRTQVSAV 
WEHPYPDLPS 
SGNNFSWSLQ 
KFACKFSEPV 



41 
I 

GGAACGCGAG 
TTTTTCTTAG 
CCCGAAGCCA 
AAAAAGCATT 
ATTTTTTAAC 
TGCACCCAAG 
CTGATTCCGC 
GOGGCGCCGG 
GTTTCTTTGC 
CGCAGCAGCA 
CGGCGGCCGA 
GACAGCGCTC 
TTGGCTACAG 
GCAACCGCGT 
GCGCGGCCAA 
GCGCGCTGCA 
TCCTGTCGCC 
CGGTCTCATC 
AGCTTCTCGA 
TGGACTTTGG 
GTTGGGAGGG 



ATGGCTTTCA 
AGATATGAAG 
AGTGGGCTCC 
ACGCACAGCT 
GTTCTTAGCC 



51 
I 

LGRETEEQDS 
NIKLQSHLLT 
DKEAELQCTQ 
ANVMLFLEEK 
TEDITFPSVY 
VQRKYWTSKP 
RFLHWRQVIiS 
WNGKEFTAWY 
YAAFWLSKKE 



51 
1 

ACGCGGCAGA 
AAACAAGAAG 
ACCCGCGAAG 
TTCACCTTTT 
TTCCGTCAGG 
TTCTCTCTGT 
GACTCCTTGG 
CCAGCAGCCC 
CACGGCCGCA 
GCAGCAGCAG 
CGGCCAGCCC 
GTCTTCGCCC 
CCTGCCGCAG 
CAAGTTGGTC 
CAAGAAGATG 
GCAGCTGCTG 
CACCATCTCC 
CTACTCGTCG 
CTTCACCAAC 
AAGCAGGGTG 
GGAGAAAAGG 
AAACAGTCAA 
GAAAACGGGA 
AGCAACTGGG 
TGGCAGAAGG 
GAAAGTTCTT 
CTCTAGAAAC 



Seq ID NO i 198 Protein sequence: 
Protein Accession ft: np_ 004307 



11 



21 



31 



41 
I 



51 



•MESSAKMESG GAGQQPQPQP QQPFLPPAAC FPATAAAAAA AAAAAAAQSA QQQQQQQQQQ 
QQQQAPQLRP AADGQPSGGG HKSAPKQVKR QRSSSPELMR CKRRLNPSGF GYSLPQQQPA 
AVARRNERER NRVKLVNLGF ATLREHVPNG AANKKMSKVE TLRSAVEYIR ALQQLLDEHD 
AVSAAFQAGV LSPTISPNYS NDLNSMAGSP VSSYSSDEGS YDPLSPEEQE LLDPTNWF 

Seq ID NO: 199 DNA sequence 
Nucleic Acid Accession th NM_007015 
Coding sequence: 1-1005 



ATGACAGAGA 
TGCAGCCCCC 
AAGGTGGGAG 
GCCTTCTACT 
ATCAATGGGA 
TTTAAAATGG 
ACAGGAATTC 
ATTCCTGAGG 
ATGCCAGTCA 
GACAACAGCT 
CTTAAACCAA 
GTTCCAACTA 
CTGAATAATG 



11 

I 

ACTCCGACAA 
CGGCGTACGC 
CCGTGGTCCT 
TCTGGAAGGG 
AACTACAAGA 
GAAGTGGAGC 
GTTTTGCTGG 
TGGGCGCCGT 
AATATGAAGA 
TCTTGAGTTC 
CCTATCCAAA 
CCACAAAAAG 
AAACCAGACC 



21 
I 

AGTTCCCATT 
TACGCTGACG 
CATTTCGGGA 
GAGCGACAGT 
TGGGTCAATG 
TGAAGAAGCA 
AGGAGAGAAG 
GACCAAACAG 
AAATTCTCTT 
TAAGGTGTTA 
AGAAATCCAG 
ACCACACAGT 
CAGTGTTCAA 



31 
I 

GCCCTGGTGG 
GTGAAGCCCT 
GCTGTGCTGC 
CACATTTACA 
GAAATAGACG 
ATTGCAGTTA 
TGCTACATTA 
AGCATCTCCT 
ATCTGGGTGG 
GAACT CTGCG 
AGGGAAAGAA 
GGACCACGGA 
GAGGACTCAC 



41 
I 

GACCTGATGA 
CCAGCCCCGC 
TGCTCTTTGG 
ATGTCCATTA 
CTGGGAACAA 
ATGATTTCCA 
AAGCGCAAGT 
CCAAACTGGA 
CTGTAGATCA 
GTGACCTTCC 
GAGAAGTGGT 
GCAACCCAGG 
AAGCCTTCAA 



51 
I 

CGTGGAATTC 
GCGGCTGCTC 
GGCCATCGGG 
CACCATGAGT 
CTTGGAGACC 
GAATGGCATC 
GAAGGCTCGT 
AGGCAAGATC 
GCCTGTGAAG 
TATTTTCTGG 
AAGAAAAATT 
CGCTGGAAGA 
TCCTGATAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



267 



WO 02/086443 

CCTTATCATC AGCAGGAAGG GGAAAGCATG ACATTCGACC CTAGACTGGA TCACGAAGGA 840 

ATCTGTTGTA TAGAATGTAG GCGGAGCTAC ACCCACTGCC AGAAGATCTG TGAACCCCTG 900 

GGGGGCTATT ACCCATGGCC TTATAATTAT CAAGGCTGCC GTTCGGCCTG CAGAGTCATC 960 

ATG CCATGT A GCTGGTGGGT GGCCCGTATC TTGGGCATGG TGTGAAATCA CTTCATATAT 1020 

CACGTGCTGT AAAATAAGAA CTAGCTGAAG AGACAACCAA AGAAGCATTA AGGCAGGTTG 1080. 

ATGCTGATGG GACCATAAAA TATTTTTACA CGCAGCCTGA GCGGTTATTC TTGACACTCT 1140 

TAACAGAATT TTTTTAATCG TTTTCCAGAA CTTTAGTATA TGCAAATGCA CTGAAAGGGT 1200 

AGTTCAAGTC TAAAATGCCA TAACCCCGTT ATTTGTTATT TTTTATTTGC ATTGATTTGC 1260 

CATAAGTCTT CCCTTGCTTG CATCTTCCAA AGCTATTTCG AAATAAACAC GAAAATTTAC 1320 
AGTTTGCC 



Seq ID NO: 200 Protein sequences 
Protein Accession #: NP_00 8946 



l 

I 

MTENSDKVPI 
APYPWKGSDS 
TGIRFAGGEK 
DNSPLSSKVL 
LNNETRPSVQ 
GGYYPWPYNY 



11 
I 

ALVGPDDVEF 
HIYNVHYTMS 
CYIKAQVKAR 
ELCGDLPIFW 
EDSQAFNPDN 
QGCRSACRVI 



21 31 41 51 

| I I I 

CSPPAYATLT VKPSSPARLL KVGAWLISG AVLLLFGAIG 
INGKLQDGSM EIDAGNNLET FKMGSGAEEA IAVNDFQNGI 
IPEVGAVTKQ SISSKLEGKI MPVKYEENSL IWVAVDQPVK 
LKPTYPKEIQ RERREWRKI VPTTTKRPHS GPRSNPGAGR 
PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKtCEPL 
MPCSWWVARI LGMV 



60 
120 
180 
240 
300 



Seq ID NO: 201 DMA sequence 

Nucleic Acid Accession tt: NMJ>00728.2 

Coding sequence: 112.. 495 



GTAATAAGAG 
GTCGACCGGC 
CGGAAGTTCT 
CAGGCGGCGC 
GAGGACGCGC 
GAGCTGAAGC 
AACACTG CCA 
GTGAAGAGCA 
GACCTTCAAG 
CATATCCTTA 
AAGGAGGCAC 
TGGAAGAAGA 
GAGAATAATT 
GGAAACTAAT 
GGTTATTTGG 
ACTGTACCAC 
GTATGTAGCA 
AGCATCTATT 
AAATTTTTGT 
TGGATGCAAG 
TTGCTTTTTC 
TCCAATTCAT 
TTGCCTAACT 
TATAGTTTTA 
TGAGAGGTGT 
TTTGTTAAAA 
CTGATCATAT 
ACCATTCTTT 
ACCAGATAAT 
CTAAAATATT 
TTCTGATGAG 
CATATTAATA 
GTTTTCTCTG 
ATATCTTGTT 
ATTTTTGTTT 
AAAAAAAAAA 



11 

I 

CGGGGTCTCC 
CGCTCGCGCT 
CCCCCTTCCT 
CATTCAGGTC 
GCCTCCTGCT 
AGGAGCAGGA 
CCTGTGTGAC 
ACTTCGTGCC 
CCTGAGCAGA 
TAAGAGATTC 
AAGCCAAGGA 
GCAGCCCTGC 
TCTGTTGTTT 
ACAATACATT 
AAAGTGTGTA 
TTCGCCTTCT 
GTATCTCATT 
TTACCATATG 
TGGCTTGCTT 
AT TGTTTTCA 
ATTTTCTTAG 

AAGGTCCCAA 
TATTTTATAT 
AGGTTGAAAT 
AGACTGTTAT 
TTGTGTGGGT 
TGCCAATGTC 
GTGGGTCTAC 
TTCTACATCT 
ATTTTTAATG 
ATATTAAGTC 
TTTTTTTTTT 
AGATTTTTAA 
TTAATTGTTC 
AAAAAAAAAA 



21 
I 

GCGGGGAAGG 
GCCCTGAAAC 
GGCTCTCAGT 
TGCCCTGGAG 
GG CTGCACTG 
GACACAGGGC 
TCATCGGCTG 
CACCAATGTG 
TGAATGACTC 
ACTCAGAAGA 
AGTCTGTGTC 
TGACACCTAG 
TAAGCCACAA 
TTCATTTATT 
TTTAACTCTG 
TGCCAGCCAC 
GCTGTTTTAA 
TTTATCACCT 
GCTTTATTAG 
GATATATAGT 
CAGTGTCTCT 
CTTTTATGTA 
GGTCACAATA 
GTAGATTAGT 
TCATACCTGT 
TTCACCATTT 
ATATTTCTGG 
ATACTGCCTT 
CAACATTGTT 
TTTATACATT 
GGATTGTGTT 
GTTCAATTCA 
TTTAACAGTG 
CTATTTTATT 
ATTGCTAGTA 
AAAAAAAAAA 



31 
I 

CGCCCACAGC 
TCTAGTCGCC 
ATCTTGGTCC 
AGCAGCCCAG 
GTGCAGGACT 
TCCAGCTCCG 
GCAGGCTTGC 
GGTTCCAAAG 
CAGGAAGAAG 
CACATGTGGA 
TACCAGAAGC 
AGTTTGGACT 
AGTTTGTGGT 
TTGGGTAAAT 
TAAGAAACTG 
ATATGAGAGC 
TTTGTATTTC 
TTATTGAAGG 
TGTTGAGTTT 
TTGGAAACTT 
CACAGAGAAA 
TTGTGCTTTT 
ACCTTATTCT 
GATCTATTTT 
GAATATAGAT 
AATTGCCCCT 
GTTCTCAATT 
GATTAGTGTA 
CATTCTTGTT 
TTAGAATCAG 
AAATCAGTGG 
TGAACACAAT 
TTCTCAGTTT 
TTTTGGTGCT 
GATAGAAATA 



41 

) 

AGGTGTGGTG 
AGAGAGGCGG 
TGTACCAGGC 
ACCCGGCCAC 
ATGTGCAGAT 
CTGCCCAGAA 
TGAGCAGATC 
CCTTTGGCAG 
GTGTGTCCTA 
GAAGGTGACA 
CAGAATCACA 
TCCAGCTTCC 
AATTTGTTAT 
GCCTTGGAGT 
CCAAACTATT 
TCTAGTATTT 
CCCAATGACT 
GTCTGTTTAA 
TTAGAGCTCT 
CCTTCCCCTG 
AAGTTGTAAT 
AGTTCATGTC 
ATACTTTCTT 
GAGTTAATTT 
ACCCAATTGT 
GCACCTTTGT 
CTGTCTCATT 
GTGTTAAAGT 
CAAAAAGATT 
TGTGTTACTA 
GTTAATTTTG 
ACATGTTTTC 
TCAACAGAAA 
AATGTAAATG 
CAATATTTAA 



51 
I 

TTCATCCCGG 
CATGGGTTTC 
GGGCAGCCTC 
ACT CAG T AAA 
GAAGGCCAGT 
GAGAGCCTGC 
AGGGGGCATG 
GCGCCGCAGG 
AATCCAATGA 
TGACAGAGGC 
GAACAGTCTC 
AGAACTGTGA 
GACAGCCCTA 
GGGATTGCTG 
TTCTGAAGTG 
CCACAAATAG 
AATGACGTTG 
ATCTTCTGCT 
TTATATGTTG 
AATCTGCGGA 
TTGAATAAGA 
TAAGAACTCT 
GTAAAAGTTT 
TTGTATAAGG 
TTCAGTGCCA 
CAAAAAGCAA 
GATTGATTTG 
GAATCTCAAA 
TTAGCTACAT 
TCTACAAAAT 
GGAGAATTAG 
ACTTATTTAG 
TATTCTACAC 
GTACTTAAAC 
AATATTAGGA 



Seq ID NO: 202 Protein sequence: 
Protein Accession #: NP_000719.1 
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51 



1 11 M 31 f 1 ( 

MGFRKFSPFL ALSILVLYQA GSLQAAPFRS ALESSPDPAT LSKEDARLLL AALVQDYVQM 
KASELKQEQE TQGSSSAAQK RACNTATCVT HRliAGLLSRS GGMVKSNFVP TNVGSKAFGR 
RRRDLQA 

Seq ID NO: 203 DNA sequence 
Nucleic Acid Accession «: NM_001741 
Coding sequence: 71.. 496 



60 
120 



1 11 21 

I I 
CTCTGGCTGG ACGCCGCCGC 
GAGAGGTGTC ATGGGCTTCC 
GTTGCAGGCA GGCAGCCTCC 
AGACCCGGCC ACGCTCAGTG 
CTATGTGCAG ATGAAGGCCA 



I 

CGCCGCTGCC 
AAAAGTTCTC 
ATGCAGCACC 
AGGACGAAGC 
GTGAGCTGGA 



31 

! 

ACCGCCTCTG 
CCCCTTCCTG 
ATTCAGGTCT 
GCGCCTCCTG 
GCAGGAGCAA 



41 51 

1 I 
ATCCAAGCCA CCTCCCGCCA 
GCTCTCAGCA TCTTGGTCCT 
GCCCTGGAGA GCAGCCCAGC 
CTGGCTGCAC TGGTGCAGGA 
GAGAGAGAGG GCTCCAGCCT 



60 
120 
180 
240 
300 



268 



5 
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50 
55 
60 
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75 
80 
85 



TCTGAGTACT 
CCAAACTGCA 
GAGAGACCAT 
TTCCTAATTT 
TGGTGGCTCT 
AGATGGGAGG 
GAAAATGAGG 
TGAATGTGCT 



TGCATGCTGG 
ATTGGGGTTG 
CGCCCTCATG 

TTGGGCTGGT 
AAAGAGAGCA 
GCCGCTTTGA 
GATCATTTGA 



GCACATACAC 
GAGCACCTGG 
TTAGCATGCC 
ATCCTTCCTA 
ATTGGTGGCT 
GGACTCACAG 
GTCCCCCAGA 
GGAATAAAAT 



WO 02/086443 

GGACAGCCCC AGATCTAAGC GGTGCGGTAA 
GCAGGACTTC AACAAGTTTC ACACQTTCCC 
AAAGAAAAGG GATATGTCCA GCGACTTGGA 
CCAGAATGCC AACTAAACTC CTCCCTTTCC 
TAACTTGATG CATGTGGTTT GGTTCCTCTC 
TTCCTTGTGG CAGAGGATGT CTCAAACTTC 
GTTGGAAGAG AATCACCTGG GAAAATACCA 
GATGTCATCA GAGCTCCTCT GTCCTGCTTC 
TATTTTTCCC C 



Seq ID NO: 204 Protein sequence: 
Protein Accession #: NP_001732 

1 11 21 31 41 51 

I I I.I I I 

MGFQKFSPFL ALSILVLLQA GSLHAAPFRS ALESSPADPA TIiSEDEARLL LAALVQDYVQ 
MKASELEQEQ EREGSSLDSP RSKRCGNLST CMLGTYTQDP NKPHTFPQTA IGVGAPGKKR 
DMSSDLERDH RPHVSMPQNA N 

Seq ID NO: 205 DNA sequence 
NUcleic Acid Accession #: NM_00S361 
Coding sequence: 1-945 



PCT/US02/12476 



i 
I 

ATGCCTCTTG 
GAGGCCCTGG 
TCCTCTTCTA 
CCTCCCCACA 
AGACAATCCG 
CTGGAGTCCG 
CTCCTCAAGT 
AG AAAT TGCC 
GTCTTTGGCA 
TGCCTGGGCC 
CTCCTGATAA 
ATCTGGGAGG 
CATCCCAGGA 
GTGCCCGGCA 
ACCAGCTATG 
TACCCACCCC 



11 
I 

AGCAGAGGAG 
GCCTGGTGGG 
CTCTAGTGGA 
GTCCTCAGGG 
ATGAGGGCTC 
AGTTCCAAGC 
ATCGAGCCAG 
AGGACTTCTT 
TCGAGGTGGT 
TCTCCTACGA 
TCGTCCTGGC 
AGCTGAGTAT 
AGCTGCTCAT 
GTGATCCTGC 
TGAAAGTCCT 
TGCATGAACG 



21 
I 

TCAGCACTGC 
TGCGCAGGCT 
AGTTACCCTG 

agcctccagc 
cagcaaccaa 
agcaatcagt 

GGAGCCGGTC 
TCCCGTGATC 
GGAAGTGGTC 
TGGCCTGCTG 
CATAATCGCA 
GTTGGAGGTG 
GCAAGATCTG 
ATGCTACGAG 
GCACCATACA 
GGCTTTGAGA 



31 
I 

AAGCCTGAAG 
CCTGCTACTG 
GGGGAGGTGC 
TTCTCGACTA 
GAAGAGGAGG 
AGGAAGATGG 
ACAAAGGCAG 
TTCAGCAAAG 
CCCATCAGCC 
GGCGACAATC 
ATAGAGGGCG 
TTTGAGGGGA 
6TGCAGGAAA 
TTCCTGTGGG 
CTAAAGATCG 
GAGGGAGAAG 



41 
I 

AAGGCCTTGA 
AGGAGCAGCA 
CTGCTGCCGA 
CCATCAACTA 
GGCCAAGAAT 
TTGAGTTGGT 
AAATGCTGGA 
CCTCCGAGTA 
ACTTGTACAT 
AGGTCATGCC 
ACTGTGCCCC 
GGGAGGACAG 
ACTACCTGGA 
GTCCAAGGGC 
GTGGAGAACC 
AGTGA 



Seq ID NO: 206 Protein sequence: 
Protein Accession ff: NP_005352 



l 
I 

MPLEQRSQHC 
PPHSPQGASS 
LLKYRAREPV 
CLGLSYDGLL 
HPRKLLMQDL 
YPPLHERALR 



11 21 

I I 
KPEEGLEARG EALGLVGAQA 
FSTTINYTLW RQSDEGSSNQ 
TKAEMLESVL RNCQDPFPVI 
GDNQVMPKTG LLIIVLAIIA 
VQENYIiEYRQ VPGSDPACYE 



31 



41 
I 



PATEEQQTAS SSSTLVEVTL 
EEEGPRMFPD LESEFQAAIS 
FSKASEYLQL VFGIEWEW 
IEGDCAPEEK IWEELSMLEV 
FLWGPRALIE TSYVKVLHHT 



Seq ID NO: 207 DNA sequence 
Nucleic Acid Accession #: NM_021115 
Coding sequence: 743-2893 



1 

I 

AAAGGAAGGG 
GGCACCGCCC 
CCCAAACTAA 

GCACCCTQAA 
GGGCGAGCTG 
ACCGCTGCTT 
TTCGCTCAAG 
CACTGTCCAA 
CACGGAGAAG 
AGAAGTGCCC 
GCAAATCTCC 
ACCCGGGGAG 
GGCCCTGATG 
GACCACTACC 
CTGCAGTGTG 
GCCCCTCAAC 
GGAGCTCCAG 
GGACGGCCCT 
CCGAAGCCCC 
GACCTTCCAO 
CTCTGGGGAT 
CCTGGGCTAT 
CTGGAGCAGC 
CATCGGCCGC 
CTGGACGATT 



11 

1 

AGGGAGGGAG 
TTAGGAGGGC 
CTGGTGTCTT 
CCTTACCTCC 
GAGAGAGTGG 
GTGCTGGATG 
CCAGAGGAGG 
CAGGTGAACT 
AGGGCAGGGT 
CCTGGCCCAC 
CTTTGGCTGG 
CCCTTCACTT 
CCTGGGCCTG 
GACAAAGGTG 
TCCACCATTA 
AGCTTCTCCA 
AACTTTCTGG 
GTGAAGAGTG 
ACCCTGACCG 
ACCAACACCA 
CTTCACTACC 
GTCACGGTGA 
GAGCTCCAGG 
CAGGAGCCCA 
GTCCTCTCCC 
GAAGCTCCAG 



21 

1 

AAAGGAGAAG 
CACCCTCAGA 
TTCTCCTCTT 
TGCCCTCAGG 
TAACAGCGCC 
GGACCGCACC 
CCCGCCCCAA 
CTGCCAGGAA 
CCCAGCCAGC 
CGGGGGACCC 
ACCGAAAGGA 
CGCAGCCCTA 
ACATGGCCCA 
AGAATGAGCT 
TCACCACCAC 
ATCCTGAGGG 
AGTGCACATA 
TGAACCTGTC 
TCCTGGCCAA 
TCTCCGTCTA 
AGGCCTTCAT 
TGGACCTGCA 
GCGCTAAGAT 
TCTGCTCAGC 
CAAGTTACCC 
AGGGCCAGAA 



31 

] 

TTGGTTTAGA 
GTCTGACAGC 
CCAAGATGCT 
AGCCCCGGAG 
CCCCAGTTCC 
CTCTGCACAT 
GCACGCCTTG 
GCAGCTGAGG 
GTCCCAGGGC 
GGACCCCATC 
GAGTGCGGTC 
TGTGGCCCAC 
GGAGGCCCCC 
GACTGGGTCA 
GGTCATCACC 
GTACATTGAC 
CAACGTGACA 
CGATGGGGAA 
CCAGACACTC 
CTTCCGGACC 
GCTGAGCTGC 
CTCAGGTGGG 
GCTGACATGC 
TCCTTGTGGA 
TGAAAACACA 
GCTGCACCTG 



41 

I 

GGCCAGCCGG 
AGGTGAAGGT 
CTTCCCGAGG 
AGAGGCAGTC 
TCACAGTCGG 
CACGACArCC 
CCCCCCAAGA 
CCCAAGGCCA 
CTAGATCTCC 
GTGGCCTCCG 
CCTACAACAC 
ACACTCCCCC 
CAGGAGGACA 
GCCTCAGAGG 
ACCGAGCAGG 
TCCAGCGACT 
GTCTACACTG 
CTGCTCTCCA 
CTGGTGGAGG 
TTCCAGGACG 
AACTTTCCCC 
GTGGCCCACT 
ATCAATGCCT 
GGGGCAGTGC 
AATGGGAGCC 
CACTTTGAGA 



51 

I 

GGCCCGAGGA 
GACCGCTTCT 
CTCACCGAGT 
CACTCTTTGG 
GTTTCCCGAC 
TCATTTTCTG 
GAGTGTCCTC 
CTTGCAGCTG 
CCTTGTCACC 
CAAGACAGGC 
TGAGGAGAAA 
TGTCTTCGCA 
GTACCGGCAG 
CCTCATTGAA 
TCACATTTCC 



51 
I 

GEVPAADSPS 
RKMVELVHFL 
PISHLYILVT 
FEGREDSVFA 
LKIGGEPHIS 



51 
I 

ACGAGCTTTG 
CCTAAATCTC 
GAGATGCTAG 
CTGGCAAAGA 
CGGAAGTGCT 
CAGCCCTGTC 
AGAAACTGCC 
CCTCCGCAGC 
TCTCCTCCTC 
AGGAGGCATC 
CCGCACCCCT 
AGAGGCCAGA 
CCAGCCCCAT 
AGAGCCAGGA 
CACCAGCTCT 
ACCCACTGCT 
GCTATGGGGT 
TCCGCGGGGT 
GGCAGGTAAT 
ACGGCCTTGG 
GCCGGCCTGA 
TTCACTGCCA 
CCAAGCCGCA 
ACAATGCCAC 
AATTCTGCAT 
GGCTGTTGCT 



360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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GCATGACAAG GACAGGATGA COGTTCACAG 
CGACTCCCTT CAAACCGAGA GTGTCCCTTT 
CCGCATCGAG TTCACGTCCG ACCAGGCCCG 
AGCGTTTGAG AAAGGCCACT GCTATGAGCC 
CGACCCGACC TATAACATTG GGACTATAGT 
GGAGCAGGGC CCGGCCATCA TCGAATGCAT 
AGAGCCCCTG TGCAGAGCCA TGTGTGGTGG 
GTCCCCAAAC TGGCCCGAGC CCTACGTGGA 
GGGAGAAGAG AAACGGATCT TCTTAGATAT 
CTTGACCATC TACGATGGCG ACGAGGTCAT 
CAGTGGCCCC CAGAAACTGT ACTCCTCCAC 
. CCCTGCTGGC CTCATCTTTG GAAAGGGCCA 
AAGGAATGAC TCCTGCTCGG ATTTACCCGA 
CACGGAGTTG GTGCGGGGAG CCAGAATCAC 
GGGGAGTGAC ACCCTCACCT GCCAGTGGGA 
TGAGAAAATT ATGTACTGCA CCGACCCCGG 
GGATCCTGTG CTGCTGGTGG GGACCACCAT 
TGAAGGGAGT TCTCTTCTGA CCTGCTACAG 
TCGCCTGCCC CACTGCGTTT CAGAAGCGGC 
GGCCCTGGCT ATCTTCATCC CGGTCCTCAT 
TTACATCACA AGATGTCGCT ACTATTCCAA 
CTACAGCCAG ATCACCGTGG AAACCGAGTT 
CCAAAAGGTT TAGGGTTTCA TTTAAAAAGA 
AACCCCAATT TCCCCGAGAC ATTTATCCAA 
AAAGGCGGCT GTTTTTTGGT TAAACTTTTT 
TTTATAAATT TTAAAAGTG 



CGGGCAGACC AACAAGTCAG CTCTTCTCTA 1620 

TGAGGGCCTG CTGAGCGAAG GCAACACCAT 1680 

GGCGGCCTCC ACCTTCAACA TCCGATTTGA 1740 

CTACATCCAG AATGGGAACT TCACTACATC 1800 

GGAGTTCACC TGCGACCCCG GCCACTCCCT 1860 

CAATGTGCGG GACCCATACT GGAATGACAC 1920 

GGAGCTCTCT GCTGTGGCTG GGGTGGTATT 1980 

AGGTGAAGAT TGTATCTGGA AGATCCACGT 2040 

CCAGTTCCTG AATCTGAGCA ACAGTGACAT 2100 

GCCCCACATC TTGGGGCAGT ACCTTGGGAA 2160 

GCCAGACTTA ACCATCCAGT TCCATTCGGA 2220 

GGGATTTATC ATGAACTACA TAGAGGTATC 2280 

GAT CCAGAAT GGCTGGAAAA CCACTTCTCA 2340 

CTACCAGTGT GACCCCGGCT ATGACATCGT 2400 

CCTCAGCTGG AGCAGCGACC CCCCATTTTG 2460 

AGAGGTGGAT CACTCGACCC GCTTAATTTC 2520 

CCAATACACC TGCAACCCCG GTTTTGTGCT 2580 

CCGTGAAACA GGGACTCCCA TCTGGACGTC 2640 

AGCAGAGACG TCGCTGGAAG GGGGGAACAT 2700 

CATCTCCTTA CTGCTGGGAG GAG CCTACAT 2760 

CCTCCGCCTG CCTCTGATGT ACTCCCACCC 2820 

TGACAACCCC ATTTACGAGA CAGGGGGAAC 2880 

GGTACCCTTT AAAAAGGGGC TTGTGAACTC 2940 

AGGCCCTGGG GGCCTTGATT TAAACCCCCA 3000 

AACAAAGGGT TACGGGTTTT TTCCCCGGAT 3060 



Seg ID NOt 208 Protein sequence: 
Protein Accession #* NP_06693 8 

1 11 21 31 41 51 

I I I I I I 

MAQEAPQEDT SPMALMDKGE NELTGS AS EE SQETTTSTII TTTVITTEQA PAIiCSVSFSN 60 

PEGYIDSSDY PLLPLNNFLE CTYNVTVYTG YGVELQVKSV NLSDGELLSI RGVDGPTLTV 120 

LANQTLLVEG QVIRSPTNTI SVYFRTFQDD GLGTFQIiHYQ AFMLSCNFPR RPDSGDVTVM 180 

DLHSGGVAHP HCHLGYELQG AKMLTCINAS KPHWSSQBPI CSAPCGGAVH NATIGRVLSP 240 

SYPENTNGSQ FCIWTIEAPE GQKLHLHFER LLLKDKDRMT VHSGQTNKSA LLYDSLQTES 300 

VPFEGLLSEG NTIRIEFTSD QARAASTFNI RFEAFEKGHC YEPYIQNGNF TTSDPTYNIG 360 

TIVEFTCDPG HSLEQGPAI I ECINVRDPYW NDTEPLCRAM CGGELSAVAG WLSPNWPEP 420 

YVEGEDCIWK IHVGEEKRIF LDIQFTiNLSN SDILTIYDGD EVMPHILGQY LGNSGPQKLY 480 

SSTPDLTIQF HSDPAGI»IFG KGQGFIMNYI EVSRNDSCSD LPEIQNGWKT TSHTEIiVRGA 540 

RITYQCDPGY DIVGSDTLTC QWDLSWSSDP PFCEKIMYCT DPGEVDHSTR LISDPVLLVG 600 

TTIQYTCNPG FVLEGSSItLT CYSRETGTPI WTSRLPHCVS EAAAETSLEG GNMALAIFIP 660 
VIiIISLLLGG AYIYITRCRY YSNLRLPIiMY SHPYSQITVE TEFDNP I YET GGTQKV 



Seg ID NO i 209 DNA sequence 

Nucleic Acid Accession #» NM_001327.1 

Coding sequence: 89-631 

1 11 21 31 41 51 

I I I I I I 

AGCAGGGGGC G CTGTGTGT A CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60 

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120 

GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180 

TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240 

AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGG CGGCG CGGCTTCAGG 300 

GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360 

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420 

GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480 

CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540 

CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660 

GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720 
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 



Seq ID NO s 210 Protein sequence: 
Protein Accession #: NP_001318.1 

1 11 21 31 41 51 

I I I I 1 I 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60 
PRGPKGGAAS GLNGCCRCGA RGPE S RLLE F YLAMP F ATPM EAELARRSIiA QDAPPt»PVPG 120 
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSIiLM WITQCFLPVF LAQPPSGQRR 



Seq ID NO i 211 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: S2-459 

1 11 21 31 41 51 

I I I I I I 

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60 
GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120 
CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180 
GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240 
CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300 
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GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360 

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 

GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540 

GCCTGATTGT TTGTCGCTGQ AQGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600 
CTGAGCTA 

Seq ID NO; 212 Protein sequence: 
Protein Accession #> Eos sequence 

1 11 21 31 41 51 

I I 1 I I I 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGBAGAT GGRGPRGAGA ARASQPRGGA 60 

PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120 
PLPVFLAQAP SGQRR 

Seq ID NO: 213 DNA sequence 
Nucleic Acid Accession #: NMJ>00555 
Coding sequences 416.. 1498 

1 11 21 31 41 51 

I I I I I I 

CTTATTTTTT ATGAATGTCG GATAGCTGCA CCAGCTTGGT GGGGAAAGGG TTTGATGAAT 60 

AGCACAAAGA CACTGGCTGT TCCCTGGAGG CTGTCCCTTT AAAGGAGAAT CTTAGTTTAT 120 

TCTGGGGGGA GGGGATGCAC ACATTAGAGT AGGAAAGAGG GCTTGGAATA AAATGAAAAC 180 

ACTCCCCCTT CATAGTCATT GTACTGAAAT GCAAAGACTG CTTCCTAAGC TGGAGATGCT 240 

AACCTTGGGT AGCTCCTTCT GTTCTCTTCA AGGGGAATTT TGTCAGGCTA TGGATTCATT 300 

TACAACTGTT AGTCATGTGG GCATGTGTGA GGAAACAGAT GCCAGTTTTA ATGTATTTAG 360 

CCCGAAGTTC CAATTTGATA GGAGCCACTG TCAGTCTCTG AGGTTCCACC AAAATATGGA 420 

ACTTGATTTT GGACACTTTG ACGAAAGAGA TAAGACATCC AGGAACATGC GAGGCTCCCG 480 

GATGAATGGG TTGCCTAGCC CCACTCACAG CGCCCACTGT AGCTTCTACC GAACCAGAAC 540 

CTTGCAGGCA CTGAGTAATG AGAAGAAAGC CAAGAAGGTA CGTTTCTACC GCAATGGGGA 600 

CCGCTACTTC AAGGGGATTG TGTACGCTGT GTCCTCTGAC CGTTTTCGCA GCTTTGACGC 660 

CTTGCTGGCT GACCTGACGC GATCTCTGTC TGACAACATC AACCTGCCTC AGGGAGTGCG 720 

TTACATTTAC ACCATTGATG GATCCAGGAA GATCGGAAGC ATGGATGAAC TGGAGGAAGG 780 

GGAAAGCTAT GTCTGTTCCT CAGACAACTT CTTTAAAAAG GTGGAGTACA CCAAGAATGT 840 

CAATCCCAAC TGGTCTGTCA ACGTAAAAAC ATCTGCCAAT ATGAAAGCCC CCCAGTCCTT 900 

GGCTAGCAGC AACAGTGCAC AGGCCAGGGA GAACAAGGAC TTTGTGCGCC CCAAGCTGGT 960 

T AC CAT CAT C CGCAGTGGGG TGAAG CCTCG GAAGGCTGTG CGTGTGCTTC TGAACAAGAA 1020 

GACAGCCCAC TCTTTTGAGC AAGTCCTCAC TGATATCACA GAAGCCATCA AACTGGAGAC 1080 

CGGGGTTGTC AAAAAACTCT ACACTCTGGA TGGAAAACAG GTAACTTGTC TCCATGATTT 1140 

CTTTGGTGAT GATGATGTGT TTATTGCCTG TGGTCCTGAA AAATTTCGCT ATGCTCAGGA 1200 

TGATTTTTCT CTGGATGAAA ATGAATGCCG AGTCATGAAG GGAAACCCAT CAGCCACAGC 1260 

TGGCCCAAAG GCATCCCCAA CACCTCAGAA GACTTCAGCC AAGAGCCCTG GTCCTATGCG 1320 

CCGAAGCAAG TCTCCAGCTG ACTCAGCAAA CGGAACCTCC AGCAGCCAGC TCTCTACCCC 1380 

CAAGTCTAAG CAGTCTCCCA TCTCTACGCC CACCAGTCCT GGCAGCCTCC GGAAGCACAA 1440 

GGACCTGTAC CTGCCTCTGT CCTTGGATGA CTCGGACTCG CTTGGTGATT CCATGTAAAG 1500 

GAGGGGAGAG TGCTCAGAGT CCAGAGTACA AATCCAAGCC TATCATTGTA GTAGGGTACT 1560 

TCTGCTCAAG TGTCCAACAG GGCTATTGGT GCTTTCAAGT TTTTATTTTG TTGTTGTTGT 1620 

TATTTTGAAA AACACATTGT AATATGTTGG GTTTATTTTC CTGTGATTTC TCCTCTGGGC 1680 

CACTGATCCA CAGTTACCAA TTATGAGAGA TAGATTGATA ACCATCCTTT GGGGCAGCAT 1740 

TCCAGGGATG CAAAATGTGC TAGTCCATGA CCTTTCAATG GAAAGCTTAG GGGCCTGGGG 1800 

TAAATTTGCC CCGTTTAAAT TTGCCCAAAC AGTTTTCCTT TTGTAGAGGG GTGTTTAAAT 1860 

ATACAGCAAT TAAAAAGTTT GTGTGGGGAA AAAAAAAACT CATTGGCAGA TCCAAGAATG 1920 

ACAAACACAA GTGCCCCTTT TCTCTGGATC TCAAGAATGG TGGAGGACCC TGGAAGGACA 1980 

GCAAGGCAGC TCCCCAGCCT CACTCTTCAC TCCTGATTGA GGCCCGGGTT TGTTGTCCAG 2040 

CACCAATTCT GGCTGTCAAT GGGGAGAAAT AAACCAACAA CTTATAATTG TGACACCAGA 2100 

TGCTTAGGAT CCTGGTGCTG GGTTAGCTAA GAGAATAGAC AGAATTGGAA AATACTGCAG 2160 

ACATTTCCGA AGAGTTTATA AAGCACAGTG AATTCCTGGT CAATCTCTCC ACTGAGGCAA 2220 

TTTGGAATCA ATAAGCAATT GATAATAGTT TGGAGTAAGG GACTTCATAT ACCTGATTCC 2280 

TCTAGAAGGC TGTCTAACAT ACCACATGAT TACATGAACT GTATGGTATC CATCTATCTC 2340 

TGTTCTATTG AATGCCTTGT TAACAGCCAA CACTGAAAAC ACTGTGAGAA TTTGTTTTCA 2400 

GGTCTGACAC CTTTCAGTCT CTTTTTATAG CAAGAAATCA ATATCCTTTT TATAAAAATT 2460 

CATGTCTGTA TTTCAGGAGC AAACTCTTCA GGCTCCTTTT TTATAAACTG GTGATTTTTC 2520 

TTTTGTCTAA AAAACACATG AAGAAAATTT ACCAGAAAAA AAAAAAAAAG CCGAAGAATA 2580 

ATGT TATTT A GAAATTATGC TGTCACTGCC AAACAGTAAC CTCCAGGAGA AAACAAGATG 2640 

AATAGCAGAG GCCAATTCAA TAGAATCAGT TTTTTGATAG CTTTTTAACA GTTATGCTTG 2700 

CATTAATAAT TTCAATGTGG ACCAGACATT CTAATTATAT TTTAAATGAA ATGTTACAGC 2760 

ATATTTTAAG CAACTCTTTT TATCTATAAT CCTAATATTT CATACTGAAG ACACAGAAAT 2820 

CTTTCACTTG TCTTTAACAT TAGAAAGGAT TTCTCTTTAC TAAGGACTGA TCATTTGAAA 2880 

TAGTTTTCAG TCTTTTGAGA TACAGGTTTA TAACACTGCT TTTTTTTTCC TGTAAACATA 2 940 

GCCCATAATG GCAAAAACAA CTAATTTTAA TTGAAGGTCT TGCTTGCCAN TCCTGTGTTG 3000 

GCTTTNACCA AATATAAAAA TTCCCTTATT CCTTGGTAAT GGTGCAAATN TTTGGAAAGG 3060 

CACAGCATCC AAACCAAGCT GCTGTTTGGC TACTGAATGG CTTGCAGTTG TTCCTCCACT 3120 

CTAAATGGAA TGAGCTTGCT GTGTGTGTGT GTGGTGGTGG TGGGAGGGGG TGGTGCATGT 3180 

GTGTGTGTGT GTGTGCATCT GCAGCTGCTT CAAAATTAAG AAATACTACA AGACACCCCT 3240 

GTAATGGATT GGTGGCAACT GGGTGGCACT GCTGATGTGC ACTGTGTAGG GGGGAACCCA 3300 

GTGGTGGTGG GGTATCTCAA ATGCCCCTAG ACAAGCTTCA GATGTCTGTA GCTACCAAAA 3360 

ACATTTTCGG TTCAAGAAAA GTGAGATGAT GGTAGTACTG GTTTCTGGTG AAATTGAAAA 3420 

ACCCCAAATG ATGAGGATCT CTTTTTGCCC CCTCTCCTTT TTTTGTAAAC CCATTCAAAA 3480 

CCATTAATAA GCCCATTTTA CTAANCCCCT ATTTCTTTCT AGAAGCTCAG GGTTTNCTTA 3540 

GTGCCTCCCA NAACATTTTG TAGTTAATTG GGAAAAAGTG ATACTTGGAT TAGGGGGTGT 3600 

GGGCATAAAG AATGGTGGGA GGCCTGATTT TAAAATTCAG GCCAGAACCC CCAATGACTC 3660 

CACCCATAGT NTCACTTTAG GTCTCATTTA GTCCATCACC TTTATTTTAA GTTGAGGAAG 3720 

TGGAGGCTGG TAAAGAGCAG GACCAGAGGA AGAATCCAGA TTTCCTTATG CTTGGGCCTC 3780 

ACACTAGCTC TNTGAGTATT TCCTTGATTG CGGTATATGT ACTACTAGAA AATACCAAAT 3840 

GGATATATTT TCTTTAGGAT AACCTTTGAA CCAACAATNT TCAATAACAA TAGTACATCT 3 900 
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TCCATCTTAC TTTTAATCGA GTATAAGGAA ATGTTTCTTT ATGQCCATTT TGGAGGGAGC 3960 

AGGGGATGAG GCTTGGCATA GTCCAAAATT TAAGNCTCCA ATAATTAATT GCATTTTAAA 4020 

TTGTTTTAAA TTGGCCCACT TTCAAGGCAA TTTTTTTTGT GTGTCTGTAA CTGAGCTCCT 4080 

CCACCCCTGT CATTCACTTC CAATTTTACC CAATCCAATT TTAGCACTCA AGTTCCATTG 4140 

TGTTAATTTT TGCACGGTCT ACACACATCA AGTCAGCAAG CATTTGCCAC CACTCCCTAT 4200 

ACTTCTCCCT CTTTTTTACA CACACACACA CACACACACA CACAATCCAT CTCTTGCTTG 4260 

TTCCTACCTC CCTGATTTTT CTTCCCTACA GAAATAGAAA TAGGGACAAA GAAGGGGAAA 4320 

ATGTATATAT TGGGGCTGGG CTGAACAACT AACTTCATAA GTAGTATTAA CTAGGGGTAA 4380 

ATTGAGAGAA AAGCTCCTTT TCTCTTCACT GTTTTGGAAA GGATAGCCAT TAGCATGACT 4 440 

GCTTTGTGTC CTTATGGACT TTAGTATTAG CCTAGATTGA ATTATAGCGT TTTTCTAGCT 4500 

GAAGGAACCT TAAGATCACA TCATCTACTC CTCTACTCCA AATTTCTCAT TCTTCAGGCC 4560 

AGGAAACCGA GACACAGAGG TAAAGTAATT TCCCCAAGGT CACACAGCTG GCTGGGGCAG 4620 

GATTGGGTTT ACAACCCACA TCTCCTGGCT CTTATTCCAG GGCCTTTTCC CACTAAGTAG 4680 

TATTGCCTTC CATTAGGCTC CTGAGAGTTA TTTCTCAGGG TCATGTTGCA TCTTGGAGCC 4740 

ACATGCTGCT GCCCTGATCT CAGTGGGAAA TNCACCCAGC AACCTAATAC AGCCCCTTTT 4 BOO 

CCCTGCATTC ACCTGGTTCC CATCCACATG GGTTGCAGAT GTCCTTGAAG AGAGTGAGGC 4860 

ATTGAGGGCC AATAGGAGCA ATGGGGTCCC TGGCCTTGTC CATCTGATTC AGGAGATCAC 4920 

TGCTCCATCG TGAGGAGCCC TCTGAATAGC CCCCCACTGA ATGCTTGCCT TGCCCAAATG 4980 

GAATGGAGGA AGATTGATTT TCTCCATCAG TTCACCTTGT GTCATCTCAT AATGGTTGGT 5040 

CTTTCCAGGC TGAGGGAAAT GTTTCTTGTT TCCANAGTAN AAAAAAGAAA GAGTGGAACA SI 00 

ATANCTTTGT TCATCCTAAC TTTCTGAGAT GGCTTTTCAA CATTTAAAAA AAACTAGTGT 5160 

GGTACCATTC ACTGGCANGA TTTNTTTTAG AATATGGGAG TAAGATGAGG TAGAGAAAAT 5220 

AACCTGGTCT CACTGTGGTT GCCCTCATCC ACAATGTCCC CAAAGCCATC CTGCTNTGAT 5280 

GAGGACAATT TCCAGGTATA AGCAAGGGGC TTTGTGACAA AAATGTACCC TGGCTGATGT 5340 

TAAACATTGG CTCCTGTGTT TGCACCAAAA TAGCAAGCTG TGTGCTCTAT ACACTCTTCC 5400 

CATCGTCTTG TGTACACTGC TCCTGTGGCC TTCCACAGCA GAAACCAGGG CAAAAGGGTC 5460 

CAAACACATG GTTTTCCTTG CTGCAAGGCT NTTCCTGGGA ACTAAGGGGG TATTTATTAG 5520 

TTCAGTTNTA AGAGACCTCC TTCTGGGCTT ACCCCACTCC TCAGGTACTT CTCTCTCCTT 5580 

CCTCCTTCTC CTCCACAGTC ACAAGTAACC AAGGAACCTG AAAGTGGATG TGTAGCTATT 5640 

TGAAGAAGGC AAGGAACCCT GAGATTCTTC TTTGAATCCT TTAGTCCAAG TCTTAGACCA 5700 

GTGATTGGTG CTTACCTTGA ACAAAATTTT GTCTGTGTTC CTAATCCCTT CAATACTNTG 5760 

GGTACAATGC TCCCAATCAC CCTGCACATT TGATTCTAAA TGGCTTTTAT TTTTTAAAAA 5820 

TCCATATCCC TAGGACAAGA NAACAGGATG CCTATATCCC CAAAATGAGC TCCAGGACAC 5880 

TGATGGGAAT GATCCCAANG ATCACCCCAC CTCAGAAAAC GTCTGTGCCA ANAGACTTCC 5940 

CCAGATAGAA MCACTGGGAC AGTGGTTTGA ACGACTTCTT TTATGGTTGT CCAGTTTGCT 6000 

ATGGAAATAA AAGG CATTGA TTTTTTAAAA AAGATGATTG GAACCTGTCT TTGGCCACAT 6060 
AGGGCCACTT GGATCCATTT CCAGGCCTTA CTCATATATT GCCTTCACTG AAGGGCTTTG .6120 

GCTTTAAGTC CCAGACTGGT CTCCCAAGTG AACCATAAGT GTTTTGGAGC TCATCTGGGG 6180 

7GAGGCATGA GAATGTTGCC CCATCTATCC CTTCAGGAAA AGGTGCCTTC CCTCCCTTTC 6240 

TCCTAAAGCC TGGTCCCCAA AAATTGTTTT TGTCTCCAAA AGTCTAGTAT GGTCTTTATA 6300 

CACCCANACT CTTAGTGTTG CGTCCTGCCT TGTTTCCTTG TTAAGGATCT ATGCANACCT 6360 

CCCGCTTTGG CTTAGCTAGC GTGACATTGG CTATCATTTG ACAAGACTAA CTTTTTTTTT 6420 

TTTTTTTTTG ACTGAGTCTC CCTCTGTCAC CTAGGCTGGA GTGCAGTGGC ACAATCTTGG 6480 

CTCGCTGCAA CCTTCACCCT TCACCTCCCA GGTCGAAGCG ATTCTCCTGC CTCAGTCTCC 6540 

CGAGTAGCTG GGATTACAGG CGTGCGCCAC CAAATCTGGC TATTTTTTTA TTATTATTAT 6600 

TTTTAGTAGA GATGGGGTTT CACCATGTTG GCCAGACTGG TCTTGAACTC TTGGCCTCAA 6660 

ATTATCTGCC CACCTCGGCC TCCCAAAGTG CTGGGATTAC AGGCATGAGC ACCATGCCCA 6720 

GCTGACAAGA CTAATTTTTT ATCCCTTGGT TTATTGGCTT CAACATCTTC TGGAATCAGA 6780 

GGTGATTTTT TCTTACCTTG GATGCCTGAG ACTAGGGGAG TATAGAATTC CAATTGGTAA 6840 

TT AAGG CAT C TTTCTGCTCC TGATCAGAAG GGCAGGTTAG TTGGGAGAGG TCAGATGGCA 6900 

CAACAGAAGT CACCTTGTAA GTAAGGCAAA GACTTTGAAG GCATTAGCGT TTCTCATTAC 6960 

TTAGGTCAAT AACCTTGAGG GAATCAATGG CTTTTTTGCC GCTCTACCTC TTTGTGTATC 7020 

TCTTTGACTT TTCTTTCTCT GTCTAGTTTC CTCTGTTCTC AGTTTATATT CTATGTTATC 7080 

AGTCTCTCTT TCCACAGTAC AAACATCCAT CCTTTCTCCT GTGCAATTCT GTCTCTCCCT 7140 

CTTATTATCT TTATTTGTAC TTTTTCCTTC CTCCCTGTCT AGGCATTGGG CATGTGCCTC 7200 

TTCTTAGCCT GTGATTTTGC CTTGGGACTG ATGATAAATT ATTTCCAGAT TCAATCAGCC 7260 

CTGGTCCTAC CCCAGTCCAA TCAGAAGTAT GTTGGTGGGG AATCAACCTG ATCCTGGCCC 7320 

TTTCTTCTTC TCCATTTTCA TTCGTAATCC CCCTCAGCAG ATCTTTACAA GCAGTTTCCT 7380 

TAT AG CT CAT GTATCTTTAG GTCTTTGCCT TCCAAGCACT GTACAGAATA CTTTGTGGTT 7440 

CCTTTTTAGT CTGACATTTT GTGGAGCAGT GAAGCGTGCT CAGAGACATA ATCAGCTGAA 7500 

GAGAAAAAAT CCACCCATGG ATTT AT AT CA GCTAAATACT AATAATTGAT TTTGTTTGAT 7560 

GTGCCCATAA TTTTTAAAGC TGCAATATAA TATAATGAGG GACCACAGGT AATTTCTCCT 7620 

GTCATTTGTT TTGGCTGGAT GGGGGTGGGG GAGTAATTGC TTAAAGTTTT ACCATTACAC 7680 

ATTAAACTCT CTATAATAAT CTTGTTTGGG GCTTGCTAAC TGTTGAGCTG TTTTAACTAA 7740 

ACTGGTAGGC AATCGGAGTT GATTTAAATG AAAAGATAAT TTAACAAATC TATACTATAA 7800 

AAAGAGACAT TTGCTTAATT GACATGTATT TTTTCCTTCT GAGTCACCTA AACATTTACT 7860 

CTTGACACCA ACTGTTCATG ATACTGAATA GACAGTCCAT ATAAGAGAAA TTAGTGGACC 7920 

TAAAGAAGCC AGATTGTAGG TGTTAATTTA TTAAACAGAA TTGCAAAGCC CTTGGAAATG 7980 

TCACTGCTTG GCAATACCAT ATGGCATGCC AAAATTTACA ATGACTTTTC TTTATAAGTT B040 

ATCCAAAAGG GATTTGAACA AGTAAGAGGT TATGCCAAAA TGTCTCCAAT GTATGGTCCT 8100 

GTAATATATT GCAGCTTGAA GCCAATGATC CCTTATGACT TGTATACAAC TAATGCATGT 8160 

TTTATTGAAT TTTGCATTTC CCACGTGTGG TAAGTCTTTA AAATGTTTTT GATCACCTTT 8220 

NTGTGCCATT AAACTTGTAC AGAAAATGTT TTTATGGCCA TTTTCAAAGG GAGAAAGTTT 8280 

AAAATGGAAA CAGCCCACCC TTTCTGCCCT ATAGCTGTAG TTAGAATTGA GTACCTGTAG 8340 

CAAAACAGCT GTAATTGGTG GTTGTAGTGT TAGAGGTGTT AGCTTGCTAG TGACTAGCTT 8400 

TGGAGAGTAA ATGCATGGTA TTGTACATCA CATTTCTTAA CTCGTTTTAA CCTCTGAAAA 8460 

GAATATATTC TTCTTTGTAG TCCTTCTTCC CACCCCCTTG CCCTCTCCCT CTCCCTGCTC 8520 

CCAGTTGTCT TACAGTTGTA AATATCTGAT TTGAGGCCCA ATAACTCTTG CCAAGTAAAG 8580 

TCAGCAAACA ACAAACAAAC CAAAATGTGG GGAAAAGGCA TTTCTCAACC ATCTCTCAGC 8640 

AGTTATTGAT CATTTCTTAA GGAACAGCAT TGTGATCAAA GACTCAACTT TACGTAAAAA 8700 

TCAGTGGTAA ATTGGGGTTG TATTGGCCAT TGATTACATT CAGGATTGAA TAGTTTTCAG 8760 

AATCACATGT AATCCAAAGA CAGTAGGTAG TGATGTCCCT TATCCCTGCA GCTGTTTTAA 8820 

GATAGAGACC TCAGAAGACT CTGCTTGACC GATGACCAAT AATTATTTGA AAAAAAAAGA 8880 

AAAAATGAGA GAAATAAAAC AGATATTTAA GAACTTTAGC CACCTATTTA GAATAGTTAT 8940 

AGCCAGAAAA AAAAACAAGG GCATGAGTTC AAATGCATTA CTATCAGTGT CCTAGGCAAT 9000 

ACCTAACCTA CTCTGAAATT GTGATTCAAA AGCAGTATTT CAAGAGGCAT TCTCCTTTTT 9060 

TGGTTTGCTG ACCCCACTTG GACTGGTAGG TTTGGTGAGG CCCCCATAAA CCAGCTGGAG 9120 
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CAGACCCTTT TCATCTCCTG 
AATGAGGGCT TTCTTGGGTC 
TAAGGTGCTG TGAACTGTGA 
AGCTGCTCTT GCAATTATTT 
TGTACATTTG CAAGATGTGT 



TGCCTGTAAC ACCCCTCTTC CCCCACCCCC TCCGCAATTC 
AGAGGACTTC AAGGTTGTCT AGAGAAGTTT GCCATGTGTG 
GTGCTGAAGA TTCGCAGCAT TCAATACCAQ GCAGCCAAAG 
TGGCTCTCAA GCTCTGTTCT TCATCGCATT CTCATTTCTG 
GTAATGTCAT TTTCCAAAAA TAAAATTTGA TTTCAAT 



seq id NO: 214 Protein sequence: 
Protein Accession #: NP_000546 



1 11 21 

I I I 

MELDFGHFDE RDKTSRNMRG SRMNGLPSPT 
GDRYFKGIVY AVSSDRFRSF DALLADLTRS 
EGESYVCSSD NFFKKVEYTK NVNPNWSVNV 
LVTIIRSGVK PRKAVRVLLN KKTAHSPEQV 
DFFGDDDVFI ACX3PEKFRYA QDDPSLDENE 
MRRSKSPADS ANGTSSSQLS TPKSKQSPIS 



31 
I 

HSAHCSFYRT 
LSDNINLPQG 
KTSANMKAPQ 
LTD I TEA I KL 
CRVMKGNPSA 
TPTSPGSLRK 



41 

I 

RTLQALSNEK 
VRYIYTIDGS 
SLASSNSAQA 
ETGWKKLYT 
TAGPKASPTP 
HKDLYLPLSL 



51 
I 

KAKKVRFYRN 
RKIGSMDELE 
RENKDFVRPK 
LDGKQVTCLH 
QKTSAKSPGP 
DDSDSLGDSM 



Seq ID NO: 215 DNA sequence 
Nucleic Acid Accession NM_130467 
Coding sequence: 312.. 644 



1 
I 

GGCACGAGGC 
CTTTCCAACA 
GTCTTCCTGG 
TCCTGTGGCA 
CCCAGGTCGT 
AAGTGAGAGA 
AAGAGTCTTC 
AAGAAGAGGA 
AAGGAGCACC 
TTAAGATAGA 
TTGATCCCAC 
ATGAAGACTG 
TAATAAAGTT 



11 
1 

AGAGCTCTGC 
TCTTCGTTCT 
TAATTTAGTT 
CAGTCCGTGG 
GATGCAGGCG 
TATGAGTGAG 
CCAGCCAGTT 
ACCACCAACT 
TGCTGTTCAA 
GGATGCACCT 
TAAAGTGCTG 
AAACCAAGAA 
TTACAGTTTT 



21 
I 

AAGGAGAGGT 
TTCTCACTGA 
GTGAGTGAAT 
CTTTGAGGGA 
CCATGGGCCG 
CATGTAACAA 
GGACCTGTGA 
GATAATCAGG 
GGGACTGATG 
GGAGATGGTC 
GAAGCAGGTG 
TATTGTTCTT 
CTGCAAAAAA 



31 
I 

TGTGTCTTCG 
CCGAGACTCA 
GTGTGGAGGA 
AAAGGGCCTC 
GTAATCGTGG 
GATCCCAATC 
TTGTCCAGCA 
GTATTGCACC 
TGGAAGCTTT 
CTGATGTCAG 
AAGGGCAACT 
ATGCTGGAAA 
AAAAAAAAAA 



41 
I 

TTCTTTCCGC 
GCCGGTAGGT 
GCCAGCGGGC 
GCGGTGGTCC 
CTGGGCTGGA 
CTCAGAAAGA 
GCCCACTGAG 
TAGTGGGGAG 
TCAACAGGAA 
GGAGGGGACT 
ATAGGTTTAA 
TTTGACTGCT 



CTGCAGAGTG 
TTAGGACAGG 
TCCGCCTTCC 
ACGAGGGAGG 
GGAAATGACC 
GAAAAACGTC 
ATCAAAAATG 
CTGGCTCTGC 
CTGCCCACTT 
ACCAAGACAA 
AACATTCTCT 



9180 
9240 
9300 
9360 



PCT/US02/12476 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO i 216 Protein sequence: 
45 Protein Accession #: NP_569734 

1 11 21 31 41 51 

I I I 1 I I 

MSEHVTRSQS SERGNDQESS QPVGPVIVQQ PTEEKRQEEE PPTDNQGIAP SGEIKNEGAP 
50 AVQGTDVEAF QQELALLKIE DAPGDGPDVR EGTIjPTPDPT KVLEAGEGQL 



55 



65 
70 
75 
80 



Seq ID NO: 217 DNA sequence 

Nucleic Acid Accession #: NM_0 014 76.1 

Coding sequence: 82.. 43 5 



1 11 21 31 41 51 

I I I I I 1 

GCCAGGGAGC TGTGAGGCAG TGCTGTGTGG TTCCTGCCGT CCGGACTCTT TTTCCTCTAC 60 

TGAGATTCAT CTGTGTGAAA TATGAGTTGG CGAGGAAGAT CGACCTATTA TTGGCCTAGA 120 

60 CCAAGGCGCT ATCTACAGCC TCCTGAAGTG ATTGGGCCTA TGCGGCCCGA GCAGTTCAGT 160 

GATGAAGTGG AACCAGCAAC ACCTGAAGAA GGGGAACCAG CAACTCAACG TCAGGATCCT 240 

GCAGCTGCTC AGGAGGGAGA GGATGAGGGA GCATCTGCAG GTCAAGGGCC GAAGCCTGAA 300 

GCTGATAGCC AGGAACAGGG TCACCCACAG ACTGGGTGTG AGTGTGAAGA TGGTCCTGAT 360 

GGGCAGGAGG TGGACCCGCC AAATCCAGAG GAGGTGAAAA CGCCTGAAGA AGGTGAAAAG 420 

CAATCACAGT GTTAAAAGAA GACACGTTGA AATGATGCAG GCTGCTCCTA TGTTGGAAAT 4 SO 
TTGTTCATTA AAATTCTCCC AATAAAGCTT TACAGCCTTC TGCAAAA 



Seq ID NOt 218 Protein sequence: 
Protein Accession #: NP_001467.1 

1 11 21 31 41 51 

I I I I I I 

MSWRGRSTYY WPRPRRYVQP PEVIGPMRPE QPSDEVEPAT PEEGEPATQR QDPAAAQEGE 60 
DEGASAGQGP KPEADSQEQG HPQTGCECED GPDGQEVDPP NPEEVKTPEE GEKQSQC 

Seq ID NO: 219 DNA sequence 
Nucleic Acid Accession #: NM_001476 
Coding sequence: 90-3671 



11 21 31 41 51 

1 I 1 I I I 

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

85 ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 
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CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTOACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAG CTG TTACTGGAGA ACGCTGTGAT AGGTGTCQAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 12 60 

AATTCTGCCA GGATTGTGCT TCTGG CTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 

G CACCTGT AT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTT CTACA 1440 

ACGATCCGCA cgacccccgc AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 

AGAATGGAAA AAGTGGGAGA GAGAAATCAG AT CAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG V TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGA QATT GC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCT CTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4 920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seg ID NO: 220 Protein sequence) 
Protein Accession #:NP__005553 

1 li 21 31 41 51 

I I I I.I I 
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MP AliWLG C CL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD 
DQIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG 
HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA 
PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA 
SSAQRLDPVY PVAPAKFLGN QQVSYGQSLS FDYRVDRGGR 
MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL 
DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY 
GGACDPDTGD CYSGDENPDI ECADCP1GFY NDPHDPRSCK 
CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN 
CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS 
NCEHGAFSCP ACYKQVKIQM DQFMQQLQRM EAL1SKAQGG 
QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM 
RLITQMQLSL AESEASLGNT N I PASDHYVG PNGFKSLAQE 
ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL 
ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS 
NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM 
QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA 
IEQEIGSLNL. EANVTADGAL AMEKGLASLK SEMREVEGEL 
AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPIjSVDEEGI* 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL 

Seg ID NO: 221 DNA sequence 
Nucleic Acid Accession #: NM_016529 
Coding sequence t 13-1854 



1 
I 

GTCAAGAAAA 
AAAGGGGCTG 
ACATTATGCC 
GCTGATCTCT 
ATATTGAAGG 
CTGCTACTTG 
GCAACACTGT 
GCGATTAATA 
AAGGAGGACT 
AATTTGCTGG 
GCGCTCTCCT 
ATATGCTGCA 
GTGAAGGCCA 
GCCCACGTGG 
TACGCCATCG 
TACAACCGGG 
ATTGAGCTTT 
TGCATCGGCC 
GAGAGGTCTT 
AATGGGQAAG 
TCCCTCATCC 
GGTCATGCTA 
GTTTGTCTGA 
TGGGGAAGCA 
ATTCCCATTG 
TGGTTGGGAT 
GCCAAGCACA 
CGAGTCCTGG 
CGCCTGATCA 
CAGCAGGGCG 
GAAGAAGTCA 
AATTTTCCTG 
TTTGTCAGAG 
AGTTAAGCAG 
AGCTATCTTT 
ATGAAGCATT 
AGAAAAAAAA 



11 
1 

GAATGTCTGT 
ATAATGTGAT 
ATCTGGAATA 
CTGAGAATGA 
ACAGAGCTCA 
GAGCCACAGC 
TGAAGGCAGA 
TAGGGTATTC 
CTTTGGATGC 
GCAAGGAAAA 
TCGAAGTCCG 
GAGTGTCTCC 
TCACCCTCGC 
GTGTGGGAAT 
CACAGTTTTC 
TGACCAAGTG 
GGTTCGCCTT 
TGTACAATGT 
GCACTCAGGA 
GCTTCAACAC 
TCTTCTGGTT 
CCGACTATTT 
AAGCTGGTTT 
TGCTGACCTG 
CTCCAGATAT 
TATTTCTGGT 
CCTGCAAAAA 
GAAAAGCGGT 
AGAGGCTGGG 
TCCCGCATGG 
TCCGTGCTTA 
ACTGATCTTA 
AAGACTGGGG 
TTTGTTAGTT 
GCCCTCCCAA 
CAACTGTGCT 
AAAAAAA 



21 
1 

AATTGTTCGA 
TTTTGAGAGA 
CTTTGCCACG 
GTATGAGGAG 
ACGGTTGGAA 
CATAGAAGAT 
AATTAAAATA 
CTGCCGATTG 
CACAAGGGCA 
TGACGTGGCC 
GAGGAGTTTC 
TCTGCAGAAG 
CATCGGAGAC 
CAGTGGGAAT 
CTACTTAGAG 
CATCTTGTAC 
TGTTAATGGA 
GATXTTCACC 
GAGCATGCTC 
AAAGGTTTTC 
TCCCATGAAA 
ATTTGTTGGA 
GGAGACCACA 
GCTGGTGTTT 
GAGAGGACAG 
TCCTACTGCC 
GACATTGCTG 
GCTGCGGGAT 
CCGGAAGACG 
GTATGCTTTT 
TGACACCACC 
GGAAAGAGAT 
TCCAAGGCCA 
ACATATTCCC 
CTCGTCTGCA 
CTGTGAGGTC 




GCTTTGCCGC 
AGGTTTCCCC 
TGGGGTCACT 
GCTCTGGAGC 
AATATTGTTT 
GCTTGGACTA 
TTTGGCATCT 
GCAACTATGG 
TGTTTGATTG 
GAGGAGGTGC 
AGCAATGGAA 
CCCCCGACGC 
TCTCAAGAAG 
AAAAAGAAAT 
TCAGTTTGTT 
AAACACCAGG 
TCGCAAACCT 
GTGCTTAGCC 
TCAAATTAAA 



RELHRQTGNG 
RCSCKPGVTG 
VTGERCDRCR 
VQRNGSPAKL 
HPSAHDV I IjE 
RNLTALRIRA 
KRDSARLGPF 
PCPCHNGFSC 
NNVDPSASGN 
EPVGCRSDGT 
DGWPDTEU3 
TVERVRALGS 
ATRliAESHVE 
EKTKSLAQQL 
LSTLVTRHMD 
GNATFYEVES 
AADAQRAXNG 
ERKELEFDTN 
VLLEQKLSRA 
PPGCYNTQAIi 



41 

I 

GACGACTTCG 
ACTCAAAATA 
GGACTCTCTG 
TCTATCAGGA 
AG AT CATTGA 
CAGGAGTTCC 
CAGGAGACAA 
ATATGGCCCT 
AGCACTGCAC 
ATGGCCACAC 
CACTCTCX3TG 
TGGATGTGGT 
ATGTCGGGAT 
AGGCCACCAA 
TGGTTCATGG 
AGAACGTGGT 
AGATTTTATT 
CCTTCACTCT 
AGCTCTACAA 
GCATCAACGC 
ATGATACTGT 
ACACATATGT 
AATTCAGTCA 
ACTCGACCAT 
TCCTGAGCTC 
AAGATGTGGC 
AGGAGCTGGA 
AGAGGCTGAA 
TGTTCCGGGG 
AACACGGAGC 
CCAGGAAGAA 
GCACCCAGTG 
AAACACATTT 
GGAGTGCAGA 
TAACTTTTGT 
AACATTATGT 



FRCLNCNDNT 
ARCDRCLPGF 
SGYYNLDGGN 
QWSQRHQDVF 
GAGLRITAPL 
TYGEYSTGYI 
GTCIPCNCQG 
SVMPETEEW 
CDRLTGRCXiK 
CVCKPGFGGP 
GRMQQAEQAL 
QYQNRVRDTH 
SASNMEQLTR 
TREATQAEIE 
EFKRTQKNLG 
ILKNLREFDL 
AGEAliEISSE 
MDAVQMVITE 
KTQINSQLRP 
EQQ 



SI 
I 

GCTTTACTGT 

TATGGAGGAA 

TGTGGCTTAT 

AGCCAGCACC 

GAAGAATTTG 

AGAAACCATC 

ACAAGAAACT 

TATCCTATTG 

TGACCTTGGG 

CCTGAAGTAC 

CAAAGCGGTC 

GAAGAAGCGG . 

GATCCAGACA 

CAACT CGGAT 

AGCCTGGAGC 

CCTGTATATT 

TGAACGTTGG 

GGGAATCTTT 

AATCACCCAG 

CTTGGTCCAC 

GTTTGACAGT 

TGTTGTTACT 

TCTGGCTGTC 

CTGGCCCACC 

CGCACACTTC 

ATGGAGAGCA 

AACCAAGTCT 

CGAGCGCGAC 

CAGCTCCCTG 

TGTTAGTCAG 

ATAAGACATG 

TTAACACATC 

CTGTGGCCTT 

CCACAGGGGA 

TTATGTCGTT 

TTCACCAATA 



9eq ID NO: 222 Protein sequence t 
Protein Accession #» NP_057613 



1 
I 

MSVIVRTPSG 
ENEYEEWLKV 
KAEIKIWVLT 
KENDVADI ID 
TLAIGDGAND 
TKCILYCFYK 
TQESMLRFPQ 
DYtiFVGNIVY 
PDMRGQATMV 
KAVLRDSNGK 
RAYDTTKKKS 



11 
I 

RIiRLYCKGAD 
YQEASTILKD 
GDXQETAINI 
GHTLKYALSF 
VGMIQTAHVG 
NWIjYIIEIjW 
LYKITQNGEG 
TYVWTVCLK 
LSSAHFWLGL 
RLNERDRLrIK 
RKK 



21 

I 

NVIFERLSKD 
RAQRLEECYE 
GYSCRLVSQN 
EVRRSFLDLA 
VGISGNBGMQ 
FAFVNGFSGQ 
FNTKVFWGHC 
AGIjETTAWTK 
FLVPTACLIE 
RLGRKTPPTL 



31 
I 

SKYMEETLCH 
IIEKNLLLLG 
MALIIiLKEDS 
LSCKAVICCR 
ATNNSDYAIA 
ILFERWCIGL 
INALVHSLIL 
FSHLAVWGSM 
DVAWRAAKHT 
FRGSSLQQGV 



41 

I 

LEYFATEGLR 
ATAIEDRLQA 
LDATRAAITQ 
VSPLQKSEIV 
QFSYLEKLrLL 
YNVIFTAIjPP 
FWFPMKALEH 
LTWLVFFGIY 
CKKTXrLEEVQ 



51 
I 

TLCVAYADLS 
GVPETIATLL 
HCTDLGNLLG 
DWKKRVKAI 
VHGAWSYNRV 
FTLGIFERSC 
DTVFDSGHAT 
STIWPTIPIA 
ELETKSRVLG 
HGAVSQEEVI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO i 223 DNA sequence 
Nucleic Acid Accession #: BC017001 
Coding sequence: 1-394 



11 



21 



31 



51 



275 



60 



WO 02/086443 

| | I I I I 

AACGCTGGGC AGGGCCGGCG CQGGTCGGGQ QGCGCCCGAQ GGGCCOGGGC CGAGCGGCGG 

CGCGCAGGGC GGCAGCATCC ACTCGGGCCG CATCGCCGCG GTGCACAACG TGCCGCTGAG 120 

CGTGCTCATC CGGCCGCTGC CGTCCGTGTT GGACCCCGCC AAGGTGCAGA GCCTCGTGGA 180 

CACGATCCGG GAGGACCCAG ACAGCGTGCC CCCCATCGAT GTCCTCTGGA TCAAAGGGGC 240 

CCAGGGAGGT GACTACTTCT ACTCCTTTGG GGGCTGCCAC CGCTACGCGG CCTACCAGCA 300 

ACTGCAGCGA GAGACCATCC CCGCCAAGCT TGTCCAGTCC ACTCTCTCAG ACCTAAGGGT 360 

GTACCTGGGA GCATCCACAC CAGACTTGCA GTAGCAGCCT CCTTGGCACC TGCTGCCACC 420 

TTCAAGAGCC CAGAAGACAC ACCTGGCCTC CAGCAGGCTG GGCCATGCAG AAGGGATAGC 480 

AGGGGTGCAT TCTCTTTGCA CCTGGOGAGA GGGTCTGACT CTGGGCACCC CTCTCACCGG 540 

CTACAAGGCC TTGGACTCAC TGTACAGTGT GGGAGCCCCA GTTCCCACCT CTGTGACAAT 600 

AGGATCATGG CCTTACCCTT GAAGCATTAC CGAGAAGGAG AACAGAGATG GGCTTGAAGA 660 

GCCACGTGCT GCCGGCTCCA AATTCCCAAG GACAAGGATC CCTCTGCATT TTTGTCTATG 720 

TAACCTCTTA TATGGACTAC ATT CAGCTGC AAGGAAAGGA AAACCTTGAT TGCAGTGGTT 780 

TAAACAAACA GAAGATTGTT TTTCCACATA GCATGGATTC TGGAGATGGG TGGCTAATGG 840 

TATTGGTTCA ACAACTCCAC GGAGGTAGGG GTCACGTCTT GGATCCTTTT GCCTTAATCT 900 

CAGTGCTCGT TACTTCATGG TCCCAAGATG GCTGCTGTAT CCCCAAGAAT CATGTCTGCG 960 

TTCAAGGAAG GAGGGGTGGA GGAAGAGGAA GGGCCAAACT AGCTGGACCC GTCACCTTCT 1020 

ATCAGAAAGT AAAACCTCGT CAGAAGTCTG TTTCCTGCTC TCTCCCTCTG CATATCTTCA 1080 

CTTAGATGCC CTTGGCCCGA GCCAGCTACC ATTGCACCTC TAGCTGCAAA CAAAGCTAAG 1140 

ACAGCAGGGA ACAGAATTGT CATGGCTGAA TAGACCAATC GTGTTCCATC TACTGAGACT 1200 

GGCACACTGC CTCCTGCAAT AAAACTGGGA TCCCATTACC AAGAGAGAAA TGCAGAATTG 1260 

TGTACCAGTT AGCTTTTGCT GTGTAACAAA CCATCCCCAA ACTTGGCAGC TAGAAACAAA 1320 

CCCTGTATTT TCCCACAATC CTATGGGTTG GCAATTTGGG CTGGGCTCAA CAGGGCAGTT 1380 

CTGCTGCTCA CACCTGGGAT CCCTCATGGA GCTAAGGTCA GCTGTTACCT CAGCTGGGCC 1440 

TGGATGGTCT AGGATAGCCT TACTCACTTG CCTGGCAGGT GACAGGCTGT TGGCTGGAAT 1500 

TGCTTGGTTC TCCTCCATGT GGCCTCTCCA GCAGGCTAGC TCAGGCTTAT TCACATGATG 1560 

GCTTCAGGAT TCCAAAGAGA GTGAGAGTAG AAGCTGAAAG ACTTCTTGAG TTCTTGGCCT 1620 

GGAACTGGGA CTAGGACAGT GTCACTTCTG CTAAGTTCTT TTGGTCAGAG CAAATCACAA 1680 

GGCTTTACCC AGATTCAAGG GATGAGAAAC AGACTACATG TCTTGATGAG GGGAACCACA 1740 

AAG AG CTTGT GGCCATTTTT CACCTATCAC AAATAATTTT GGATGGGTAT TTATTTGGAT 1800 

AAAGGTATTT CCCTCTTCCC CCTTTCTCTC TGTCTCATGG GGCCTCACTC TGCCAAGTTG I860 

GAAGGCACTA AGACATTGTC CTGGCCCTCA GGGTCTAGGG GAAGAGGTGT TGGGGCAGGA 1920 

AGTGAGTCTC TCCATGGGCT GGACCCACTG TAGTAGGAGT GCCTCCTTGT CTGCACTGCT 1980 

GGTATGGGGT TAGGCCAGGT AGGACATTCC AGAGGGGCTT CTGAAAACCA AGAGTCCCTG 2040 

GGGAAAGGGA ACAGAGTAAG GCAGGCCTTG TTCTCACTGC CCTCTAAGGG AACTTGGTCA 2100 

CTCGGCACTT TTAAGCCTCA GTTTCTCCAG TTCAATAATA AGGACAAGAG CTTTTCCCAT 2160 

GCATTCTCTT TCCCCGGGAA AGTTGACTGA GGTGACCAGT AATAGAATTG AAAAGGGAGA 2220 

GTGTCTTCAG TGCAATGTGG CATCCTGGAT TGGGTCTTGG AACAAAAACA GGACATTAGT 2280 

GGGAAAATTG GAAAT CTGAA AAAAGTCTGA ATTTTAGTTA ATATACCAAT TTCAGTCTCT. 2340 

TGGTTTTGAC AGATGTACCA TGGTGATGTA AGATGTTGAC CTTGGGGTAG GCTGGGTGAA 2400 

GGGTATACAG GAACTCTTTG TACTATCTCT GCAACTTCTC TGTAAATCTA GTATCATTCC 2460 
AAAATAAAAG TTTATTTAAT TTAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 224 Protein sequence! 
Protein Accession 8: AAH17001.1 

1 11 21 31 41 51 

l)ltll 

TLGRAGAGRG APEGPGPSGG AQGGSIHSGR IAAVHNVPLS VLIRPLPSVL DPAKVQSLVD 60 

TIREDPDSVP PIDVLWIKGA QGGDYFYSFG GCHRYAAYQQ LQRETIPAKL VQSTLSDLRV 120 
YLGASTPDLQ 

Seq ZD NOt 225 DNA sequence 
Nucleic Acid Accession #s NM_021048 
Coding sequence : 1 . . 1110 



1 11 21 31 41 51 

ATGCCTCGAG CTCCAAAGCG TCAGCGCTGC ATGCCTGAAG AAGATCTTCA ATCCCAAAGT 60 

GAGACACAGG GCCTCX3AGGG TGCACAGGCT CCCCTGGCTG TGGAGGAGGA TGCTTCATCA 120 

TCCACTTCCA CCAGCTCCTC TTTTCCATCC TCTTTTCCCT CCTCCTCCTC TTCCTCCTCC 180 

TCCTCCTGCT ATCCTCTAAT ACCAAGCACC CCAGAGGAGG TTTCTGCTGA TGATGAGACA 240 

CCAAATCCTC CCCAGAGTGC TCAGATAGCC TGCTCCTCCC CCTCGGTCGT TGCTTCCCTT 300 

CCATTAGATC AATCTGATGA GGGCTCCAGC AGCCAAAAGG AGGAGAGTCC AAGCACCCTA 360 

CAGGTCCTGC CAGACAGTGA GTCTTTACCC AGAAGTGAGA TAGATGAAAA GGTGACTGAT 420 

TTGGTGCAGT TTCTGCTCTT CAAGTATCAA ATGAAGGAGC CGATCACAAA GGCAGAAATA 480 

CTGGAGAGTG TCATAAAAAA TTATGAAGAC CACTTCCCTT TGTTGTTTAG TGAAGCCTCC 540 

GAGTGCATGC TGCTGGTCTT TGGCATTGAT GTAAAGGAAG TGGATCCCAC TGGCCACTCC 600 

TTTGTCCTTG TCACCTCCCT GGGCCTCACC TATGATGGGA TGCTGAGTGA TGTCCAGAGC 660 

ATGCCCAAGA CTGGCATTCT CATACTTATC CTAAGCATAA TCTTCATAGA GGGCTACTGC 720 

ACCCCTGAGG AGGTCATCTG GGAAGCACTG AATATGATGG GGCTGTATGA TGGGATGGAG 780 

CACGTCATTT ATGGGGAGCC CAGGAAGCTG CTCACCCAAG ATTGGGTGCA GGAAAACTAC 840 

CTGGAGTACC GGCAGGTGCC TGGCAGTGAT CCTGCACGGT ATGAGTTTCT GTGGGGTCCA 900 

AGGGCTCATG CTGAAATTAG GAAGATGAGT CTCCTGAAAT TTTTGGCCAA GGTAAATGGG 960 

AGTGATCCAA GATCCTTCCC ACTGTGGTAT GAGGAGGCTT TGAAAGATGA GGAAGAGAGA 1020 

GCCCAGGACA GAATTGCCAC CACAGATGAT ACTACTGCCA TGGCCAGTGC AAGTTCTAGC 1080 
GCTACAGGTA GCTTCTCCTA CCCTGAATAA 

Seq ID NO: 226 Protein sequences 
Protein Accession #: NP_0663 86 

1 11 21 31 41 51 

MPRAPKRQRC MPEEDLQSQS ETQGLEGAQA PIAVEEDASS STSTSSSFPS SPPSSSSSSS 60 



276 



5 
10 
15 
20 
25 
30 
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75 
80 
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WO 02/086443 

SSCYPLIPST PEEVSADDET 
QVLPDSESLP RSEIDEKVTD 
ECMLLVFGID VKEVDPTGHS 
TPEEVIWEAL NMMGLYDGME 
RAHAEIRKMS LLKFLAKVNG 
ATGSFSYPE 



PCT/US02/12476 



PNPPQSAQIA CSSPSWASL PLDQSDEGSS SQKEESPSTL 
LVQFLLFKYQ MKEPITKAEI LESVIKNYED HFPLLFSEAS 

FVIjVTSLGIiT ydgmlsdvqs mpktgilili lsiipiegyc 
HLIYGEPRKL LTQDWVQENY LEYRQVPGSD PARYEFLWGP 
SDPRSFPLWY EEALKDEEER AQDRIATTDD TTAMASASSS 



Seq ID NO: 227 DNA sequence 

Nucleic Acid Accession tt: NM_005025.1 

Coding sequence : 82-1314 



GCGGAGCACA 
GAGGCTTGAA 
AGTATGGCTA 
TATAATCGTC 
GCTCTTGCAA 
CACTCAATGG 
TCAAACATGG 
GTGCAAAATG 
GCAGCAGTAA 
TGGGTGGAGA 
GCTGCCACTT 
TTTAGGCCTG 
ATTCCAATGA 
GAAGCTGGTG 
ATGCTGGTGC 
CAGCTGGTTG 
AGGTTCACAG 
GAAATTTTCA 
TCCAAAGCAA 
GTCTCAGGAA 
CAT CCATTT T 
GTCATGCATC 
TTATTTGAAT 
TAGGATTTGT 
AATATATGTA 
TGTTATGTCA 



11 
I 

GTCCGCCGAG 
ACTGTTACAA 
CAGGGGCCAC 
TTAGAGCCAC 
TGGGAATGAT 
GATATGACAG 
TAACTGCTAA 
GATTTCATGT 
ATCATGTGGA 
ATAACACAAA 
ATCTGGCCCT 
AAAATACTAG 
TGTATCAGCA 
GTATCTACCA 
TGTCCAGACA 
AAGAATGGGC 
TGGAACAGGA 
TCAAAGATGC 
TTCACAAGTC 
TGATTGCAAT 
TCTTTCTTAT 
CTGAAACAAT 
AACAAGGAAA 
GTTTTACAGT 
AATTATAAGT 

TTGTGTTTGT 



21 
I 

CACAAGCTCC 
TATGGCTTTC 
TTTCCCTGAG 
TGGTGAAGAT 
GGAACTTGGG 
CCTAAAAAAT 
AGAGAGCCAA 
CAATGAGGAG 
CTTCAGTCAA 
CAATCTGGTG 
CATTAATGCT 
AACCTTTTCT 
AGGAGAATTT 
AGTCCTAGAA 
GGAAGTTCCT 
AAACTCTGTG 
AATTGATTTA 
AAATTTGACA 
CTTCCTAGAG 
TAGTAGGATG 
CAGAAACAGG 
GAACACAAGT 
ACAGTAACTA 
ATATCTTAAG 
AACTTGTCAA 



31 
I 

AG CAT CCCGT 
CTTGGACTCT 
GAAGCCATTG 
GAAAATATTC 
GCCCAAGGAT 
GGTGAAGAAT 
TATGTGATGA 
TTTTTGCAAA 
AATGTAGCCG 
AAAGATTTGG 
GTCTATTTCA 
TTCACTAAAG 
TATTATGGGG 
ATACCATATG 
CTTGCTACTC 
AAGAAGCAAA 
AAAGATGTTT 
GGCCTCTCTG 
GTTAATGAAG 
GCTGTGCTGT 
AGAACTGGTA 
GGACATGATT 
AGCACATTAT 
ATAATATTTA 
GGAATGTTAT 
TTAAAATAAA 



41 

I 

CAGGGGTTGC 
TCTCTTTGCT 
CTGACTTGTC 
TCTTCTCTCC 
CTACCCAGAA 

AAATTGCCAA 
TGATGAAAAA 
TGGCCAACTA 
TATCCCCAAG 
AGGGGAACTG 
ATGATGAAAG 
AATTTAGTGA 
AAGGAGATGA 
TGGAGCCATT 
AAGTAGAAGT 
TGAAGGCTCT 
ATAATAAGGA 
AAGGCTCAGA 
ATCCTCAAGT 
CAATTCTATT 
TCGAAGAACT 
GTTTGCAACT 
AAATAGTTCC 
CAGTATTAAG 
AGTACCTATT 



Seq ID NO: 228 Protein sequence: 
Protein Accession #: NP_J)05016.1 



l 
I 

MAFLGLFSLL 
ELGAQGSTQK 
NEEFIiQMMKK 
INAVYFKGNW 
VLEIPYEGDE 
IDLKDVLKAI* 
SRMAVLYPQV 



11 
I 

VLQSMATGAT 
EIRHSMGYDS 
YPNAAVNHVD 
KSQFRPENTR 
ISMMLVLSRQ 
GITEIPIKDA 
IVDHPFFFLI 



21 
I 

FPEEAIADLS 
LKNGEEFSFL 
FSQNVAVANY 
TFSFTKDDES 
EVPIiATLEPL 
NLTGLSDNKE 
RNRRTGTILP 



31 41 

I 1 
VNMYNRLRAT GEDENILFSP 
KEFSNMVTAK ESQYVMKIAN 
INKWVENNTN NLVKDLVSPR 
EVQIPMMYQQ GEFYYGEFSD 
VKAQLVEEWA NSVKKQKVEV 
IFLSKAIHKS FLEVNEEGSE 
MGRVMHPETM NTSGHDFEEL 



Seq ID NO: 229 DNA sequence 
Nucleic Acid Accession #i NMJ>03 695 
Coding sequence: 12-398 



1 
I 

CGACATCAGA 
CAGCCCTTAC 
TCTGCCCGGC 
AT CTGGTGAA 
TCAGCAGCGG 
ACAACGCTGC 
TGAGCCTCCT 
TCATGCCTTT 
GGGTGCCAGG 
CATGGAATGC 
ACAGAGGATG 
GATTTCACAC 
TAAATGATTT 



11 
I 

GATGAGGACA 
CCTGCGCTGC 
CAGCTCTCGC 
GAAGGACTGT 
CACCAGCTCC 
ACCCACCCGC 
GGCCGTCATC 
OCTTCCCTTT 
AGCCCCAGGC 
TGATGACTTG 
CAGCCCCCAG 
TCCTTCTGTT 
AAACC 



21 
I 

GCATTGCTGC 
CACGTGTGCA 
TTCTGCAAGA 
GCGGAGTCGT 
ACCCAGTGCT 
ACCGCCCTCG 
TTAGCCCCCA 
CTCTGGGGAT 
TGAGGGCTTC 
GAGCAGGCCC 
CTGCATGGAA 
TTGTTGCCGT 



31 

I 

TCCTTGCAGC 
CCAGCTCCAG 
CCACGAACAC 
GCACACCCAG 
GCCAGGAGGA 
CCCACAGTGC 
GCCTGTGACC 
TCCACACCTC 
CCCGAAAGTC 
CACAGACCCC 
GGTGGAGGAC 
TTATTTTGTA 



41 

I 

CCTGGCTGTG 
CAACTGCAAG 
AGTGGAGCCT 
CTACACCCTG 
CCTGTGCAAT 
CCTCAGCCTG 
TTCCCCCCAG 
TCTTCCCCAG 
TGGGACCAGG 
ACAGAGGATG 
AGAAGCCCTG 
CTCAAATCTC 



Seq ID NO: 230 Protein sequence: 
Protein Accession #: NP_003686 

1 11 21 31 41 51 

I I I I I I 

MRTALLLLAA LAVATGPAJLT LRCHVCTSSS NCKHSWCPA SSRFCKTTNT VEPLRGNLVK 
KDCAESCTPS YTLQGQVSSG TSSTQCCQED LCNEKLHNAA PTRTAIiAHSA LSLGLALSLL 
AVILAPSL 



Seq ID NO: 231 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 126-752 



120 
180 
240 
300 
360 



51 
I 

AGGTGTGTGG 
GGTTCTGCAA 
AGTGAATATG 
ATTGAGTATT 
AGAAATCCGC 
GAAGGAGTTT 
TTCCTTGTTT 
ATATTTTAAT 
CATCAATAAG 
GGATTTTGAT 
GAAGTCGCAG 
TGAAGTCCAA 
TGGCTCCAAT 
AATAAGCATG 
AGTCAAAGCA 
ATACCTGCCC 
TGGAATAACT 
GATTTTTCTT 
AGCTGCTGCT 
TATTGTCGAC 
CATGGGACGA 
TTAAGTTACT 
GGTATATATT 
AGATAAAAAC 
CTAATGGTCC 
GAACATGTG 



51 
I 

LSIALAMGMM 
SLFVQNGFHV 
DFDAATYLAL 
GSNEAGGIYQ 
YLPRFTVEQE 
AAAVSGMIAI 



51 
I 

GCTACAGGGC 
CATTCTGTGG 
CTGAGGGGGA 
CAAGGCCAGG 
GAGAAGCTGC 
GGGCTGGCCC 
GGAAGGCCCC 
CCGGCAACGG 
TCCAGGTGGG 
AAGCCACCCC 
TGGATCCCCG 
TACATGGAGA 



60 
120 
180 
240 
300 
3 60 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 



277 



WO 02/086443 

1 11 21 31 41 51 

I I I I I i 

CCGGG CAGGT GGCTCATGCT CGGGAGCGT6 GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCACTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAAATT 720 

TTCAGGATGG CTGTATTCTG CGGTCAGAAT GAGAGAGTCA AGCTGGGCAG AATCTCTCGC 780 

CAAGAGTTCA GCCTTCCTTT GGAGACTGCT CCATCAGTGC CGAGGTGTGT GGGAACAGGC 840 

TTCACTGCAC CGCCATCTTA CTGAGTTGCT TCACGTGAGG AAAAGGGGGC TTTGGCCCTG 900 

TGACTCAGTT CCACATTTTG GATTGCATAC TGGAAAAGAA GCCAATCTTC TTGCTAGTAA 960 

ACCAGCAACC CGGCTGTATA CAGTGGTGAC CCAAGCAATG GATATAAACC TAAAAATCTG 1020 

AGGGAGGGGA GAGGTGGAAT ACAGTAGTTC TTGGAATCTG AAGTCTCCTA TTTGATCAGG 1080 

TTATTTCCTG GGACTTGGCA AAAATCTGAT TGGTGGGGAT CTCCTAGGAC CTAGTGGACA 1140 

TCTGGTATTA ATTTAATCTC AGGAAAAACA AGAAATTAAC CCAGAGAGAG TCTGGGTTTT 1200 

GGAATTCAGC GTAGCTACCT CCAGACCGTG GTGTCTGGCC TCCATTTTTG TCTGTCATTC 1260 

AGCTCTGACT TACAGCTGCA GTCACCTTTG CTATAAGGCA CCTGGGTAGA AGGGTGGATG 1320 

GGCTTCACAT CAATTTTTTT CTTCCTTTAG GGTGGGGGAT TGGTTTGGCT TTCTTTTGTT 1380 

GTGGTTTTTT GTTTTATTTT TGTCAAGATT GATTTTTAGA TGCAAGGACT TGAAAAGACC 1440 

CAGAAGGATG CCACCAGTTT TTCCTTGAGG CCTAGGATTT TTTATTCTGT CCCGAGCAGA 1500 

GGTAATTCCT CACAACTTAG TGCACCAGTA GCACCAGCCA TTTTGAGCAG AGTACCTCTT 1560 

TGGGGAGCTT TTCGTTTTGT TTTGTTTTTA ATTCTCTTTC CTTAGCAGCA AGGTCTTTTT 1620 

TCCTAGAGAA TCTACTCCGT TGCAGAATCA TTGCAACCTC AGGAGCCCTC ACTGATTGAG 1680 

TGCTGTCAGC CTGATATACT ACTTTGGACT CTGGAAACAG ATATGGGTTC TATTCTCTAT 1740 

TTCTACTGTG TGTCGTTAAA CAACCGTCGG AGACCAGATG ACCTGTTAGA TGGCTAGTCC 1800 

TGTATAACTC GACTCTGTAT GTTTCAATGT ATGTTACTGC AATGCTTCAC CTGCTGTACA 1860 
GTGTTTGTGA GATGCTCTTT GAAGATGGTA CTTTTATATT T 

Seq ID NOi 232 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I i I I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGR YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSIAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRJjE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 
LSIVCLMITQ LAGFSGPNFQ DGCILRSE 

Seq ID NO: 233 DNA sequence 

Nucleic Acid Accession #j CAT cluster 

1 11 21 31 41 51 

I I I I I I 

TTTTAATGGT GCTCATATAT ACTGTATTTT TTGTTGTTTA GTTTTACTTA TTGAGAGTGT 60 

CACAACATGA ATCACATAAT CATGATTTTT TTTTTTTACT TTTACTCCCC AAATTATTCA 120 

TGTTTCTTAG AT CG T AG T CA TTGAGAAGTC CCAATAACTC TAAACTTTTG AGTTATAACG 180 

TAGTAAACTT CTCTTTCATC TTTGTGTTAG CTCTGTAGTC TTAACCTGGA TTTTAATTTT 240 

TTTGTTTCCA AAGTCACAAT TGAATTATTC TTAGATACCT TAAGCCACTG AATTCAGTTC 300 

TGTTTGACTG AAAGCAAAAC AACGTGACAG TTTATTTTCA AACACTAACT TCTTGATATT 360 

TTGTTATGGT ATATCTTTTT ATTAAATATT TATTTTGACT AAGCTTTCAT AAAATATTTG 420 

AAGCTATTTT AATCATCAAG TATGGAAAAC AAATTACTAT TGCATTTTCC TATATATGCA 480 

TATATTATGG ATTAACCAGA ATTGTATCAT TTTTGGCCTA ATGTCTGGAT ATAAAAGATA 540 

ATTAGCCTAC TATAGTATTA ATAAATTTTT CAGTTGGTTT GGG CAAATTT AAA CCT GAAA 600 

AATAGGTTAA AAAGTAGTTA CAAATTAAAC TTACTAATTT ATACCTGATT TTTTTTCTTG 660 

AATTAAAGTA CATTTTAAAT GAGCTTTATA ATACCTTAAA AAGTTGGTTC TAATTTAAAA 720 

TATGAAAGCT CTGGCTATCA TCCTGGGATA GTAATTTCTA ATTATATAGT ATTTCAAAAC 780 

TATATATTTT TTAGTTCCTT TGAGATAACT AATTTCTAAT TATATATGTT TCAAAAACCA 840 

TATCCTGTAT TTTTTTTAAG AATTGTTTTA TAAATAGGTC ATAAGATACA AGGTCTGCAT 900 

TAGAAGACCC ACTCTTACTA GGTTCCCTAA GGATCTGCCA TAGATTTTTT TTTTTTTTTT 960 

TTTTTTTTAG GTAGTTTAAA GCAAGCACTG ATACCAGTGG GAGTTGGTCT TGATCTAGGA 1020 

GATTCTGTTA AGCATCCAAA AACAATGCCT AATTTCAGTT CTTAGGTTAT GGCTTGTGAC 1080 

TCCAGATAAA AGATGGAGAA TACCTCATGT ACTGTGACTT GAAAATGAAT TCTTAAAATT 1140 

CTTAGGCTCT CTCCATGTAT CTTTCTTAAG GAAAAGTTTC TGAGTGTGAT CTCTCTTTTG 1200 

CCATAGTATC AAGTGGAGGG TAGTTCAGAA AAGTTAATAG GAAATCTTTT GTGACAGCAG 1260 

ACTATAATAG AAGTTTGAGT AATATTTTAA TAAATTTATA TAATTCAAAT GATAAAAATG 1320 

TATCAATGTT ATCCAATGAT TTTTATTAAA AAATTACCTT ATTATTAGAA CTGTGCCTAT 1380 

TACATAAAAA GTGCTCATGT ATTTGAATTT TAAATAATTT ATTTAAATCA AGACCACCAT 1440 

AAGTCATTAA TAATTTAATA ATTGTTTTAA ATCAGTGGTT TTCAACCCTC ACTTCATATT 1500 

AGAATCATCT GAGGACTTTT AATATGGAAT CCACCTCATA ACAATTAAGT CTAAATTTCT 1560 

GGAAGATGGA GCCATGCTTG TTTTTCCAAA AGCTCTTTGA GTGATTCTAA TTTGTAGTCA 1620 

GAGTTGAAGA CCACTGCTCT AAATTAGTGC AGGAAAATGC TTTTATTTCT CCCATGTTAA 1680 

CTTTTAAAAC TAGTAATGTA CCCAGTTAAG TTTTGATGGT TTAAATTCCA CTAAAGAACA 1740 

TATTCTTCTA ATAACTAGCA TTTATTACAT GAAATTTAAG AGTTTAAGTT CCATCAAACT 1800 

AGCCCTTGTG TAAGATTATT ATTTCTTCTC TATAACTTCA AAATAGATAT TTCATTCAAA I860 

CTGTTCAGGT GAGAAAACAT AATGGATTTT TTTTTTTTTC CTCTGGAQCT GCCTGTTCAG 1920 

TGAGATGGAG GAGGTGGGCA CATTTAAGGT CAGTTCACTA ACCTATGGTT CAGAGTTCTG 1980 

ATCATATGGA AGTTTGGAAA AGAGAGCTTA TCACAGGTTT GTATGCTGGT GAATGGATAG 2040 

TTTTAATTCT CACTGTCTCA AAAGAGAATC AGCTCTCCAG CAGTTCTAGA AAAGCTTTGA 2100 

CAATCCCCAA GGGGCAGTGT TACCTTACTC CTTCACTGCT TCTTAGAAGG TAGAATTAAG 2160 

TTTCTGGAAT TGCACCTACA TGTTTTCTTA TTAACATT CA GAATTGGGAA TATTAATTTT 2220 
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TCCAGTGAGT AGTTTTCTGA AATTGGTAAC TTGGAGAOTA AAATAACGTA TTTTGCTTTT 2280 

CAATTTTGTG TTTGTTTACT TTTATGTAAA AATTTGATAT GTGAATTACA CAGTTCTAAT 2340 

AAAACCTCAT GCCTTTTCAT TACATCTAAT TTGAACTCTC AACTTCAGTG CCAGAAGTGC 2400 

TTTAAAGATG CTTTAATGAA AAGTATTAAG AAAATATATA GATTTGTATG TCAGTTTATA 2460 

CTTCAGAAAT CCATATATTT GTCATATTTA TTTTTTTAGA AACCTCCTAA TTGGATAACT 2S20 

AGATGGTATT TAAAATGAAT GCCCAAAAAT ATCTTGTACC TTTGTCCAAA AGTTTATCTG 2580 

TTGGAAGCCG CCAGCCATTC ATGTAGAGAG TTTATAAGAA AATAATTTAA AATTGTATGC 2640 

ATTTTATATT ACTATGGTAT CTGTGTACCA TATTTCTAAG TATTCATTAT TAAATTGGTA 2700 

CTTCTTAAAA CCATAACCTG GCTTGCCTTT TAGTGTTAAA CACAAAATCC AACATTGTAT 2760 

ATAGAGATTC TTCTTTTATG AAGAAGAGCT GACGTAATTT ATTACCAGTG CATCTGCACA 2820 

AAGACATTAA CATAAGTCTC TGAGCAGTGA TACATTTTCA AACATOAAGA GTGACAACCA 2880 

CCACATTAAA CAACCACGGC AACACTCAGA CTTGGCACTT TCCTACGAAT CCATCCTATA 2940 

TGTGCCTGGT ATCGCCTCTG GCATAACTTA CACGAATCGT CCTCCCTACT TGTCTACGCT 3000 

CCTTCATCAA GCACTTGCCA ACACATTCAC CTCTAACTTG TACAACCTTA CCAACTCACC 3060 

ACAACATCTG CAACTCTACC CTATCAACTG CCAACCTAAA GACCCCCAAC ACAACACAAC 3120 

CCCCAAACAC AAAACCACTA AATCATAACC ACCACACACG CCACACACCA CACACCCACC 3180 
CACACAACCA ACACACCACG ACCAAACACC CCACCACAAA CAAGCTAACA ACCACAAACA .3240 
GACAACACAT CACATACACT CACTACCCCC CCATACTCCC ACCCACCA 

Seq ID NO i 234 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 27-281 

1 11 21 31 41 51 

11)111 

AGCAGGAGGA GAGCTGGCGG GAAGACATGC ACCCCTTGAA GACCCAGAGA GAGGCCGTCT 60 

GTCTACCGCG TAGCAGTTAC AT CAGACTG A GACACTTCCT GTTTACAGGA GACTATAAAA 120 

TTCCTGCCCC GTGCTCATTT GGGGCTGACG CCATTTTAGG CCTCAGCCCA TCTGCACCCA 180 

GGCGCTCACT GAAACAGTGT GTTGCTCCAC ACCGCCTTGT TTTGCTTGTT GGCGCGCTCT 240 

CAGGGTTCCG ACCAATCCAA GAGCCTTGCA GAAAGCATTA ACGTGCTTTT CTCTTTGGCA 300 

GAGTTTTTCT TTGCTCTGAT CTTGGAGACA TCCCTCTGCC TAGTGGAAAC ATAAGGAATA 360 

CAGAAAGAAT GCAAGGAGAT AGACCAACGT GAGATTCTCC TTCATGCACT CAAGAGAAAG 420 

ATGTTGCAGG AAGAGCTAGT CTTTCAGGCT GGGCTGGTGA CCTGAGAAAG AATGTCCAGC 480 

TTTTCTTCTC CACTTGGCAT ATCAAGAGCC AGGCGTGGAA GACTAAAACA GGAAATGTTT 540 

ATAAAAACTG TTCAGCGGTT CGCCAACAAG AAGTGGTAAA GTAGCAAAAA TGGGGATGGA 600 

GATGCCAGGA GGAAAGATGC CAGGGGTAAA GTGGGAAAAT GGGAACCTGA AGCCAGGAGG 660 

TCAAGCCAAG CCAACAGGTG TTCTGTTTTT CATCACAGAA CTAATAAGTG GTGCTGAGGA 720 

CTCAAACCCG GGGAAGCCCA CTCTAGAACC CATGCTGGTC ATCCATATCC CCAAGGCCCT 780 

GGTCAGAACA CAGCTAAGCA GATGGCTTGG GTCATCAGGA CGTCCATTAC ATCCAAAGGA 840 

AGACAGCCTG TGACGTTTCA AAAGCAAAAG TCCCCTACCA GCCAGTGAAG CTACCTGATT 900 

TCTCAGTATC TTACGCCCAG TGACACGATC TACCCTCAAA ACTTAAAAAA AAAAGGGAAA 960 

CATAAACACA TAACAGCAGC AGCAATAATT AAAGATGAGA TGAGAACAAT TAAGAAAAAA 1020 

GGAAAGGTCT CCTGTGACTG TTTTATTTTT AGGGAAACAG AGAGGAAGAA GAATGATTTT 1080 

TCTTTTGATG ACTCTATATC CAACTCTGAG GTTTGATTAA AGAAATGACC TTGAACCACA 1140 

GCAAAGAAAA ATAAAAGACA ATTTCCAGTA AGTATGCCAG TTCGAATTAA TGATTTACTT 1200 

TTTATTTTTA AACTGAATTC AGCAGAGATT TACATGCATT ACGATGATTA ACATCTGAAA 1260 

TTTGACCTTG AAATAATCTT TACATTGTAA ATTCTTAATG ATCAAAACAA GGTTCTCAGT 1320 

GATTAAAACA TATTAGTAAT TAATTATTAA AGGAGAATAA TTGCAAATAC AACATTCCTA 1380 

AAATCTCAAG GCTTTTAAAG CATTTGTACA AATGACTGGA CATTTTTTAA ATTTGAAAAA 1440 

AAAAAAAAGC CCTCCATCTG ATTCTCATTT TCATTGTCAG TGCAACAACA AAAAAGGTAT 1500 

GCACTTCTCT TCTCATTTTC CACTGTCTCG CAAGCTAGAA ATTCTCACGA CTACCTTTGA 1560 

TCCCATCAAA GCCAAAGAAA GAAAAGAAAA TTGTTCTGTA CAGATATATG ACATTAAAAA 1620 
ATAATCCC 



Seq ID NO » 235 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

1 I I I I i 

MHPLKTQREA VCLPRSSYIR LRHFLFTGDY KIPAPCSPGA DAILGLSPSA PRRSLKQCVA 60 
PHRLVLLVGA LSGFRPIQEP CRKH 

Seq ID NO: 236 DNA sequence 

Nucleic Acid Accession 8: NM_002075 

Coding sequence: 406.. 1428 

1 11 21 31 41 51 

! I I I I I 

CCACAATAGG GGCAGACCTG TCCATCCTTC TCTGTGGGTC CCCTGTACCT TTCTCCCCCA 60 

ACAGGATCAG ACCCAGAGGC AGCTGGTTGG GGTTTGTCGA GAAGAAGGAT TATCCAGATC 120 

AGTCCTTTCT AATCTCAGCT CCTGCCTGTA CCCTCCCATA CTCACCAAAC CCTCTTCCCC 180 

ACCACCCTGA GCTGAGGAGC ACAGTTTGAG GCCCCCCCAA CCCCCCGCCG GTCGGGGCCA 240 

GGCCAGGCCA GGCCAGCTCC TCTGGCAGCA GAGCCTGGGC AGGTGACGGG CGGGCGCGGG 300 

CGTCGCAGCT GAGGGAGTAA GGAGGCTCCC AGGAACCGGA GCTGGAAACC CGGCCGAGGT 360 

CCAGCCAGAG CCCAAGAGCC AGAGTGACCC CTCGACCTGT CAGCCATGGG GGAGATGGAG 420 

CAACTGCGTC AGGAACCGGA GCAGCTCAAG AAGCAGATTG CAGATGCCAG GAAAGCCTGT 480 

GCTGACGTTA CTCTGGCAGA GCTGGTGTCT GGCCTAGAGG TGGTGGGACG AGTCCAGATG 540 

CGGACGCGGC GGACGTTAAG GGGACACCTG GCCAAGATTT ACGCCATGCA CTGGGCCACT 600 

GATTCTAAGC TGCTGGTAAG TGCCTCGCAA GATGGGAAGC TGATCGTGTG GGACAGCTAC 660 

ACCACCAACA AGGTGCACGC CATCCCACTG CGCTCCTCCT GGGTCATGAC CTGTGCCTAT 720 

GCCCCATCAG GGAACTTTGT GGCATGTGGG GGGCTGGACA ACATGTGTTC CATCTACAAC 780 

CTCAAATCCC GTGAGGGCAA TGTCAAGGTC AGCCGGGAGC TTTCTGCTCA CACAGGTTAT 840 

CTCTCCTGCT GCCGCTTCCT GGATGACAAC AATATTGTGA CCAGCTCGGG GGACACCACG 900 

TGTGCCTTGT GGGACATTGA GACTGGGCAG CAGAAGACTG TATTTGTGGG ACACACGGGT 960 

GACTGCATGA GCCTGGCTGT GTCTCCTGAC TTCAATCTCT TCATTTCGGG GGCCTGTGAT 1020 

GCCAGTGCCA AGCTCTGGGA TGTGCGAGAG GGGACCTGCC GTCAGACTTT CACTGGCCAC 1080 
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GAGTCGGACA TCAACGCCAT CTOTTTCTTC CCCAATGGAG AGGCCATCTG CACGGGCTCG 1140 

GATGACGCTT CCTGCCGCTT GTTTGACCTO CGGGCAGACC AGGAGCTGAT CTGCTTCTCC 1200 

CACGAGAGCA TCATCTGCGG CATCACGTCC GTGGCCTTCT CCCTCAGTGG CCGCCTACTA 1260 

TTCGCTGGCT ACGACGACTT CAACTGCAAT GTCTGGGACT CCATGAAGTC TGAGCGTGTG 1320 

GGCATCCTCT CTGGCCACGA TAACAGGGTG AGCTGCCTGG GAGTCACAGC TGACGGGATG 1380 

GCTGTGG CCA CAGGTTCCTG GGACAGCTTC CTCAAAATCT GGAACTGAGG AGGCTGGAGA 1440 

AAGGGAAGTG GAAGGCAGTG AACACACTCA GCAGCCCCCT GCCCGACCCC ATCTCATTCA 1S00 

GGTGTTCTCT TCTATATTCC GGGTGCCATT CCCACTAAGC TTTCTCCTTT GAGGGCAGTG 1560 

GGGAGCATGG GACTGTGCCT TTGGGAGGCA GCATCAGGGA CACAGGGGCA AAGAACTGCC 1620 

CCATCTCCTC CCATGGCCTT CCCTCCCCAC AGTCCTCACA GCCTCTCCCT TAATGAGCAA 1680 

GGACAACCTG CCCCTCCCCA GCCCTTTGCA GGCCCAGCAG ACTTGAGTCT GAGGCCCCAG 1740 

GCCCTAGGAT TCCTCCCCCA GAGCCACTAC CTTTGTCCAG GCCTGGGTGG TATAGGGCGT 1800 

TTGGCCCTGT GACTATGGCT CTGGCACCAC TAGGGTCCTG GCCCTCTTCT TATTCATGCT 1860 

TTCTCCTTTT TCTACCTTTT TTTCTCTCCT AAGACACCTG CAATAAAGTG TAG CACCCTG 1920 
GT 



Seq ID NO: 237 Protein sequence; 
Protein Accession #: NP_002066 

1 11 21 31 41 51 

MGEMEQLRQE AEQLKKQIAD ARKACADVTL AELVSGLEW GRVQMRTRRT LRGHLAKIYA 60 
MHWATDSKLL VSASQDGKLI VWDSYTTNKV HAIPLRSSWV MTCAYAPSGN FVACGGLDNM 120 
CSIYNLKSRE GNVKVSRELS AHTGYLSCCR FLDDNNIVTS SGDTTCALWD IETGQQKTVF 180 
VGHTGDCMSL AVSPDFNLP1 SGACDASAKL WDVREGTCRQ TFTGHESDIN AICPPPNGEA 240 
ICTGSDDASC RLFDLRADQE LICFSHESII CGITSVAFSL SGRLLFAGYD DFNCNVWDSM 300 
KSERVGILSG HDNRVSCLGV . TADGMAVATG SWDSFLKIWN 



Seq ID NO: 238 DNA sequence 

Nucleic Acid Accession #t CAT cluster 

1 11 21 31 41 51 

| ) | I I I 

TCCCAATGTG TNGAACCTAC CATAAATTCT TTTCTTACNG GACAATCTTA TNCTAANCAA 60 

TACCATTTGC TTTTAAGGCA GATAATCCTC CAAGTTTTCT AATGATATCT GAAACTATTA 120 

ACTGATTCTG TGAATTATGA AATCTGAAAA GGAATTGGAA GTTGCTAAAA AT CT ATCATT 180 

TGCATTGACC AGTGTGAAGC ACAGTGGAAT GAGAATGCGT GCCCTGACAC CAAAGAAAAA 240 

TAAGTGACTG GAAAGCTGAA GAATCACCGG CTTCAGTGAC ATGGAACCCA GTGATTTGAT 300 

TTTTGACGAG TAT CGGGTG A CTTTGAGGTG QTCAAGAAAC CACACTTTAA GAACAATGTC 360 

CAAAAAGGGG AAAAAAAAGA GCAACCAAAG AAAAAAAATC CATAAAATTG CACAGAAGAA 420 

AAGAAAGAAA AATAAAATAC ACAATATGGA CGATGGAGAA AAACAGTTAC ATTTCTTTAT 480 

GGATCAAGAA GTTTGTGTAC ACATAATCTC ATTTTGAGAT ATATAACTAT TTTTGTCTTT 540 

CAGAAGTGAA TCAAAATATT TCAAAATGCT GTCTTATGAA ACTACAATAT TCTCACAGAT 600 

TAGAAAAGTT TTTCTGTAAA AGT CAGAT AG TAAATATTTT AGGTTTTGCA GTGTCTTTTG 660 

CAACTACTCA ACTTTCCTAC TGTAGCACAA GAGTAGCTGT GGTACTGTGC AAATAAATTG 720 

CTTGTGTTCC AATAAAGCTT CATTTACAAA AACATGCCAT GGGCCATATT TGGCCTGTAC 780 

ACTGTTGTTT GCCAAGTCCT AATATAGTTG CTTAGCAAGT ATTGTGAGCT ATTTGAGGAA 840 

GACATGAAAG TTCATTGGGT TGCTAAAAAG TATGTAGAAA TTCAAAGGAA AATTAAAATT 900 

TAGGCTAAGT TATAATACAC TGTTTTAACA ATTGTAAAAT GTAAGAGAAA TTTACAAATA 960 
AAAATCCCAA ATAAAA 



Seq ID NO: 239 DNA sequence 

Nucleic Acid Accession #: NM_001786.1 

Coding sequence: 130-1023 

1 ii 21 31 41 51 

GGGGGGGGGG GGCACTTGGC TTCAAAGCTG GCTCTTGGAA ATTGAGCGGA GAGCGACGCG 60 

GTTGTTGTAG CTGCCGCTGC GGCCGCCGCG GAATAATAAG CCGGGATCTA CCATACCCAT 120 

TGACTAACTA TGGAAGATTA TACCAAAATA GAGAAAATTG GAGAAGGTAC CTATGGAGTT 180 

GTGTATAAGG GTAGACACAA AACTACAGGT CAAGTGGTAG CCATGAAAAA AATCAGACTA 240 

GAAAGTGAAG AGGAAG GGGT TCCTAGTACT GCAATTCGGG AAATTTCTCT ATTAAAGGAA 300 

CTTCGTCATC CAAATATAGT CAGTCTTCAG GATGTGCTTA TGCAGGATTC CAGGTTATAT 360 

CTCATCTTTG AGTTTCTTTC CATGGATCTG AAGAAATACT TGGATTCTAT CCCTCCTGGT 420 

CAGTACATGG ATTCTTCACT TGTTAAGAGT TATTTATACC AAATCCTACA GGGGATTGTG 480 

TTTTGTCACT CTAGAAGAGT TCTTCACAGA GACTTAAAAC CTCAAAATCT CTTGAT TGAT 540 

GACAAAGGAA CAATTAAACT GGCTGATTTT GGCCTTGCCA GAGCTTTTGG AATACCTATC 600 

AGAGTATATA CACATGAGGT AGTAACACTC TGGTACAGAT CTCCAGAAGT ATTGCTGGGG 660 

TCAGCTCGTT ACTCAACTCC AGTTGACATT TGGAGTATAG GCACCATATT TGCTGAACTA 720 

GCAACTAAGA AACCACTTTT CCATGGGGAT TCAGAAATTG ATCAACTCTT CAGGATTTTC 780 

AGAGCTTTGG GCACTCCCAA TAATGAAGTG TGGCCAGAAG TGGAATCTTT ACAGGACTAT 840 

AAGAATACAT TTCCCAAATG GAAACCAGGA AGCCTAGCAT CCCATGTCAA AAACTTGGAT 900 

GAAAATGGCT TGGATTTGCT CTCGAAAATG TTAATCTATG ATCCAGCCAA ACGAATTTCT 960 

GGCAAAATGG CACTGAATCA TCCATATTTT AATGATTTGG ACAATCAGAT TAAGAAGATG 1020 

TAGCTTTCTG ACAAAAAGTT TCCATATGTT ATGT CAACAG ATAGTTGTGT TTTTATTGTT 1080 

AACTCTTGTC TATTTTTGTC TTATATATAT TTCTTTGTTA TCAAACTTCA GCTGTACTTC 1140 

GTCTTCTAAT TTCAAAAATA TAACTTAAAA ATGTAAATAT TCTATATGAA TTTAAATATA 1200 
ATTCTGTAAA TGTGAAAAAA AAAAAAAAAA AAAAA 



Seq ID NO: 240 Protein sequence: 
Protein Accession #: NP_001777.1 

1 11 21 31 41 51 

) | I I I I 

MEDYTKIEKI GEGTYGWYK GRHKTTGQW . AMKKIRLESE EEGVPSTAIR EISLLKELRH 60 
PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKSYLY QILQGIVFCH 120 
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SRRVLHRDLK PQNLLIDDKG TIKLADFGLA RAFGIPIRVY THEWTLWYR SPEVLLGSAR 
YSTPVDIWSI GTIFAELATK KPLPHGDSEI DQLFRIFRAL GTPNNEVWPE VESLQDYKNT 
PPKWKPGSLA SHVKNLDENG LDLLSKMLIY DPAKRISGKM ALNHPYFNDL DNQIKKM 



180 
240 



PCT/US02/12476 



Seq ID NO: 241 DNA sequence 

Nucleic Acid Accession #t NM_033379.1 

Coding sequence i 132-854 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



i 
I 

OGCCCGCGCG 
GCTTTGCAGA 
ATTGACTAAC 
TTGTGTATAA 
TAGAAAGTGA 
AACTTCGTCA 
ATCTCATCTT 
GTCAGTACAT 
TATTGCTGGG 
TTGCTGAACT 
TCAGGATTTT 
TACAGGACTA 
AAAACTTGGA 
AACGAATTTC 
TTAAGAAGAT 
TTTTTATTGT 
AGCTGTACTT 
ATTTAAATAT 



11 
I 

CGGGCTCAAC 
GAGCGCCCTC 
TATGGAAGAT 
GGGTAGACAC 
AGAGGAAGGG 
TCCAAATATA 
TGAGTTTCTT 
GGATTCTTCA 
GTCAGCTCGT 
AGCAACTAAG 
CAGAGCTTTG 
TAAGAATACA 
TGAAAATGGC 
TGGCAAAATG 
GTAGCTTTCT 
TAACTCTTGT 
CGTCTTCTAA 
AATTCTGTAA 



21 
I 

TTTGTAGAGC 
CAGGGACTAT 
TATACCAAAA 
AAAACTACAG 
GTTCCTAGTA 
GTCAGTCTTC 
TCCATGGATC 
CTTGTTAAGG 
TACTCAACTC 
AAACCACTTT 
GGCACTCCCA 
TTTCCCAAAT 
TTGGATTTGC 
GCACTGAATC 
GACAAAAAGT 
CTATTTTTGT 
TTTCAAAAAT 
ATGTGAAAAA 



31 

I 

GAGGGGCCAA 
GCGTGCGGGG 
TAGAGAAAAT 
GTCAAGTGGT 
CTGCAATTCG 
AGGATGTGCT 
TGAAGAAATA 
TAGTAACACT 
CAGTTGACAT 
TCCATGGGGA 
ATAATGAAGT 
GGAAACCAGG 
TCTCGAAAAT 
ATCCATATTT 
TTCCATATGT 
CTTATATATA 
ATAACTTAAA 
AAAAAAAAAA 



41 
I 

CTTGGCAGAG 
ACACGGGATC 
TGGAGAAGGT 
AGCCATGAAA 
GGAAATTTCT 
TATGCAGGAT 
CTTGGATTCT 
CTGGTACAGA 
TTGGAGTATA 
TTCAGAAATT 
GTGGCCAGAA 
AAGCCTAGCA 
GTTAATCTAT 
TAATGATTTG 
TATGTCAACA 
TTTCTTTGTT 
AATGTAAATA 
AAAAAA 



51 
I 

CGCGCGGCCA 
TACCCATACC 
ACCTATGGAG 
AAAATCAGAC 
CTATTAAAGG 
TCCAGGTTAT 
ATCCCTCCTG 
TCTCCAGAAG 
GGCACCATAT 
GATCAACTCT 
GTGGAATCTT 
TCCCATGTCA 
GATCCAGCCA 
GACAATCAGA 
GATAGTTGTG 
ATCAAACTTC 
TTCTATATGA 



Seq ID NO: 242 Protein sequence: 
Protein Accession #: NP 203698.1 



11 



21 



31 



41 



51 

I ! 1 1 i I 

MEDYTKIEKI GEGTYGWYK GRHKTTGQW AMKKIRLESE EEGVPSTAIR EISLLKELRH 
PNIVSLQDVL MQDSRLYLIP EPLSMDLKKY LDSIPPGQYM DSSLVKWTL WYRSPEVLLG 
SARYSTPVDI WSIGTIFAEL ATKKPLFHGD SEIDQLPRIP RALGTPNNEV WPEVESLQDY 
KNTFPKWKPG SLASHVKNLD ENGLDLLSKM LIYDPAKRIS GKMALNHPYP NDLDNQIKKM 

Seq ID NO: 243 DNA sequence 

Nucleic Acid Accession ih AF101051.1 

Coding sequence: 221-856 



1 

1 

GAGCAACCTC 
CGACCCAGAG 
GCGGGGCCCA 
ACCTGCCACC 
GCTGTTGGGC 
GCCCCAGTGG 
CGAGGGGCTG 
TGACTCCTTG 
CATCCTCCTG 
CTTGGAAGAC 
TCTTGCAGGT 
ATTCTATGAC 
TGGCTGGGCT 
CCGAAAAACA 
GAAAGACTAC 
GGACATTGAG 
GTATGGTATT 
AAACATGGCT 
TTGTATTACT 
TATATATAGA 
CTCATTATGT 
CCATATTGAT 
CAGTCAAATA 
CTAATTTACC 
TTATTTTTTA 
TTTCATTGGT 
AGCCAAGAAG 
GTGATAAATT 
TTTGCTTTGA 
CACAACTTTA 
ACCTTTTTGT 
TATATCTTCC 
GATAATCTGG 
TCTTTTTTCT 
AATATTAATT 
TTTATTTGCT 
CTTCATGTGA 
ACACATACCT 
AAACCTACGC 
ATTCTTTCAG 
TTTCCAGTCT 



11 

I 

AGCTTCTAGT 
CTTCTCCAGC 
GCCACCTTCG 
CCTGAGCCAG 
TTCATTCTCG 
AGGATTTACT 
TGGATGTCCT 
CTGAATCTGA 
GGAGTGATAG 
GATGAGGTGC 
CTGGCTATTT 
CCTATGACCC 
GCTGCTTCTC 
ACCTCTTACC 
GTGTGACACA 
ATACTATCAT 
ACAAAACAAA 
TAATCTTATT 
GCTTCCCATT 
TATGTATATA 
TGATACTAGC 
GAAGATGTTT 
TCATTTACTC 
AAGGATGAAT 
CCATAATCTT 
CTCTATCTCC 
AATTTATTAC 
CCTGTTGACC 
AAATATTTGT 
TTGATTGAAT 
TCCCCATTCC 
TAATAAGGTG 
TGACAAATAT 
ATCTGCCAAA 
AG TTT AT ATT 
CAGCTGGCTG 
TTCACTGCCT 
TCATGTGGTT 
ACATACCTTC 
CTGTGTCTGA 
GTACAGAATG 



21 
I 

ATCCAGACTC 
GGCGGCGCAG 
GGAGTCCGGG 
CGCGGGCGCC 
CCTTCCTGGG 
CCTATGCCGG 
GCGTGTCGCA 
GCAGCACATT 
CAAT CTTTGT 
AGAAGATGAG 
TAGTTGCCAC 
CAGTCAATGC 
TCTGCCTTCT 
CAACACCAAG 
GAGGCAAAAG 
TAACATTAGG 
CAAACAAACA 
TTATCTTCTT 
GAGTAATCAT 
TACATGTTTT 
ATACTTAAAA 
ATTGGTATAT 
TTCTTCATTA 
TCTTTCAATT 
ATAGCACTTG 
TGAATCTAAC 
AAATCAGAAC 
TTCCCACACA 
CCAATTGAGT 
TTTTAAGCTA 
TTAATTGTAT 
TGGTCTGTTT 
TCTCTCTGTA 
TTGAGATAAT 
ACTCTCATTC 
AGACACTGAA 
TCCTCTCTCT 
CAGTGCCTTC 
ATGTGGCTCA 
CATGTTTGTG 
CTATTTCACT 



31 

I 

CAGCGCCGCC 
CGAGCAGGGC 
TTGCCCACCT 
CGAGCGAGTC 
ATGGATCGGC 
CGACAACATC 
GAGCACOGGG 
GCAAGCAACC 
GGCCACCGTT 
GATGGCTGTC 
AGCATGGTAT 
CAGGTACGAA 
GGGAGGTGCC 
GCCCTATCCA 
GAGAAAATCA 
ACCTTAGAAT 
AAAAACCCAT 
TCCTCAATAT 
ACTCAAATGG 
TCTATTAAAA 
TATCTCTAAA 
TTTCTTTTTC 
GCTTTGGGTG 
CTTCATGCGT 
CATCGTTATT 
ACATTTCATA 
TT TGGAGGCA 
AT CCCTGTAC 
AGCTGCATGC 
CTT ATT CAT A 
TGTTTTCCCA 
GTCTGAACAA 
GCTGTAAGCA 
GATACTTAAC 
TTTGAACATG 
GAAGTCACTG 
ACCAGTCTAT 
CTCTCTCTAC 
GTGCCTTCCT 
CTCTGTTCCA 
TGAGCAAGAT 



41 
I 

CCGGGCGCGG 
TCCCCGCCTT 
GCAAACTCTC 
ATGGCCAACG 
GCCATCGTCA 
GTGACCGCCC 
CAGATCCAGT 
CGTGCCTTGA 
GGCATGAAGT 
ATTGGGGGTG 
GGCAATAGAA 
TTTGGTCAGG 
CTACTTTGCT 
AAACCTGCAC 
TGTTGAAACA 
TTTGGGTATT 
GTGTTAAAAT 
AGGAGGGAAG 
GGGAAGGGGT 
ATAGACAGTA 
ATAGGTAAAT 
GTCCTTATAT 
CCTTTGCCAC 
GCCCTTTTCA 
AAGCCCTTAT 
GCCTACATTT 
AATCTTTCTG 
TCTGACCCAT 
TGTTCCCCCA 
GTTTTATATC 
AGTGTAATTA 
AGTGCTAGAC 
AGTCACTTAA 
CAGTTAGAAG 
AACTATGCCT 
AACAAAACCT 
TTCCACTGAA 
CAGTCTATTT 
CTCTCTACCA 
TTTTAACAAC 
GATGTATGGA 



51 
I 

ACCCCAACCC 
AACTTCCTCC 
CGCCTTCTGC 
CGGGGCTGCA 
GCACTGCCCT 
AGGCCATGTA 
GCAAAGTCTT 
TGGTGGTTGG 
GTATGAAGTG 
CGATATTTCT 
TCGTTCAAGA 
CTCTCTTCAC 
GTTCCTGTCC 
CTTCCAGCGG 
AACCGAAAAT 
GTAATCTGAA 
ACTCAGTGCT 
ATTTTACCAT 
GCTCCTTAAA 
AAATACTATT 
GTATTTAATT 
ACATATGTAA 
AAGACCTAGC 
TATACTTATT 
TTGTTTTGTG 
TAGTTTCTAA 
CATGACCAAA 
AGCACTCTTG 
GGTGTTGTAA 
CCCCTAAACT 
TCATGCGTTT 
TTTCTGGAGT 
TCTTTCTACC 
AGGTAGTGTG 
ATGTAGTGTC 
ACACACGTAC 
CAAAACCTAC 
CCACTGAACA 
GTCTATTTCC 
TGCTCTTACT 
AAGGGTGTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
72 0 
780 
840 
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960 
1020 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
' 660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



281 
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GCACTGGTGT CTGGAGACCT QGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 

CTGATCTTCC CACCTCACAG TGATGlTGTG GGGATCCAGT GAGATAGAAT ACATGTAAGT 2640 

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 

AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880 

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT 2940 

TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000 

TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 

CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT GGACCTAATA AATTTTAGTG 3120 

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 

ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 

ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300 

TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360 

TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 
AAAAGGAAAA AAAAAAAAAA AAA 



Seq ID NO: 244 Protein sequence: 
Protein Accession #> AAD16433.1 

1 11 21 31 41 51 

I I I I I I 

MANAGLQIiLG PILAPLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 
QIQCKVFDSL LNLSSTLQAT RALMWGI1»L GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120 
IGGAIFLLAG LAI L VAT AW Y GNRIVQEFYD PMTPVNARYE PGQALPTGWA AASLCLLGGA 180 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 

Seq ID NO i 245 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

I 1 I I I I 

TTTTTTTTTT TTTTTTTTTT TTTTTCAAGG AGAGCACAAG GAACTTTATT AATGACTTTC 60 

TTAATGGTTA AATGCTGTTT ACCAAGTGAC CCAGAGGCAG CGTGGTTTAG TGGTTTCAAC 120 

AGCATGGTCC CGAGAGTCTG ACAAACCTCA GTTCAAATCC TTCTTTTGTC TTCACTTAGT 180 

TTTTCTTCCT GAGATTTAGT TTCTTCATCG TTAACAATGA GGATATTAAT ATGTTTCACA 240 

CAGTTGTTAT GAAGAATGCA TATATTAGAA TGCCTGTAGT CTCAGCTACT CAGGAGGCTA 3 00 

AGGTGGGGAG GTCGCTCAAG CCCAGGAATT CAAAGCTGCA ATGCATTATG ATTACAGCTG 360 

TTAATAGCCA CTGCACTTCA GCCTGGGCAA TGTAGTAAGA TCCCATCTCT GGCTCX3GAGG 420 
GTCCTACGCC CACGGAGTCT CGCTGATTGC TAGCACAGCA GTCTGAGATC AAACTGCA 

Seq ID NO: 246 DNA sequence 

Nucleic Acid Accession #: XM_058553.2 

Coding sequences 897-1400 

1 11 21 31 41 51 

I I I 1 1 I 

AATTTTCAGA AGTTTCGTAT GGGGATGGTT TTATATAAAT TCAGGTTTTT CCCACAATAA 60 

TAAATGTATT TAGTCTCAGT GCTCAATAGA AGAGATTTCT AATAGAAAAG GATTCAAACT 120 

GTGAAACCAT TTCTCTTTTA ATGTTTCACA TTCCTGTTAC AGATTTGTTC TCTTGTGACT 180 

CTGTT AT CCA TAATATGGAC AGTTCTTGAG TCCTAACATT GAGAGGTTTT CCCTTAGTGC 240 

ATAGAGGGAA TGAGTATTAA TTGGAGAAGC TTAAAGTATT GCCACTTTAG CACTGAAGAT 300 

TGGGATGAGA GGAGGTGAAA CCTCACTAGA AAAAGGGACA ATGTTAGTGT GGCCCTTCCT 3 60 

GATCATGTTT AAGAAAAGTC ATGAAAATGG TGAACTAGTG TTTCCAAGCA TATTGGAAGG 420 

GTTGAGTGTA TACTGTCTGT CAAAGACTTC CAGCATTTCC AGGTCCTAGA GAGGAACAAG 4 B0 

ACTGGTAACC TGCCTATCTG TATTTTTAAG AACCCAGGAG GAAAGCTTTA TAATAGAACA 540 

TTATTTCTGT GTTTATGTAT AAGGGGTTTT TTGTTTTTTT AAAGACAGGA TCTCACTCCA 600 

TTGTCCAGGC CAAGTGCAAT GGCACGAACC TCATAGCTCC TGGACTTAAG TGATCTGCCT 660 

GCCTTTGCCT CCTGAGTAGC TGGGACTACA GGCATGAGCC CCCATGCCTG GCTAAGTTTG 720 

TTTTTTTGTT TGTTTGTTTG TTTGTTTTTG GGGGGGGTTG TTTTGTTTTT TGTAGAGACG 780 

TAGTCTTGCT TTGTTGCCAG GCTAGTCTCA AACTCCTGGC TTCAAGTGAT CCTCCTGCCT 840 

CAGCCTCCCA GAGTGCTAGG ATTACAGCAC TTGGATTCAG CTTCTTCATT TCCAACATGG 900 

AAGAAACTTA CACCGACTCC CTGGACCCTG AGAAGCTATT GCAATGCCCC TATGACAAAA 960 

ACCATCAAAT CAGGGCTTGC AGGTTTCCTT ATCATCTTAT CAAGTGCAGA AAGAATCATC 1020 

CTGATGTTGC AAGCAAATTG GCTACTTGTC CCTTCAATGC TCGCCACCAG GTTCCTCGAG 1080 

CTGAAATTAG TCATCATATC TCAAGCTGTG ATGACAGAAG TTGTATTGAG CAAGATGTTG 1140 

TCAACCAAAC CAGGAGCCTT AGACAAGAGA CTCTGGCTGA GAGCACTTGG CAGTGCCCTC 1200 

CTTGCGATGA AGACTGGGAT AAAGATTTGT GGGAGCAGAC CAGCACCCCA TTTGTCTGGG 1260 

GCACAACTCA CTACTCTGAC AACAACAGCC CTGCGAGCAA CATAGTTACA GAACATAAGA 1320 

ATAACCTGGC TTCAGGCATG CGAGTTCCCA AATCTCTGCC GTATGTTCTG CCATGGAAAA 13 BO 

ACAATGGAAA TGCACAGTAA CTGAATACCT ATCTCATCAA ATGCCAGACC CTAGAAGACT 1440 

GTTGCTTCTT CTTCTACCAG TGGGTTCTCA TTTTCCTCCT AATCTAATTA TAGAATGGTA 1500 

AACTCCCTGT GACTTTCCAA ACTGACAAGC ACACTTTTTT CCTCCCCCCT TGAATCCTCA 1560 
TTTAATGCAA GAACCCTCAT ACTCAGAAGC TTCCAAATAA ACCTTTGATA CAGATTG 

Seq ID NO: 247 Protein sequence: 
Protein Accession #: XP_0S8S53.1 

1 11 21 31 41 51 

I I I I I I 

MEETYTDSLD PEKLLQCPYD KNHQIRACRF PYHLIKCRKN HPDV AS KLAT CPFNARHQVP 60 
RAEISHHISS CDDRSCIEQD WNQTRSLRQ ETLAESTWQC PPCDEDWDKD IjWEQTSTPFV 120 
WGTTHYSDNN SPASNIVTEH KNNLASGMRV PKSLPYVLPW KNNGNAQ 



282 



WO 02/086443 

Seq ID NO: 248 DNA sequence 
Nucleic Acid Accession ft: NMJ>03392 
Coding eequencei 7S8..1855 



PCT/US02/12476 
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TTAAGOAAAT 
AACTGATTAT 
CGCGCTGGTC 
CATCCTCCAC 
TCCACTCGCC 
GGCCCAGGTT 
GAGGGACTGA 
6GAAGGAG6C 
TCGCTGGCGT 
TGGGTGGATT 
GAGAAGCGCA 
TGTATGCTTG 
GTCCATTGGA 
CAAGTTCTTC 
CAATTCTTGG 
AGGAGCACAG 
CCACTTGTAT 
ATGCCAGTAT 
TTTTGGCAGG 
AGCAGGGGTG 
CTGCAGCCGC 
CGACAACATC 
GCGCATCCAC 
CGAGGCCGGC 
GTCCGGCTCA 
TGATGCCCTG 
GTTGGTACAG 
CCCCAGCCCT 
CCTGTGCAAC 
GTACGACCAG 
CTACGTCAAG 
GCCACCCAGC 
TGGTTTTTGG 
TTTTTTCCTG 
GGCAATAATG 
TACAAGACTT 
CTGTGTGGGA 
TGCCATCATA 
TTCAGCTTCT 
AAAACAAAAC 
CATTTTCAAA 
GGTATATCAC 
AATAGCTCAT 
CTCTTATGTC 
AAGACCCCCA 
ATGAAATATC 
TACATGAATC 
GCACTGCACC 
ACACTGAGCC 
GCAGCTCCAC 
TGGAAAACAG 
TAGTTTTCTA 
TATCACTGTT 
GTGTACCTTA 
GGTTTAATGG 
ATATATAAAT 
CTCTGGGGTT 
ATTCCAAAAG 
GCACGACGAA 
TGTTGGGTTG 
TTCTGTTCAC 
ATTCAAAACT 
AACCCATGCC 
CACTACATAG 
CAGGCCATCA 
ACATCTTTTC 
CTCTTAATTT 
ATAATGATAT 
AAAGCACTAA 
- TACTTTTTTT 
GTTGAGTTTA 
CATTCAGATA 
GTCTTGCGTG 
AATGGAAGAT 



11 
I 

CCGGGCTGCT 
GAAACATACG 
CCCGGGGCCT 
CCCCCGCGCT 
TCCCGTGCTC 
GGGGAGAGGA 
GCACAGCACC 
AGCGCCTGGC 
GCCCCGCGCA 
AATTTGGAAA 
GTCAATCAAC 
AAAATTATCT 
ATATTAAGCC 
CTAGTGGCTT 
TGGTCGCTAG 
CCTCTCTGCA 
CAGGACCACA 
CAATTCCGAC 
GTGATGCAGA 
GTGAACGCCA 
GCCGCGCGCC 
GACTATGGCT 
GCCAAGGGCT 
CGCAGGACGG 
TGTAGCCTGA 
AAGGAGAAGT 
GTCAACAGCC 
GACTACTGCG 
AAGACGTCGG 
TTCAAGACCG 
TGCAAGAAGT 
ACTCAGCCCC 
TTTTTAGAAA 
TTACCATCTA 
GGGGTGGGAA 
CTTTTGGATA 
CATGGTACAC 
TGGGATGGGT 
ATGACCAAAA 
AAAACAAAAC 
ATGATAATTT 
ATGTCTCATT 
GAAATTTGGG 
AAGATGTTGA 
ATGATTCTGG 
CTGTATTTTC 
CCATTCACAG 
AGAGCAGACA 
CTCTCTGATT 
TGGGTCCCCT 
TTCACTACTT 
TGTTTTAATG 
ATGATCCTGT 
AAACTGTTCC 
TGCCTGATAT 
ATAAATATAA 
ATCTCTCTGT 
TTTTTTGAGT 
GCAACCTCGT 
AAGATATCTT 
TTTGTGGAGA 
CAGAAGCATC 
TATTAGAAAT 
ATAGCTTTTT 
AATATGTTCT 
ATACCCCCCC 
ATTGCATAAT 
TCACATCCCC 
TTAGTTTAAA 
ATTTGCTAAA 
ACAATCCTAG 
TTATGTATAT 
ATTTGTATAT 
AGAATATAAA 



21 
I 

CTTCCCCATC 
ATGTTAATTC 
CGCCCCCCAC 
GGCCACCCCG 
CTCTCGCCCA 
CGGAGGGTGG 
AACTAGAGAG 
ACCAGGGCTT 
CAGGATCCCA 
AAGAAACTGC 
AGTAAACTTA 
GAGAGGGAAT 
CAGGAGTTGC 
TGGCCATATT 
GTATGAATAA 
GCCAACTGGC 
TGCAGTACAT 
ATCGACGGTG 
TAGGCAGCCG 
TGAGCCGGGC 
CCAAGGACCT 
ACCGCTTTGC 
CCTACGAGAG 
TGTACAACCT 
AGACATGCTG 
ACGACAGCGC 
GCTTCAACTC 
TGCGCAATGA 
AGGGCATGGA 
TGCAGACGGA 
GCACGGAGAT 
GCTCCCAGGA 
TATTTTTTAT 
AGAACTCTGT 
CCACGAAAAA 
GTATAGAATG 
ATCCAGAAGG 
AGGTTCCAGT 
TGAGTTGTAA 
AAACCTCCCT 
ACAATGGAAG 
CTCCTCAAAT 
CAGCAGGGAG 
TTTGAAGCTG 
ACACTAGATT 
TTAGGGATAC 
GTTTCTCAGC 
ACCTATTTGA 
CCTCCGTGTT 
TTGGTTGTAG 
AGGGATTTTT 
ACAGAACTTG 
GTTTAGATTA 
CAGTGTACTT 
CTCAAAGTCT 
ATATATCTCA 
CTAGAGCATT 
CTTGAGCTTG 
TTCTGAGGAA 
TTTTTCTTTT 
GGGCATTACT 
AGCAATGTTT 
GACAGTACTT 
TTTTTTTTTT 
TATCTCAGAC 
TTAGGAGGTT 
GATATCCACA 
TCAGTTGCAG 
ATGTCACTTT 
TCAGATTGTT 
CTTTTAAAAG 
CTTCTAGCCT 
TTCACTGGTT 
ATAAAACGTT 



31 

I 

TGGAAGTGGC 
GGAGCTGCAT 
CCCCTGCCCT 
CCTCCTTGGC 
TGGAATTAAT 
CCGCAGCGGG 
GGGTCAGGGG 
TG ACT CAACA 
GCGAAAATCA 
CTATATCTTG 
AGAGACCCCC 
AAACATCTTT 
TTTGGGGATG 
TTTCTCCTTC 
CCCTGTTCAG 
AGGACTTTCT 
CGGAGAAGGC 
GAACTGCAGC 
CGAGACGGCC 
GTGCCGCGAG 
GCCGCGGGAC 
CAAGGAGTTC 
TGCTCGCATC 
GGCTGATGTG 
GCTGCAGCTG 
GGCGGCCATG 
GCCCACCACA 
GAGCACCGGC 
TGGCTGCGAG 
GCGCTGCCAC 
CGTGGACCAG 
CCCGCTTATT 
TTTTCCCCAA 
GGTTTATTAT 
TATTTATTTT 
AAGGGGGAAA 
TAAAG AAAT A 
TGAAAGAGGG 
ATTCTCTGGT 
TCCCCAGCAG 
GACAAGAATG 
ATTCCATTTG 
GAAAGTCCCC 
TTATAAGAAT 



TTGGTTAGTA 
CCAAGCAACA 
GGAAAAACAG 
GTGATGTGAT 
GACAGGAAAT 
GTTT CCTAAA 
GCTAATGGAA 
TCCACTCATG 
GAACAGTTGC 
TTTGTACATA 
TTGCAGCCAG 
GTTGTCCTTC 
GGCTGTGGCC 
GAAGCTTGAG 
CTGCCTCACC 
TGTTCGTTAT 
CTCTTTTCTT 
ATTAATTGAG 
TTTTTTTTAA 
TTACGTTGTT 
GGGCTTTCAT 
TCAGCCAACT 
TGAATTGTGA 
TTTGGTTTTT 
CCTTTTTAGT 
AAACTATTTA 
TTATTCTGTA 
TAAAAAACAA 
ACTTGTAAAA 



41 

I 

TTTCCCCACA 
TTCCCAGCTG 
TCCCTCCCGC 
AGCCTCTGGC 
TCTGGCTCCA 
TTCCTGAGTG 
GTGCGGGACT 
GAATTGAGAC 
GATTTCCTGG 
CCATCAAAAA 
GATGCTCCCC 
TCCTTCTTCC 
GCTGGAAGTG 
GCCCAGGTTG 
ATGTCAGAAG 
CAAGGACAGA 
GCGAAGACAG 
ACTGTGGATA 
TTCACATACG 
GGCGAGCTGT 
TGGCTCTGGG 
GTGGACGCCC 
CTCATGAACC 
GCCTGCAAGT 
GCAGACTTCC 
CGGCTCAACA 
CAAGACCTGG 
TCGCTGGGCA 
CTCATGTGCT 
TGCAAGTTCC 
TTTGTGTGCA 
TATAGAAAGT 
GAATTGCAAC 
TAATATTATA 
GTGGATCTTT 
TAACACATAC 
CATTTTCTTT 
TGGTAGAAAT 
GCAAGATAAA 
GGCTGCTAGC 
TCATATTCTC 
CAGACAGACC 
AGAAATTAAA 
TGGGATTCCA 
GGAGGTTGGC 
AATTATAATA 
AGGTAATTGC 
TGAAATCCAC 
GCTGGCCACG 
GAAACATTAG 
ACTTTTATTT 
TTCACAGAGG 
CTTCT CCTAT 
ATTTATAAGG 
ACATATATAT 
TGATTTAGAT 
ACTGCAGTCC 
CCGCTGTGAT 
TTCTGACTCA 
CCTTTGTCTC 
AGACATGGAC 
AGTTCATTCT 
TCCCTAAGGA 
TAAGGACACC 
TTAAAAGTTT 
ATCACCTCAG 
GTGGCTCTTT 
GCAAAAGATC 
ATTATACAAA 
GACTCATGTT 
ATGTAAAATA 
CTTTTAATGT 
ACATCGAAAG 
AAAAAAAA 



SI 
I 

TCGGCTCGTA 
GGCACTCTCG 
GTCCTGCCCC 
GGCAGCGCGC 
CTTGTTGCTC 
AATTACCCAG 
CGAGCGAGCA 
ACGTTTGTAA 
TGAGGTTGCG 
ACTCACGGAG 
TGGTTTAACT 
CTCTCCAGAA 
CAATGTCTTC 
TAATTGAAGC 
TATATATTAT 
AGAAACTGTG 
GCATCAAAGA 
ACACCTCTGT 
CCGTGAGCGC 
CCACCTGCGG 
GCGGCTGCGG 
GCGAGCGGGA 
TGCACAACAA 
GCCATGGGGT 
GCAAGGTGGG 
GCCGGGGCAA 
TCTACATCGA 
CGCAGGGCCG 
GCGGCCGTGG 
ACTGGTGCTG 
AGTAGTGGGT 
ACAGTGATTC 
CGGAACCATT 
ATTATTATTT 
GAAAAGGTAA 
CCTAACTTAG 
TTCTCAAATA 
CTATTCACAA 
AGGTCTTGGG 
TTGCTTTCTG 
AAGGAAAAAA 
GTCATATTCT 
AAATTTAAAA 
GATTTGTAAA 
TTGAACATAA 
GTAGAAATAA 
GTGCCATTCA 
CTTCCTCTTC 
TTTCCAAACG 
GAGCTCTGCT 
TGAGGAGCAG 
TGTTGCAGCG 
TGTACTGCAG 
GGGGAAATGT 
ATATATACAT 
TTACAGCTTA 
AGTTGGGATT 
CATACCCTGA 
CTGAAATGCG 
CAACCTCCAT 
GTTAAGAGAT 
GCAGAATGGA 
ATATTCAGCC 
TCTTTCCAAA 
GGAAAGATAC 
CCAACTGTGG 
AATTTATTGC 
TTGAAAGCAA 
AACCATGAAG 
TATGAAGAGA 
TTCTACATGT 
ACATATTTCT 
GCTTATTCCA 
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Seq ID NOt 249 Protein sequence i 
Protein Accession #t NP_003383 



11 
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21 
I 
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I 



51 

I 
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WO 02/086443 

MAGSAMSSKP FLVALAIFFS FAQWIEAN3 WWSLGMNNPV 
SQGQKKLCHL YQDHMQYIGE GAKTGIKECQ YQFRHRRWNC 
AFTYAVSAAG WHAMSRACR EGELSTCGCS RAARPKDLPR 
FVDARERERI HAKGSYESAR ILMNLHNNEA GRRTVYNLAD 
LAD FRKVGDA LKEKYDSAAA MRLNSRGKLV QVNSRFNSPT 
GSLGTQGRLC NKTSEGMDGC ELMCCGRGYD QFKTVQTERC 
QFVCK 

Seq ID NO: 250 DNA sequence 
Nucleic Acid Accession #i N*I_014058 
Coding sequence i 56.. 1324 



PCT/US02/12476 



i 
I 

TGACTTGGAT 
TCGGCCAGAT 
CGTCATCTTC 
GAGATATAAT 
ACTATATGCT 
TGAATCAATO 
TCAGGTTATC 
TAQATTTCAC 
TGAAAAGCTG 
AAAAATCAAC 
TAAAACTCTA 
GCCCTGGCAG 
TGCCACATGG 
GACTGCTTCC 
AATTGTCCAT 
TTCTAGCCCT 
TGAGTTTCAA 
TTACAGTCAA 
TGAACCTCAA 
AGGAAAAACA 
AGATATCTGG 
GCCTGGTGTT 
CTAAGAGAGA 
CCATTTTTAG 
AATAAACTGT 



11 

I 

GTAGACCTCG 
GTGGTGAGGG 
ATATCCCTGA 
CAAAAGAAGA 
GAGTTTGGCA 
GTGAAAAATO 
AAGTTCAGTC 
TCTACTGAGG 
CAAGATGCTG 
AAGACAGAAA 
GGTCAGAGTC 
GCTAGCCTGC 
CTTGTGAGTG 
TTTGGAGTAA 
GAAAAATACA 
GTTCCCTACA 
CCAGGTGATG 
AATCATCTTC 
GCTTACAATG 
GATGCATGCC 
TACCTTGCTG 
TATACTAGAG 
AAAGCCTCAT 
AGATACAGAA 
TTGCTTGATG 



21 
I 

ACCTTCACAG 
CTAGGAAAAG 
TTGTCCTGGC 
CCTACAATTA 
GAGAGGCTTC 
CATTTTATAA 
AACAGAAGCA 
ATCCTGAAAC 
TAGGACCCCC 
CAGACAGCTA 
TCAGGATCGT 
AGTGGGATGG 
CTGCTCACTG 
CAATAAAACC 
AACACCCATC 
CAAATGCAGT 
TGATGTTTGT 
GACAAGCACA 
ACGCCATAAC 
AGGGTGACTC 
GAATAGTGAG 
TTACGGCCTT 
GGAACAGATA 
TTGGAGAAGA 
CAAAAAAAAA 



31 
I 

GACTCTTCAT 
AGTTTGTTGG 
AGTGTGCATT 
CTATAGCACA 
TAACAATTTT 
ATCTCCATTA 
TGGAGTGTTG 
TGTAGATAAA 
TAAAGTAGAT 
TCTAAACCAT 
TGGTGGGACA 
GAGTCATCGC 
TTTTACAACA 
TTCGAAAATG 
ACATGACTAT 
ACATAGAGTT 
GACAGGATTT 
GGTGACTCTC 
TCCTAGAATG 
TGGAGGACCA 
CTGGGGAGAT 
GCGGGACTGG 
ACATTTTTTT 
CTTGCAAAAC 
A 



QMSEVYIIGA 
STVDNTSVFG 
DWLWGGCGDN 
VACKCHGVSG 
TQDIiVYIDPS 
HCKFHWCCYV 



41 
I 

TGCTGGTTGG 
GAACCCTGGG 
GGACTCACTG 
TTGTCATTTA 
ACAGAAATGA 
AGGGAAGAAT 
GCTCATATGC 
ATTGTTCAAC 
CCTCACTCAG 
TGCTGCGGAA 
GAAGTAGAAG 
TGTGGAGCAA 
TATAAGAACC 
AAACGGGGTC 
GATATTTCTC 
TGTCTCCCTG 
GGAGCACTGA 
ATAGACGCTA 
TTATGTGCTG 
CTGGTTAGTT 
GAATGTGCGA 
ATTACTTCAA 
TTGTTTTTTG 
AGCTAGATTT 



QPLCSQLAGL 
RVMQIGSRET 
IDYGYRFAKE 
SCSLKTCWLQ 
PDYCVRNEST 
KCKKCTEIVD 



51 
I 

CAATGATGTA 
TTATCGGCCT 
TTCATTATGT 
CAACTGACAA 
GCCAGAGACT 
TTGTCAAGTC 
TGTTGATTTG 
TTGTTTTACA 
TTAAAATTAA 
CACGAAGAAG 
AGGGTGAATG 
CCTTAATTAA 
CTGCCAGATG 
TCCGGAGAAT 
TTGCAGAGCT 
ATGCATCCTA 
AAAATGATGG 
CAACTTGCAA 
GCTCCTTAGA 
CAGATGCTAG 
AACCCAACAA 
AAACTGGTAT 
GGTGTGGAGG 
GACTGATCTC 



Seq ID NO: 251 Protein sequence: 
Protein Accession #i NP_054777 



1 
I 

MYRPDWRAR 
DXLYABFGRE 
ICRFHSTEDP 
RSKTLGQSLR 
RWTASPGVTI 
SYEFQPGDVM 
LEGKTDACQG 
GI 



11 
I 

KRVCWEPWVI 
ASNNFTEMSQ 
ETVDKIVQIjV 
IVGGTEVEEG 
KPSKMKRGLR 
PVTGFGALKN 
DSGGPLVSSD 



21 
I 

GLVIPISLIV 
RLKSMVKNAF 
LHEKLQDAVG 
EWPWQASLQW 
RIIVHEKYKH 
DGYSQNHLRQ 
ARDIWYLAGI 



31 

I 

IAVCIGLTVH 
YKS PLREEFV 
PPKVDPHSVK 
DGSHRCGATL 
PSHDYDISIiA 
AQVTLIDATT 
VSWGDECAKP 



41 

I 

YVRYNQKKTY 
KSQVIKFSQQ 
IKKINKTETD 
INATWLVSAA 
ELSSPVPYTN 
CNE PCAYNDA 
NKPGVYTRVT 



Seq ID NO: 252 DNA sequence 

Nucleic Acid Accession ft: NM_003504.2 

Coding sequence: 71-1771 



1 
I 

GGCACGAGGC 
CGCCGTGGCT 
GAGGGTCCTT 
GGCCTTGTTC 
ACTTGAAACT 
TGGAGCTAAT 
GTGTGACACC 
ACTCATTAAA 
AGAGGAGGAT 
CACACGGTTA 
GGAGGCCCGG 
GTCAGCCATG 
GTGGTGGGCC 
ATAOGTGACT 
GGATGAGGAG 
CCTGGTGCTC 
AGCCAGGTTC 
CATGGGTCTT 
GGAGAATTTG 
CGTGCAGACT 
CTTTGCCACC 
CATCCAGGCT 
ACTCGCCAAG 
CCTCGTCATC 
CATGCTGTTC 
TGTGTGTTCG 



11 
I 

CTCGTGCCGC 
ATGTTCGTGT 
CTCTTCGTGG 
CAGTGTGACC 
GCATTTCTTG 
GTAGACCTAT 
CATAGGCCAG 
CAAGATGATG 
GAAGAGCATT 
GAAGAGGAGA 
AGAAGAGACA 
GTGATGTTTG 
ATCGTTGGAC 
GATGTTGGTG 
AACACACTCT 
TACCAGCACT 
AAGCTGTGGT 
CCCCTGAAGC 
CGGGAAATGA 
TTCAGCATTC 
ATGTCTTTGA 
CTGGACAGCC 
AAGCAGCTGC 
TCCCAGGGGC 
TCTAGGCCGG 
ACAAAGAACC 



21 
I 

CGGGCTCTTG 
CCGATTTCCG 
CCTCGGACGT 
ACGTGCAATA 
AGCATAAAGA 
TGGATATTCT 
TCAATGTCGT 
ACCTTGAAGT 
CAGGAAATGA 
TAGTGGAGCA 
TCCTCTTTGA 
AGCTGGCTTG 
TAACAGACCA 
TCCTGCAGCG 
CCGTGGACTG 
GGTCCCTCCA 
CTGTGCATGG 
AGGTGAAGCA 
TTGAAGAGTC 
ATTTTGGGTT 
TGGAGAGCCC 
TCTCCAGGAG 
GAGCCACCCA 
CTTTCCTGTA 
CATCCCTAAG 
GGCGCTGCAA 



31 
I 

GTACCTCAGC 
CAAAGAGTTC 
GQATGCTCT3 
TACGCTGGTT 
ACAGTTTCAT 
TCAACCTGAT 
CAATGTATAC 
TCCCGCCTAT 
CAGTGATGGG 
AACCATGCGG 
CTACGAGCAG 
GATGCTGTCC 
GTGGGTGCAA 
CCACGTTTCC 
CACACGGATC 
TGACAGCCTG 
ACAGAAGCGG 
GAAGTTCCAG 
TGCAAATAAA 
CAAGCACAAG 
CGAGAAGGAT 
TAACCTGGAC 
GCAGACCATT 
CTGCTCTCTC 
CCTGCTCAGC 
ACTGCTGCCC 



41 
I 

GCGAGCGCCA 
TACGAGGTGG 
TGTGCGTGCA 
CCAGTTTCTG 
TATTTTATTC 
GAAGACACTA 
AACGATACCC 
GAAGACATCT 
TCAGAGCCTT 
AGGAGGCAGC 
TATGAATATC 
AAGGACCTGA 
GACAAGATCA 
CGCCACAACC 
TCCTTTGAGT 
TGCAACACCA 
CTCCAGGAGT 
GCCATGGACA 
TTTGGGATGA 
TTTCTGGCCA 
GGCTCAGGGA 
AAGCTGTACC 
GCCAGCTGCC 
ATGGAGGGCA 
AAACACCTGC 
CTGGTGATGG 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



51 
I 

NY Y STI»SFTT 
KHGVLAHMIiIj 
SYLNHCCGTR 
HCFTTYKNPA 
AVHRVCLPDA 
ITPRMLCAGS 
ALRDWITSKT 



51 
I 

GGCGTCCGGC 
TCCAGAGCCA 
AGATCCTTCA 
GGTGGCAAGA 
TCATAAACTG 
TATTCTTTGT 
AGATCAAATT 
TCAGGGATGA 
CTGAGAAGCG 
GGCGAGAGTG 
ATGGGACATC 
ATGACATGCT 
CTCAAATGAA 
ACCGGAACGA 
ATGACCTCCG 
GCTATACCGC 
TCCTTGCAGA 
TCTCCTTGAA 
AGGACATGCG 
GCGACGTGGT 
CAGATCACTT 
ATGGCCTGGA 
TTTGCACCAA 
CTCCAGATGT 
TCAAGTCCTT 
CTGCCCCCCT 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



284 



10 
15 
20 
25 



WO 02/086443 

GAGCATGGAG CATGGCACAG TGACCGTGGT GGGCATCCCC 
CAGGAAGAAC TTTTTTGGGA GGGCGTTTGA GAAGGCAGCG 
GCTGCACAAC CATTTTGACC TCTCAGTAAT TGAGCTGAAA 
TCTGGACGCA CTTATTTCCC TCCTGTCCTA GGAATTTGAT 
ATTTATGTAA CTGGCTTTCA TTTAGATTGT AAGTTATGGA 
CCATTTTTTA TTAAATAAAA TQCTTATTTT AGQCTCCGTC 
AAAAAAAAAA AA 

Seq ID NO: 253 Protein sequence: 
Protein Accession #* NP_003495.1 



PCT/US02/12476 



CCAGAGACCG 
GAAAGCACCA 
GCTGAGGATC 
TCTTCCAGAA 
CATGATTTGA 
CCCAAAAAAA 



ACAGCTCGGA 
GCTCCCGGAT 
GGAGCAAGTT 
TGACCTTCTT 
GATGTAGAAG 
AAAAAAAAAA 



1 
I 

MFVSDFRKEF 
AFLEHKEQFH 
QDDDLEVPAY 
RRDILFDYEQ 
DVGVLQRHVS 
KLWSVHGQKR 
FSIHFGPKHK 
KQLRATQQTI 
TKNRRCKLIjP 
HFDLSVIELK 



11 
I 

YEWQSQRVIi 
YFILINCGAN 
EDIFRDEEED 
YEYHGTSSAM 



LQEFLADMGL 
PLASDWFAT 
ASCLCTNLVI 
LVMAAPL.SME 
AEDRSKFLDA 



21 
I 

LFVASDVDAL 
VDLXjDILQPD 
EEH6GNDSDG 
VMFELAWMIjS 
NTLSVDCTRI 
PLKQVKQKFQ 
MSLMESPEKD 
SQGPFLYCSIi 
HGTVTWGIP 
hlSLLS 



31 
I 

CACKILQALF 
EDTIFFVCDT 
SEPSEKRTRL 
KDLNDMLWWA 
SFEYDLRLVL 
AMDISLKENL 
GSGTDHFIQA 
MEGTPDVMLF 



41 
I 

QCDHVQYTLV 
HRPVNWNVY 
EEEIVEQTMR 
IVGLTDQWVQ 
YQHWSLHDSL 
REMI EES ANK 
LDSLSRSNLD 
SRPASLSIiIiS 
FFGRAFEKAA 



51 
I 

PVSGWQELET 
MDTQIKLLIK 
RRQRREWEAR 
DKITQMKYVT 
CNTSYTAARF 
FGMKDMRVQT 
KLYHGLELAK 
KHLL.KSFVCS 
ESTSSRMLHN 



Seq ID NO: 254 DNA sequence 
Nucleic Acid Accession ft* NM_022337 
Coding sequence: 48.. 683 



1620 
1680 
1740 
1800 
I860 
1920 



60 
120 
180 
240 
300 
360 
420 
480 
540 



30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



i 
I 

GGCTGCGCTT 
ACAAGGAGCA 
TCATCAAGCG 
ACTTCGCGCT 
ATATCGCAGG 
GTGCATTTAT 
AAAATGATTT 
TGGCCAACAA 
AGTTCTGCAA 
ACATTGATGA 
TGGAGTCTAT 
GCTCTGGCTG 
TTGTTCCACA 
CACATGTGGC 
OTTCTTTCTA 
TCTGTTACAA 
TTATTTGCTT 
AACTAGCTGT 
AATATATTCT 
GACCTCCATT 
ACAGGTGTGC 
AACTGAATAT 
CTCAAGCTGT 
GCAAGTGAAC 



11 

I 

CCCTGGTCAG 
CCTGTACAAG 
CTACGTGCAC 
CAAGGTGCTC 
TCAAGAAAGA 
TGTCTTCGAT 
GGACTCCAAG 
ATGTGACCAG 
GGAGCACGGT 
AGCCTCCAGA 
TGAGCCGGAC 
TGCCAAATCC 
AATTGTGCCT 
AAGCCAAAGA 
TGCTTTCCTC 
ACTTCTGTCA 
CTTTTAATCA 
CAAGTCAAGG 
CTGATGGCCT 
CTCGGCAGAC 
TATATTGTCC 
TGTATGAAAA 
GGGGCTCCTC 
AATAAAACAT 



21 
I 

GCACGGCACG 
TTGCTGGTGA 
CAGAACTTCT 
CACTGGGACC 
TTTGGAAACA 
GTCACCAGGC 
TTAAGTCTCC 
GGGAAGGATG 
TTCGTAGGAT 
TGCCTGGTGA 
GTCGTGAAGC 
TAGTAGGCAC 
CTATTTTTAC 
TCTATGCCTC 
ACCATCATCA 
TGTAGCTGAC 
GCAAAGGCCT 
ACTGGCTTTC 
GACAGGCCTA 
CTAAGAGTTG 
TTGTCCTAAC 
GACATGCCTC 
TATACATGCT 
TAAAAGATAA 



31 
I 

TCTGGCCGGC 
TTGGCGACCT 
CCTCGCACTA 
CGGAGACTGT 
TGACGAGGGT 
CAGCCACATT 
CTAATGGCAA 
TGCTCATGAA 
GGTTTGAAAC 
AACACATACT 
CCCATCTCAC 
CTTTGCTGGT 
CATTTTGGGT 
TGTTTTTTCA 
CAGTGTTTAC 
CAAAATCCTG 
CAAGTCTTAA 
ACCTTGCCCT 
TTAAGTAGAT 
CCTCTGAGTT 
TGTCACTTGC 
CATATGTGCC 
ATACATGTAA 
AA 



41 
I 

CGCCAGGATG 
GGGCGTGGGG 
CCGGGCCACA 
GGTGCGCCTG 
CTATTACCGA 
TGAAGCAGTG 
ACCGGTTTCA 
CAATGGCCTC 
ATCAGCAAAG 
TGCAAATGAG 
ATCAACCAAG 
GTCTGGTAGG 
AAACGTCAGG 
ATGAGAGAGA 
AAACTTTTGA 
CAGGGCCACA 
AATAAAAGGG 
GGTGTCTTTT 
GTGATATTTT 
AGCTCTTTGG 
CATGGCCTGA 
TTTCTGTTAG 
TATATATTAT 



51 
I 

CAGGCCCCGC 
AAGACCAGTA 
ATCGGCGTGG 
CAGCTCTGGG 
GAAGCTATGG 
GCAAAGTGGA 
GTGGTTTTGT 
AAGATGGACC 
GAAAATATAA 
TGTGACCTAA 
GTTGCCAGCT 
AATGACCTCA 
ATAGATATAC 
AATAGCAAAT 
AAATATTTAG 
GTCGGCACTG 
GAGAAGAACA 
TCCAGATTTC 
CTTCCAAGAT 
AATCGTGAAC 
ATGTTGGCTT 
CTCTCTTTGA 
ATATATTTTT 



Seq ID NO: 255 Protein sequence: 
Protein Accession tt» NP_071732 



11 



21 
I 



31 



41 



51 



GGAACCGCCC 
CCGCTGGCGG 
GGTGTTCGTG 
GAACTTGAGC 
CGTGATGGTC 
CGCCGTGGGC 
GGGCTGGTTC 
CGCTGACTTC 
CCCCATTCAG 
CATTCTCCTT 
TGGCGCCTAC 
CAAGGAGAGA 
CCTGTGGATG 
CCGAQCCGCC 



11 

I 

GCTGCCAGCC 
CTGCCGCTCA 
CGCTACGACT 
GACATGGAGA 
TTCGTGGGCT 
TTCAACTTCC 
CACTTCTTAC 
TGCGTGGCCT 
CTGCTCATCA 
AACCTGCTAA 
TTTGGGCTCA 
CAGAATTCTG 
TACTGGCCCA 
ATCAACACCT 



21 
I 

CGGCCAGGCA 
CCTGCCTGCT 
TCGAGGCCGA 
ACGAATTCTA 
TCGGCTTCCT 
TGTTGGCAGC 
AAGACCGCTA 
CTGTCTGCGT 
TGACTTTCTT 
AGGTGAAGGA 
CAGTGACCCG 
TGTACCAGTC 
GCTTCAACTC 
ACTGCTCCTT 



31 
I 

CCCCTGCAGC 
CCTGCAGGTG 
CGCCCACTGG 
CTATCGCTAC 
CATGACTTTC 
CTTCGGCATC 
CATCGTCGTG 
GGCCTTTGGG 
CCAAGTGACC 
TGCAGGAGGC 
GATCCTCTAC 
GGACCTCTTT 
AGCCATATCC 
GGCAGCCTGC 



41 
I 

ATGGCCTGGA 
ATTATGGTGA 
TGGTCAGAGA 
CCAAGCTTCC 
CTGCAGCGCT 
CAGTGGGCGC 
GGCGTGGAGA 
GCAGTTCTGG 
CTCTTCGCTG 
TCCATGACCA 
CGACGCAACC 
G C CAT GAT TG 
TACCATGGGG 
GTGCTTACCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



MQAPHKEHLY KLLVIGDLGV GKTSIIKRYV HQNFSSHYRA TIGVDFALKV LHWDPETWR 
LQLWDIAGQE RFGNMTRVYY REAMGAFIVF DVTRPATFEA VAKWKNDLDS KLSLPNGKPV 
SWLLANKCD QGKDVLMNNG LKMDQFCKEH GFVGWFETSA KEN IN IDEAS RCLVKHILAN 
ECDLMESIEP DWKPHLTST KVASCSGCAK S 

Seq ID NO: 256 DNA sequence 
Nucleic Acid Accession #: NM_016321 
Coding sequence: 25.. 1464 



60 
120 
180 



51 

1 

ACACCAACCT 
TTCTCTTCGG 
GGACGCACAA 
AGGACGTGCA 
ACGGCTTCAG 
TGCTCATGCA 
ACCTCATCAA 
GTAAAGTCAG 
TGAATGAGTT 
TCCACACATT 
TAGAGCAGAG 
GCACCCTCTT 
ACAGCCAGCA 
CGGTGGCAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



285 



WO 02/086443 

ATCCAGTGCC CTGCACAAGA AGGGCAAGCT GGACATGGTG CACATCCAGA ATGCCACGCT 900 

CGCAGGAGGG GTGGCCGTGG GTACCGCTGC TGAGATGATG CTCATGCCTT ACGGTGCCCT 960 

CATCATCGGC TTCGTCTGCG GCATCATCTC CACCCTGGGT TTTGTATACC TGACCCCATT 3.020 

CCTGGAGTCC CGGCTGCACA TCCAGGACAC ATGTGGCATT AACAATCTGC ATGGCATTCC 1080 

TGG CAT CATA GGCGGCATCG TGGGTGCTGT GACAGCGGCC TCCGCCAGCC TTGAAGTCTA 1140 

TGGAAAAGAA GGGCTTGTCC ATTCCTTTGA CTTTCAAGGT TTCAACGGGG ACTGGACCGC 1200 

AAGAACACAG GGAAAGTTCC AGATTTATGG TCTCTTGGTG ACCCTGGCCA TGGCCCTGAT 1260 

GGGTGGCATC ATTGTGGGGC TCATTTTGAG ATTACCATTC TGGGGACAAC CTTCAGATGA 1320 

GAACTGCTTT GAGGATGCGG TCTACTGGGA GATGCCTGAA GGGAACAGCA CTGTCTACAT 1380 

CCCTGAGGAC CCCACCTTCA AGCCCTCAGG ACCCTCAGTA CCCTCAGTAC CCATGGTGTC 1440 

CCCACTACCC ATGGCTTCCT CGGTACCCTT GGTACCCTAG GCTCCCAGGG CAGGTGAGGA IS 00 

GCAGGCTCCA CAGACTSTCC TGGGGCCCAG AGGAGCTGGT GCTGACCTAG CTAGGGATGC 1560 

AAGAGTGAGC AAGCAGCACC CCCACCTGCT GGCTTGGCCT CAAGGTGCCT CCACCCCTGC 1620 

CCTCCCCTTC ATCCCAGGGG GTCTGMCTGA GAATGGAGAA GGAGAAGCTA CAAAGTGGGC 16 BO 

ATCCAAGCCG GGTTCTGGCT GCAGAAGTTC TGCCTCTGCC TGGGGTCTTG GCCACATTGG 1740 

AGAAAAACAG GCTCAAAGTG GGGCTGGGAC CTGGTGGGTG AACCTGAGCT CTCCCAGGAG 1800 

ACAACTTAGC TGCCAGTCAC CACCTATGAG GCTCTTCTAC CCCGTGCCTG CACCTCGGCC I860 

AGCATCTCCT ATGCTCCCTG GGTCCCCCAG ACCTCTCTGT GTTGTGTGCG TGGCAGCCTC 1920 
CAGGAATAAA CATTCTTGTT GTCCTTTGTA AAAAAAAAAA AAAAAAAA 

Seq ID NO: 257 Protein sequence t 
Protein Accession ft: NP_057405 

1 • 11 21 31 41 51 

I I I I I I 

MAWNTNLRWR LPLTCLLLQV IMVILFGVFV RYDFEADAHW WSERTHKNLS DMENBFYYRY 60 

PSFQDVHVMV FVGFGFLMTF LQRYGFSAVG FNFLIAAPGI QWALLMQGWF HFLQDRYIW 120 

GVENLINADF CVASVCVAFG AVJjGKVSPIQ LLIMTFFQVT LFAVNEFILL NLLKVKDAGG 180 

SMTIHTFGAY FGLTVTRILY RRNLEQSKER QNSVYQSDLF AMIGTLFLWM YWPSFNSAIS 240 

YHGDSQHRAA iNTYCSIiAAC VLTSVAISSA LHKKGKLDMV HIQNATLAGG VAVGTAAEMM 300 

LMPYGAIillG PVCGIISTLG PVYLTPFLES RLHIQDTCGI NNLHGIPGII GGIVGAVTAA 360 

9ASLEVYGKE GLVHSFDFQG FNGDWTARTQ GKFQIYGLIiV TI*AMALMGGI IVGLIIjRLPF 420 
WGQPSDENCF EDAVYWEMPE GNSTVYIPED PTFKPSGPSV PSVPMVSPLP MASSVPLVP 

Seq ID NO: 258 DNA sequence 

NUcleic Acid Accession #: NM_002358.2 

Coding sequence* 75.-692 



1 11 21 

I ) I 

GGGAAGTGCT GTTGGAGCCG CTGTGGTTGC 
TTGTGTCCCT GGCCATGGCG CTGCAGCTCT 
GCGCCGAAAT CGTGGCCGAG TTCTTCTCAT 
GCATATATCC ATCTGAAACC TTTACTCGAG 
CTACTGATCT TGAGCTCATA AAATACCTAA 
TATACAAGTG TTCAGTTCAG AAACTGGTTG 
TCCTGGAAAG ATGGCAGTTT GATATTGAGT 
CCAGAGAAAA GTCTCAGAAA GCTATCCAGG. 
CAGCTACGGT GACATTTCTG CCACTGTTGG 
ATACAGACAA AGATTTGGTT GTACCTGAAA 
CCAAT TCTGA GGAAGTCCGC CTTCGTTCAT 
TGGTGGCCTA CAAAATTCCT GTCAATGACT 
TAATTTTGAA ATGTGGTTTT CCTGAAATCA 
TTGGTTAATT TTTACATGGA GAAAACCAAA 
CCTTTATTTT TTTGGTACCT ATTTGACTTA 
CATTGTTCAA AAGGAACCAG GAGGTTTTTT 
GATAGTAACT GTAGATGGAA AAACTTGTGC 
GTTTTGGTCA AGTAGTTTGA CTCAGTATAG 
AAAGGAAGTC TAAATATTCA GAATCTTTGT 
AACAATGAAA TATTGCTGTA TAGCTCCTTT 
TTGAATCAGT TTCCAATTAT TTGACTTTAA 
ATATTTGTAC TGTTTAATGT TCTGTGATAC 
TTATAAAATC AAGTTTTAAG TGAAAGTGAG 
AAAAAAAAAA 



31 41 51 

i I I 

TGTCCGCGGA GTGGAAGCGC GTGCTTTTGT 60 

CCCGGGAGCA GGGAATCACC CTGCGCGGGA 120 

TCGGCATCAA CAGCATTTTA TATCAGCGTG 180 

TGCAGAAATA CGGACTCACC TTGCTTGTAA 240 

ATAATGTGGT GGAACAACTG AAAGATTGGT 300 

TAGTTATCTC AAATATTGAA AGTGGTGAGG 360 

GTGACAAGAC TGCAAAAGAT GACAGTGCAC 420 

ATGAAATCCG TTCAGTGATC AGACAGATCA 480 

AAGTTTCTTG TTCATTTGAT CTGCTGATTT 540 

AATGGGAAGA GTCGGGACCA CAGTTTATTA 600 

TTACTACTAC AATCCACAAA GTAAATAGCA 660 

GAGGATGACA TGAGGAAAAT AATGTAATTG 720 

GGTCATCTAT AGTTGATATG TTTTATTTCA 780 

ATGATACTTA CTGAACTGTG TGTAATTGTT ' 840 

CCATGGAGTT AACATCATGA ATTTATTGCA 900 

TGTCAACATT GTGATGTATA TTCCTTTGAA 960 

TATAAAGCTA GATGCTTTCC TAAATCAGAT 1020 

GTAGGGAGAT ATTTAAGTAT AAAATACAAC 1080 

TAAGGTCCTG AAAGTAACTC ATAATCTATA 1140 

TGACCTTCAT TTCATGTATA GTTTTCCCTA 1200 

TTTATGTAAC TTGAACCTAT GAAGCAATGG 1260 

AGAACTCTTA AAAATGTTTT TTCATGTGTT 1320 

GAAATAAAGT TAAGTTTGTT TT AAAAAAAA 1380 



Seq ID NO: 259 Protein sequence: 
Protein Accession #: NP_002349.1 

1 XI 21 31 41 51 

MALQLSREQG ITLRGSAEIV AEFFSFGINS ILYQRGIYPS ETFTRVQKYG LTLLVTTDLE 60 
LIKYLNNWE QLKDWLYKCS VQKLVWISN IESGEVLERW QFDIECDKTA KDDSAPREKS 120 
QKAIQDEIRS VIRQITATVT FLPLLEVSCS FDLIi I YTDKD LWPEKWEES GPQFITNSEE 180 
VRLRSFTTTI HKVNSMVAYK IPVND 

Seq ID NO: 260 DNA sequence 
Nucleic Acid Accession #: NM_001211 
Coding sequence: 43.. 3195 

1 ll 21 31 41 51 

| | I I I I 

AAAGGCCTGC AGCAGGACGA GGACCTGAGC CAGGAATGCA GGATGGCGGC GGTGAAGAAG 60 
GAAGGGGGTG CTCTGAGTGA AGCCATGTCC CTGGAGGGAG ATGAATGGGA ACTGAGTAAA 120 
GAAAATGTAC AACCTTTAAG GCAAGGGCGG ATCATGTCCA CGCTTCAGGG AGCACTGGCA 180 
CAAGAATCTG CCTGTAACAA TACTCTTCAG CAGCAGAAAC GGGCATTTGA ATATGAAATT 240 



286 



WO 02/086443 



PCT/US02/12476 



5 

10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



CGATTTTACA 
CAGAACTATC 
GAAGCACTAC 
AAATTAGGGC 
ATTGGTGTTT 
AACTTTAGGA 
GAAAGACTAC 
GCACTTGAGA 
CTAGCTGAAC 
GGTGCTCTCA 
CAAAATAATA 
TTGTCTAAGC 
GAGCTGCAAG 
ACAGCTTCAC 
ACTGCACAAC 
AGCACCAGAA 
CAAGCGTCTG 
GTAGGGGAAT 
CAAAGGGAAG 
GAAGAGATGG 
CAG CAAGAAG 
CAGAAAATAC 
GAAACTTCAC 
CCTTTCTCCA 
GATCCCCCAC 
ATCACCTCAA 
TTGAGCGAGG 
GACACTTGTG 
TCCTTGAAGG 
AAGACCT CTG 
AAGAAGCTGA 
GGTTCTTCTG 
CTAGAACTTA 
CGCAGACAGC 
GACAGACCAA 
TGCATTAAAC 
AACTCTGCAG 
ATCAACCTCA 
CAATATCAAG 
CTTCTCCAAC 
TTGACAATAG 
TGTCTGATTC 
TTGAAGATAG 
CTCAGCGGCT 
TCTCCCTACC 
GAACACCTAC 
CTAAAAGATG 
GCCACAGTGT 
TTCCAAAGTC 

GTATTGTGGA 
CTACCATTGC 
ACAGTGATAT 
CAGACTCATT 
CTTTTCCCAT 
TAAAATAGTA 



CTGGAAATGA 
CTCAAGGTGG 
AAGGAGAAAA 
GTTTATGCAA 
CACTTGCTCA 
AAGCAGATGC 
AGTCCCAGCA 
AAGAAGAAGA 
TAAAGAGCAA 
AGGCTCCAAG 
GTAGAATTAC 
CTACAGTCCA 
CAGGCCCTTG 
TGATAGCTGT 
AGCCAGTTAT 
AG CCTGG AAA 
AGGAGAAGAA 
TCTCCTTTGA 
CCGAGCTATT 
AGAAGAAGCT 
AGACGATGCC 
CAGGAATGAC 
TTGCGGAGAA 
T TTTTGATG A 
GAGTTTTAGC 
ATGAAGATGT 
ATGCCATTAT 
ACTTTGCCAG 
ATCTCCCTTC 
AGGACCAGCA 
GCCCAATTAT 
CCTCGGTTGC 
CTAATGAGAC 
TACTGAAGTC 
TGCCTAAGTT 
GAGAATACCT 
AATTAACAGT 
AGTTAAAGGA 
ATGGCTGTAT 
ACAGTGAATA 
TGGAGATGCT 
T CAG AAA CAG 
TGGACTTTTC 
TTCGGACTGT 
AGGTAGACCT 
AGGTCTTCTG 
GTGAATTGTG 
CTGTTCTTGG 
ACCTGAACAA 
AGTGAGCTAG 
ACACTGAAAC 
TGTTCTACTT 
ACTTACTCAT 
ACAAATGGTT 
TTGTAATTTG 
TCTGTTAAAA 



CCCTCTGGAT 
GAAAGAGAGT 
ACGATATTAT 
TGAGCCTTTG 
GTTCTATAT C 
GATATTTCAG 
CCGACAATTC 
GGAGGAAGTT 
AGGGAAAAAG 
CCAGAACAGA 
TGTTTTTGAT 
G CCATGG ATA 
GAACACAGGC 
ACCCGCTGTG 
GACACCATGT 
GGAAGAAGGA 
AGAGAAGATG 
AGAAATTCGG 
GACCAGTGCA 
AAAAGAAATC 
TACAAAGGAG 
TCTATCCAGT 
CATTTGGCAG 



TCAACGAAGA 
GTCTCCAGAT 
CACAGGCTTC 
AGCAGCTCGT 
TGATCCTGAG 
GACAGCTTGT 
TGAAGACAGT 
AAGCACCTCC 
TTCAGAAAAC 
CCTACCAGAG 
GGAAATTGAG 
AATATGTGAA 
AATAAAGGTA 
ACGTTTAAAT 
TGTTTGGCAC 
TATTACCCAT 
ACACAAAGCA 
AATCCACGAT 
CTACAGTGTT 
ACAGATCCTG 
GTTTGGTATA 
GGATGGGTCC 
GAATAAATTC 
GGAGCTTGCA 
AGCCTTATGG 
GCAATCAAGT 
TGTATGTGCT 
TTTGGTACAG 
GGCCTTGTCT 
ACCTTGTTAT 
TAAAATGTTC 
AAAAAAAAAA 



GTTTGGGATA 
AATATGTCAA 
AGTGATCCTC 
GATATGTACA 
TCATGGGCAG 
GAAGGGATTC 
CAAGCTCGAG 
TTTGAGTCTT 
ACAGCAAGAG 
GGACTCCAAA 
GAAAATG CTG 
GCACCCCCCA 
AGGTCCTTGG 
CTTCCCAGTT 
AAAATTGAAC 
GATCCTCTAC 
ATGTATTGTA 
GCTGAAGTTT 
GAGAAGAGAG 
CAAACTACTC 
ACAACTAAAC 
TCTGTTTGTC 
GAACAACCTC 
TCAGAAAAGA 
CCCCTTGCAG 
GTTTGTGATG 
AGAAATGTAA 
TTTGTATCCA 
AGACTGTTAC 
GGCACTATCT 
CGTGAAGCCA 
TCCATCAAAT 
CCTACTCAGT 
TTAAGTGCCT 
AAGGAAATTG 
GATTACAAGT 
TCTTCTCAAC 
GAAGATTTTG 
CAATATATAA 
GAAATAACAG 
GAAATAGTCC 
CCCTATGATT 
GACCTTAGGG 
GAAGGACAAA 
GCAGATTTAG 
TTCTGGAAAC 
TTTGTGCGGA 
GCAGAAATGA 
AAGGTAGGGA 
CTCACAGATT 
GTAATTTAAT 
GTATATTTTG 
AACTTTTGTG 
TTAACCCATT 
TCTTATGATC 
AAAAAAAAAA 



GGTATATCAG 
CGTTATTAGA 
GATTTCTCAA 
GTTACTTGCA 
AAGAATATGA 
AACAG AAGG C 
TGTCTCGGCA 
CTGT AC CACA 
CTCCAATCAT 
ATCCATTTCC 
ATGAGGCTTC 
TGCCCAGGGC 
AACACAGGCC 
TCACTCCATA 
CTAGTATAAA 
AAAGGGTTCA 
AGGAGAAGAT 
TCCGGAAGAA 
CAGAAATGCA 
AGCAAGAAAG 
TGCAAATTGC 
AAGTAAACTG 
ATTCTAAAGG 
AGAATAAAAG 
TTCTCAAAAC 
AATTTACAGG 
CAATTTGTCC 
CTCCTTTTCA 
CGGAAGAAGA 
ACAGTCAGAC 
CACACTCCTC 
GTCTTCAAAT 
CACCATGGTG 
CTGCAGAGTT 
AATTAGGTAA 
TATTCTGGGT 
CTGTCCCATG 
ATCATTTTTG 
ACTGCTTCAC 
TGTTGATTAT 
ATGGTGACTT 
GTAACAAGAA 
TGCAGCTGGA 
AGATCCTGGC 
CACATTTACT 
TTAGCCAAAA 
TTCTGAATGC 
ATGGGGTTTT 
AGTTAACTAG 
GCTGCCTCAG 
TTAGGACACA 
ACGTCACTGA 
AAGAACTATT 
TGTCTCTACT 
ACCATGTATT 



CTGGACAGAG 
AAGAGCTGTA 
TCTCTGGCTT 
CAACCAAGGG 
AGCTAGAGAA 
TGAACCACTA 
AACTCTGTTG 
ACGAAGCACA 
CCGTGTAGGA 
TCAACAGATG 
TACAGCAGAG 
CAAAGAGAAT 
TCGTGGCAAT 
TGTGGAAGAG 
CCACATCCTA 
GAGCCATCAG 
TTATGCAGGA 
ATTAAAAGAG 
GAAACAGATT 
AACAGGTGAT 
TTCCGAGTCT 
TTGTGCCAGA 
TCCCAGTGTA 
TCCTCCTGCA 
CTCAGAAAGC 
AATTGAACCC 
TAACCCAGAA 
TGAGATAATG 
TCTAGATGTA 
TCTCAGCATC 
TGGCTTCTCT 
TC CTG AG AAA 
TTCACAGTAT 
GTGTATAGAA 
TGAGGATTAC 
GGCGCCAAGA 
GGACTTTTAT 
CAGCTGTTAT 
CCTTCAGGAT 
TTATAACCTT 
GAGTCCAAGG 
CAATCAAGCT 
TGTTTTTACC 
TAACTGTTCT 
ATTGTTCAAG 
TATTTCTGAG 
CAATGATGAG 
TGACACTACA 
TCCTGGGGCT 
AG CAATGGTT 
TTTAGATGCA 
TATTTTTTAT 
TTATTCTAAA 
TTTCCCTGTA 
TTGTAAATAA 



Seq ID MO: 261 Protein sequence: 
Protein Accession fl: NP_001202 



MAAVKKEGGA 
AFEYEIRFYT 
FLNLWLKLGR 
QKAEPLERLQ 
P I I RVGGALK 
PRAKENELQA 
SINHILSTRK 
RKKLKEQREA 
QIASESQKIP 
NKSPPADPPR 
1CPNPEDTCD 
SQTLSIKKLS 
PWCSQYRRQL 
FWVAPRNSAE 
CFTLQDLLQH 
NKNNQALKIV 
HLLLFKEHLQ 
GVFDTTFQSH 



11 
I 

LSEAMSLEGD 
GNDPLDVWDR 
LCNEPLDMYS 
SQHRQFQARV 
APSQNRGLQN 
GPWNTGRSLE 
PGKEEGDPLQ 
ELLTSAEKRA 
GMTLSSSVCQ 
VLAQRRPLAV 
FARAARFVST 
PIIEDSREAT 
LKSLPELSAS 
LTVIKVSSQP 
SEYITHEITV 
DFSYSVDLiRV 
VFWDGSFWKL 
LNKALWKVGK 



21 
I 

EWELSKENVQ 
YISWTEQNYP 
YLHNQGIGVS 
SRQTLLALEK 
PFPQQMQNNS 
HRPRGNTASL 
RVQSHQQASE 
EMQKQIEEMB 
VNCCARETSL 
LKTSESITSN 
PFHEIMSUCD 
HSSGFSGSSA 
AELCIEDRPM 
VPWDFYINLK 
LIIYNLLTIV 
QLDVFTLSGF 
SQNISELKDG 
LTSPGALLFQ 



31 

I 

PLRQGRIMST 
QGGKESNMST 
LAQFYI SWAE 



RITVFDENAD 
IAVPAVLPSF 
EKKEKMMYCK 
KKLKEIQTTQ 
AENIWQEQPH 
EDVSPDVCDE 
LPSDPERLLP 
SVASTSSIKC 
PKIiEXEKEIE 
LKERLNEDFD 
EMLHKAEIVH 
RTVQILEGQK 
ELiWNKFFVRI 



41 
I 

LQGALAQESA 
LLERAVEALQ 
EYEARENFRK 
VPQRSTLAEL 
EASTAELSKP 
TPYVEETAQQ 
EKIYAGVGEF 
QERTGDQQBE 
SKGPSVPFSI 
FTGIEPLSED 
EEDLDVKTSE 
LQIPEKLELT 
LGNEDYC I KR 
HFCSCYQYQD 
GDLSPRCLIL 
ILANCSSPYQ 
LNANDEATVS 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2680 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 



51 
I 

CNNTLQQQKR 
GEKRYYSDPR 
ADAIFQEGIQ 
KS KGKKTARA 
TVQPWIAPPM 
PVMTPCKIEP 
SFEEIRAEVF 
TMPTKETTKL 
FDEFLLSEKK 
AIITGFRNVT 
DQQTACGTIY 
NETSENPTQS 
EYLICBDYKL 
GCIVWHQYIN 
RNRIHDPYDC 
VDLFGIADLA 
VLiGEIiAAEMN 



60 
120 
180 
240 
300 
3 60 
420 
460 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 262 DNA sequence 
Nucleic Acid Accession #: NM_003784 
Coding sequence: 3 6 5.. 150 7 



11 



31 



41 



287 



WO 02/086443 

| I I I I I 

GTCTACTTAT CAATAAGCAG CTGCCTGTGC AGAGTGCAGG CTGCACCTTT GGACAGCCTT 60 

TAAAACTGAA TTCTCAGAAT TTTAGAACAA ATTTTTGTCT AGAAATGCTG ACTTTGGTTC 120 

ATTAGGTAGT GGTAAAACAG GCTCCCTTCG AAGCTCTCCT TCATCACCTT CCTAAGTGCA 180 

TGTACAGGGA AGCTCTCCTT CATCACCTTC CTAAGTGCAT GGGGGAAAAT ACCTAGGGCT 240 

CAACAGTCTT GAGAAGTGTG GAAACATTTT CTTTGTGAGT GAGAACAGAT CACCTAGAGA 300 

AAGGAAACCA GATTCCCATC ACTGCTTCTG GGTATCAGAT GCTAGCGCTG CACTCCATTT 360 

TGCAATGGCC TCCCTTGCTG CAGCAAATGC AGAGTTTTGC TTCAACCTGT TCAGAGAGAT 420 

GGATGACAAT CAAGGAAATG GAAATGTGTT CTTTTCCTCT CTGAGCCTCT TCGCTGCCCT 480 

GGCCCTGGTC CGCTTGGGCG CTCAAGATGA CTCCCTCTCT CAGATTGATA AGTTGCTTCA 54 0 

TGTTAACACT GCCTCAGGAT ATGGAAACTC TTCTAATAGT CAGTCAGGGC TCCAGTCTCA 600 

ACTGAAAAGA GTTTTTTCTG ATATAAATGC ATCCCACAAG GATTATGATC TCAGCATTGT 660 

GAATGGGCTT TTTGCTGAAA AAGTGTATGG CTTTCATAAG GACTACATTG AGTGTGCCGA 720 

AAAATTATAC GATGCCAAAG TGGAGCGAQT TGACTTTACG AATCATTTAG AAGACACTAG 780 

ACGTAATATT AATAAGTGGG TTGAAAATGA AACACATGGC AAAATCAAGA AOGTGATTGG 840 

TGAAGGTGGC ATAAGCTCAT CTGCTGTAAT GGTGCTGGTG AATGCTGTGT ACTTCAAAGG 900 

CAAGTGGCAA TCAGCCTTCA CCAAGAGCGA AACCATAAAT TGCCATTTCA AATCTCCCAA 960 

GTGCTCTGGG AAGGCAGTCG CCATGATGCA TCAGGAACGG AAGTTCAATT TGTCTGTTAT 1020 

TGAGGACCCA TCAATGAAGA TTCTTGAGCT CAGATACAAT GGTGGCATAA ACATGTACGT 1080 

TCTGCTGCCT GAGAATGACC TCTCTGAAAT TGAAAACAAA CTGACCTTTC AGAATCTAAT 1140 

GGAATGGACC AATCCAAGGC GAATGACCTC TAAGTATGTT GAGGTATTTT TTCCTCAGTT 1200 

CAAGATAGAG AAGAATTATG AAATGAAACA ATATTTGAGA GCCCTAGGGC TGAAAGATAT 1260 

CTTTGATGAA TCCAAAGCAG ATCTCTCTGG GATTGCTTCG GGGGGTCGTC TGTATATATC 1320 

AAGGATGATG CACAAATCTT ACATAGAGGT CACTGAGGAG GGCACCGAGG CTACTGCTGC 1380 

CACAGGAAGT AATATTGTAG AAAAGCAACT CCCTCAGTCC ACGCTGTTTA GA GCTGA CCA 1440 

CCCATTCCTA TTTGTTATCA GGAAGGATGA CATCATCTTA TTCAGTGGCA AAGTTTCTTG 1500 

CCCTTGAAAA TCCAATTGGT TTCTGTTATA GCAGTCCCCA CAACATCAAA GRACCACCAC 1560 

AAGTCAATAG ATYTGRGTTT AATTGGAAAA ATGTGGTGTT TCCTTTGAGT TTATTTCTTC 1620 

CTAACATTGG TCAGCAGATG ACACTGGTGA CTTGACCCTT CCTAGACACC TGGTTGATTG 1680 

TCCTGATCCC TGCTCTTAGC ATTCTACCAC CATGTGTCTC ACCCATTTCT AATTTCATTG 1740 

TCTTTCTTCC CACGCTCATT TCTATCATTC TCCCCCATGA CCCGTCTGGA AATTATGGAG 1800 

RGTGCTCAAC TGGTAAGGAG AACGTAGAAG TAGCCCTAGG GATCCTTTTT GAAACTCTAC 1860 

AGTTATCGCA GATATTCTAG CTTCATTGTA AGCAATCTAG GAAATAAGCC CTGCTGCTTT 1920 

CTAGAAATAA GTGTGAAGGA TAAATTTTCT TTGTTGACCT ATGAAGATTT TAGAGTTTAC 1980 

CTTCATATGT TTGATTTTAA ATCAGTGTAT AATCTAGATG GTAAAAAATG TGAAATTGGG 2040 

ATTAGGGACC TACCAAAATA TTTCATTAAT GCTTTCAATT GACAAATTTT GGCCTTTCTT 2100 

TGATAAGACA ATATGTACAT GTTTTTTCAA ATATTAAAGA TCTTTTAACT GTTGGCAGTT 2160 

GTTATCTACA GAATCATATT TCATATGCTG TGTAGTTTAT AAGTTTTTCC TCTATTTATC 2220 
AGAATAAAGA AATACAACAT ACCTGTAAA 



Seq ID NO: 263 Protein sequence: 
Protein Accession #; NP_ 003775 



1 11 21 31 41 51 

I 1 I I I I 

MASLAAANAE FCFNLFREMD DNQGNGNVFF S3LSLPAALA LVRLGAQDDS LSQIDKLLHV 60 

NTASGYGNSS NSQSGLQSQL KRVFSDINAS HKDYDLSIVN GLFAEKVYGP HKDYIECABK 120 

LYDAKVERVD FTNHLEDTRR NINKWVENET HGKIKNVIGB GGISSSAVMV LVNAVYFKGK 180 

WQSAFTKSET INCHFKSPKC SGKAVAMMHQ ERKFNLSVIB DPSMKILELR YNGGINMYVL 240 

LPENDLSEIE NKLTFQNLME WTNPRRMTSK YVEVFFPQFK IEKNYEMKQY LRALGLKD I F 300 

DESKADLSGI ASGGRLYISR MMHKSYIEVT EEGTEATAAT GSNIVEKQLP QSTLFRADHP 360 
FLFVIRKDDI ILFSGKVSCP 



Seq ID NO: 264 DNA sequence 
Nucleic Acid Accession #i AB052 906 
Coding sequence*. 74-814 

1 11 21 31 41 51 

I I I I I I 

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG ' 60 

CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCt 120 

GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180 

CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240 

GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300 

CCTGGGGAAG AAACTAAATG TCACAAOGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360 

GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420 

GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480 

TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540 

AATGTGGACA ACGGTT CAT C CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600 

GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660 

CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720 

CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780 

CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840 

AAG CTGAT AC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900 

CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAG ATCA T GATGACATCA 960 

TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020 

TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080 

TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140 

GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200 

TAG ACTT CAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260 

ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320 
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA A 



Seq ID NO: 265 Protein sequence: 
Protein Accession #: BAB61048.1 



288 



WO 02/086443 

1 11 21 31 41 51 

I I I I 1 I 

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 
FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREWDI LTEQLRDIQL ENYTPKEPLT 
LQARMSCBQK AEGHSSGSWQ PSPDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKWAM 
SPHYFSMGDC IGWLEDFLMG MDSTLBPSAG APLAMSSGTT QLRATATTLI LCCIiLI ILPC 
FILPGI 

Sag ID NO: 266 DNA sequence 

Nucleic Acid Accession ft: XM_084853.1 

Coding sequence: 127-444 

1 11 21 31 41 51 

I I I I I I 

ATTGATGATA TATTTAACGA AATCAAATTT GGTGAATATG TGGACACTGG AAAGCTAATC 
GACAAGATCA ACTTACCAGA TTTCCTAAAA GTGTACCTTA ACCACAAGCC ACCTTTTGGT 
AACACCATGA GTGGCATCCA CAAGAGCTTT GAGGTGCTCG GTTATACCAA CTCCAAAGGG 
AAAAAGGCCA TTCGAAGAGA GGACTTCCTG AGACTGCTCG TTACTAAAGG TGAGCATATG 
ACGGAGGAGG AGATGTTGGA TTGCTTTGCT TCACTGTTTG GCCTGAATCC CGAGGGATGG 
AAATCCGAGC CTGCAACCTG CTCCGTCAAA GGTTCAGAAA TTTGCCTTGA AGAAGAACTT 
CCAGACGAAA TCACTGCAGA AATATTCGCG ACTGAAATTC TTGGCTTAAC CATTTCAGAA 
GATTCCGGCC AGGATGGTCA GTGAAGTTAC CAGGAATGTT TAAAGCACAA AGGACTTTGG 
GTGTGTGTGC ATGCACATGT GTGTGTTTTC • CATGAGGCAC TGCTTTTTAT GCATTTCCCT 
CCCCCCTCTC ATCTTTAGAA CATTTAGACA TTAAAGCAAG TTTCTGGTGA GCAATG 



Seq ID NO: 267 Protein sequence i 
Protein Accession ft: XP_084853.1 

1 11 21 31 41 51 

I I I 1 I I 

MSGIHKSFEV LGYTNSKGKK AIRREDPLRL LVTKGEHMTE EEMLDCFASL FGLNPEGWKS 
EPATCSVKGS EICLEEELPD EITAEIPATE ILGLTISEDS GQDGQ 



Seq ID NO: 268 DNA sequence 
Nucleic Acid Accession ft: NMJ)01898 
Coding sequence: 57-482 



l 11 21 

I I 1 

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG 
CCCAGTATCT GAGTACCCTG CTGCTCCTGC 
GCCCCAAGGA GGAGGATAGG ATAATCCCGG 
AGTGGGTACA GCGTGCCCTT CACTTCGCCA 
ACTACTACAG ACGTCCGCTG CGGGTACTAA 
ATTACTTCTT CGACGTAGAG GTGGGCCGCA 
ACACCTGTGC CTTCCATGAA CAGCCAGAAC 
TCTACGAAGT TCCCTGGGAG AACAGAAGGT 
AGGGATCTGT GCCAGGCCAT TCGCACCAGC 
CCACCCCTGG ACTGGTGGCC CCCACCCTGC 
GACAGACAGA GAAGGCTGCA GGAGTCCTTT 
CTTCCTTCTT GCTTCTAATA GCCCTGGTAC 
AAACAGTAGC ATCGCC 



31 41 51 

I I I 

CTTTGTGCTC TGCCTCTGAG GAGACCATGG 
TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 
GTGGCATCTA TAACGCAGAC CTCAATGATG 
TCAGCGAGTA TAACAAGGCC ACCAAAGATG 
GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 
CCATATGTAC CAAGTCCCAG CCCAACTTGG 
TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 
CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 
CACCACCCAC TCCCACCCCC TGTAGTGCTC 
GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 
GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 
ATGGTACACA CCCCCCCACC TCCTGCAATT 



Seq ID NO: 269 Protein sequence: 
Protein Accession ft :NP_001889 . 1 

1 11 21 

1 1 I 

MAQYLSTLLL LLATLAVALA WSPKEEDRII 
DDYYRRPLRV LRARQQTVGG VNYPPDVEVG 
EIYEVPWENR RSLVKSRCQE S 



31 41 51 

I \ I 

PGGIYNADLN DEWVQRALHF AISBYNKATK 
RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 



Seq ID NO: 270 DNA sequence 
Nucleic Acid Accession #: XM_093210 
Coding sequence: 13-1854 

1 11 21 31 41 51 

I 1 I I I I 

ATGGCAAGCG CCGGAATCTC CTCAGCTGCC GTTTCACAAA AGAGGTACCA GGTCCGCACC 
AAACGAGCAC ACAAGCAGCA CCAGGAGCTG CAGAAGAAGG AGGCGGCAGC GATGGACCAG 
GGCAGAGGGA ATGGGGAGGG GGCATCCTAC CCCATATCTG AGGTGCGACT GCGGGACGTA 
GAGCGGACTG GGCCTTTCCC GTTGGCGCGT GGCCTCAATC AGGACTTCTT GCCCACGTGC 
GCCTTCAAAA CGGTAAGAGC TGCAACTGAA CGTGTGAGAC ATGGTGCAGA TAGGCTGAGA 
GGCGGCGGGA GAGATGCCCA TGAACTCAAG TACCCGGACA CGCCCTCCAC TTCTACCACC 
ACGAGTAACA CCGCCCCCAC GGGACCGCTC TCGAGGTCCC CCAAGCCAAG GACGCAAGGA 
GGAACGCCCC GGCGCGCGGC CAGCAGCGGC GGGCACCGGC CCAATGGCCA CGGAACTCAG 
CACTGGCAGT CGGCCCTCCT CACACCGCAG GCGTGCAGTG TGGCCGACGG AGCCTCCCGG 
GCCGAGGACC CAGCTAGGCC GTCACCCCGG TTGCTCCCAC GGGAAGGGGC ACCAGGCAAA 
CTGCCCAAGG CCCCGAGCCC AGGCTCCCTG GCGGAGGCCT CCGCTGGTCC CGCCCAGATC 
ATGGCCGCCA CCAGGCTCCC GAGCCATGGC TTCCTGTCCG GGAACGGCCC GGCGTCCTGG 
CTGTCCAGCT AG 



Seq ID NO: 271 Protein sequence: 
Protein Accession ft: XP_093210 

1 11 21 31 41 51 



289 



10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 
I I I I I I 

MLRHGEQKRK RARKKWDFLP TCAFKTVRAA TERVRHOADR LRGGGRDAHE LKYPDTPSTS 
TTTSNTAPTG PLSRSPKPRT QGGTPRRRPA AAGTRANGHG TQHWQSALLT PQACSVADGA 
8RAEDPARPS PRLLPREGAP GKI»PKAPSPG SLAEASAGLL AHVRLQNADA QRVSISQALP 
PNSSVGRKEE RPGAGQQRRA PAPMATELST GSRPSSHRRR AVWPTEPPGP RTQLEPSPRL 
LPREGAPGKL PKAPSPGSLA EASAGPAQIM AATRLPSRGP LSGNGPASWI* SS 

Seq ID NO: 272 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence : 1 . . 732 



PCT/US02/12476 



GGATACTGTG 
TGAAAAAGCT 
TAATGTGGAG 
ATACCCACTT 
ATGATTTTGT 
TAAATTATTT 
TTAGTATCAC 
AAAATTGCAG 
TTAAGCC 



11 
1 

TCACTCAAAG 
TTTTTTCCCA 
GAAATTATTC 
GAAGCCTCTG 
CTTGTTTCTG 
TTATTTATCT 
AATTTATGGG 
AAGTCATAGG 



21 
I 

TAATGGGAGG 
CTTTTAACTT 
TTTCTCATTG 
TAGAAATGTC 
CAGTGAGAAA 
TTCATATAGT 
AGAGGGTTTT 
ACTGTCATGT 



3X 
I 

GAGAGAGAAC 
GCTTTAGCGT 
GAGATTACAG 
TCGTCCTCCG 
TTACATCCAT 
TCTTACAATT 
TTGTATTTTT 
ATTGCAGCTC 



41 
I 

AGGGAGGGTA 
TAAGAGTACT 
AATATATCTA 
GTTGTATTTC 
AGCAAAGACA 
TCTAAAAAAT 
AAGCATATGT 
TGAGAACCAA 



51 
I 

GGGATGCTTT 
TACCAGCTAA 
TTCATCTTGA 
TAAAACCTAC 
AAAGTCTTTT 
TAACACTCAT 
GGCTTATATA 
TGCCTGAAAC 



Seq ID NO i 273 Protein sequence: 
Protein Accession ft: Eos sequence 



MGGRENREGR DAPEKAFFPT FNLL 



31 
I 



41 

I 



51 



Seq ID NO: 274 DNA sequence 

Nucleic Acid Accession ft: NM_003976.2 

Coding sequence: 299-961 



CTCTGAGCTT 
CATGGAGTTG 
CTACTTCTGC 
GGGTGGCAGG 
CAGGAGGGTG 
GGAACTTGGA 
TGCCCTGTGG 
GGGCTCCGCG 
CGCCGGCCAC 
GCCGCCGCAG 
CGGGGGCCGC 
GGGCTGCCGC 
CGACGAGCTG 
CGACCTCAGC 
GCCCGTCAGC 
CAACAGCACC 
AGGGCTCGCT 
CCTCCCGCAG 
AGGCCCCTAC 
CAGCCCCAGA 
GGAGCCCTTC 
CCCTCCTCTG 
ACAGCATTTG 
CCTGTACTCA 



11 
I 

CTCTGAGCCT 
TGAAAGAATA 
TGGGTTGAGT 
CCGGTCCCCC 
GGGGAACAGC 
CTTGGAGGCC 
CCCACCCTGG 
CCCCGCAGCC 
CTGCCGGGGG 
CCTTCTCGGC 
GCGGCGCGGG 
CTGCGCTCGC 
GTGCGTTTCC 
CTGGCCAGCC 
CAGCCCTGCT 
TGGAGAACCG 
CCAGGGCTTT 
AGTCCCACTA 
CGGTGGGTGA 
GCCCTCACCC 
GGACCCACTT 
ATGAACACTA 
AAGGACACAT 
CTCATGGGAG 



21 
I 

TGTTTGCTCA 
GCTGCAAAGC 
CTAGCTGTGT 
ACAAAAGATA 
TCAACAATGG 
TCTCCACGCT 
CCGCTCTGGC 
CTGCCCCCCG 
GACGCACGGC 
CCGCGCCCCC 
CTGGGGGCCC 
AGCTGGTGCC 
GCTTCTGCAG 
TACTGGGCGC 
GCCGACCCAC 
TGGACCGCCT 
GCAGACTGGA 
GCCAGCGGCC 
TGGATATCAT 
TGCGGATCCC 
CTCACAGACT 
CAGTGGCTGA 
ATTGCAGTTG 
CTGGCCCC 



31 
I 

TCTGGAAAAA 
ACCTAACACA 
AGGCCCCTTG 
ACTCATCTCT 
CTGATGGGCG 
GTCCCACTGC 
TCTGCTGAGC 
CGAAGGCCCC 



GCCGCCTGCA 
GGGCAGCCGC 
GGTGCGCGCG 
CGGCTCCTGC 
CGGGGCCCTG 
GCGCTACGAA 
CTCCGCCACC 
CCCTTACCGG 
TCAGCCAGGG 
CCCCGAACAG 
AGCCTAAAAG 
CTGGCACTGG 
GGCATCAGCC 
CTTGGTTGAA 



41 

I 

GGGGATTAAA 
TAGTAAGGTT 
TTCCTCACCT 
TAATTTGCAA 
CTCCTGGTGT 
CCCTGGCCTA 
AGCGTCGCAG 
CCGCCTGTCC 
AGTGGAAGAG 
CCCCCATCTG 
GCTCGGGCAG 
CTCGGCCTGG 
CGCCGCGCGC 
CGACCGCCCC 
GCGGTCTCCT 
GCCTGCGGCT 
TGGCTCTTCC 
ACGAAGGCCT 
GTGAAGGGAC 
ACACCAGAGA 
CCAGGCCTCG 
CCCGCCCAGG 
AGTGCCTGTG 



51 
I 

CCATTTACCT 
CCCAGTGCAG 
GGAGAAACTG 
GCTGCCTCAA 
TGATAGAGAT 
GGCGGCAGCC 
AGGCCTCCCT 
TGGCGTCCCC 
CCCGGCGGCC 
CTCTTCCCCG 
CGGGGGCGCG 
GCCACCGCTC 
GCTCTCCACA 
CGGGCTCCCG 
TCATGGACGT 
GCCTGGGCTG 
TGCCTGGGAC 
CAAAGCTGAG 
AACTGACTAG 
CCTCAGCTAT 
AACCTGGGAC 
CCCTGTAGGG 
CTGGAACTGG 



Seq ID NO: 275 Protein sequence: 
Protein Accession ft: NP_003967.l 



41 



51 



1 11 21 31 

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 
RPVSQPCCRP TRYEAVSPMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 276 DNA sequence 

Nucleic Acid Accession ft: NM_057091.1 

Coding sequence: 783-1445 



ACTGGCCGCT 
GGACCCCCAA 
TCGCTCCCCG 
CGCGTGTCTA 
CTCCATATCC 
CAAGCTAGGG 
CGGGGCAGGG 
CACCGGACGG 
CAGACAAGGC 



11 
I 

GAGAGAAGAA 
ATCTGCACGT 
CCCTCACTCA 
CAAACTCAAC 
GAGGGGCCCC 
GGGACTGGAT 
GCGCTCCCAG 
CTGCGGCGGC 
CCGGGGGCTC 



21 

I 

TCGGGTGGAG 
ACCAGCAGTC 
CTTTCTCCCG 
TCCCGGTTTC 
TCCCAGCATC 
CCGACGGGTG 
CCCCACCCCG 
GGGCAGGAGG 
CGCCAGCAGC 



31 
I 

CAGAGAGCAG 
AGCCGCCCCA 
CCCTCGGCCC 
CGTGCCTCTC 
TACCCCCCTC 
GAGCAGCCAG 
GGATCTGGTG 
CTGCTGAGGG 
AGGTCCCTCG 



41 

I 

CTGCTGCAGG 
CGCAGGGACC 
GGCCTCCCAG 
CACCGCTCGA 
CCAACCTCGG 
GTGAGCCCCG 
ACGCTGGGGC 
ATGGAGTTGG 
GGCCCCAGCC 



51 
1 

GCAGACAGCC 
GGCTTACCCC 
CTCTCTACTT 
GTTCTCTACT 
GGGACCTAGC 
AAAGGTGGGG 
TGGAATTTGA 
GCCCGGCCCC 
CTCGCTGCCA 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
114 0 
1200 
1260 
132 0 
1380 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 



290 



\VO 02/086443 

CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600 

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660 

GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720 

TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780 

AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900 

CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960 

CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020 

GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCOGCC TGCACCCCCA TCTGCTCTTC 1080 

CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140 

CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200 

GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260 

CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 

CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380 

ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440 

GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500 

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCT CAAAGC 1560 

TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620 

CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 

CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 

GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800 

AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860 
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO; 277 Protein sequence: 
Protein Accession #: NP_003967.1 
1 11 21 31 

I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL AI»LSSVAEAS 
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP 
RGCRLRSQLV PVRALGLGHR SDEfcVRFRFC SGSCRRARSP 
RPVSQPCCRP TRYBAVSFMD VNSTWRTVDR LSATACGCLG 

Seq ID NOt 278 DNA sequence 
Kucleic Acid Accession #: NM_057160.1 
Coding sequence: 1-714 

1 11 21 31 41 51 

I | I I I I 

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60 

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180 

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300 

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360 

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420 

CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC S40 

AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660 

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840 

TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900 

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960 

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 
TCACTCATGG GAGCTGGCCC C 

Seq ID NO: 279 Protein sequence: 
Protein Accession tt: NP_476501.1 

l 11 21 31 41 51 

I I I - I I I 

MPGIiISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60 

APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120 

RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRPCSGS CRRARSPHDL 180 
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG 

Seq ID NO: 280 DNA sequence 

Nucleic Acid Accession #: NM_057090.1 

Coding sequence: 29-715 

1 11 21 31 41 51 

I I I I I I 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120 

GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180 

CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240 

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 3 60 

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 42 0 

CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480 

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540 

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600 

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 



41 51 
I I 

LGSAPRSPAP REGPPPVLAS 60 
RGGRAARAGG PGSRARAAGA 120 
HDLSLASLLG AGALRPPPGS 180 



291 



5 

10 
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WO 02/086443 

CACCTGGAGA ACCOTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 
CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 
GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 
CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 
CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 
CTTCGOACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 
TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 
TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 
CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 281 Protein sequence t 
Protein Accession #: NP_476431.l ■ 

1 11 21 31 41 SI 

1 I I I I ) 

ME1X3LGGLST LSHCPVJPRRQ APLGLSAQPA LWPTIAALAli L.SSVAEASLG SAPRSPAPRE 
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 
SRARAAGARG CRL.RSQL.VPV RALGLGKRSD ELVRFRFCSG SCRRARSPHD LSLASLLGAG 
ALRPPPGSRP VSQPCCRPTR YEAVS FMDVN STWRTVDRLS ATACGCLG 

Seq ID NO: 262 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 



1 
I 

CTACTGCACC 
ATAGCAATTT 
CCGTGAACCT 
AGGTAACGTT 
GATCTAATCA 
GCTAAAAATG 
TATTCTTCAT 
CACTCTTAAG 
GAATCTCAGG 



11 

I 

TGCCCTCTGT 
CCTTTGGTTT 
TATGAATGCT 
TTCTTTTGCT 
ATGATCTAGT 
AACTTGAAAC 
CCTGGCTGTA 
GATTTTATGT 
ATACCTACAT 



21 
I 

TTCCTTTGGA 
TTAAGACTTC 
GCTTAAAAAT 
CTCATCTTGG 
CAGAAGCTAC 
ACGGAAGTAG 
ATAATAGCCA 
GTATTATTCA 
TTATCACTTT 



31 
I 

AATCTCTTAC 
TACATTGCTT 
AATGTCAAAA 
TTTCCATATA 
TTCACTGGCT 
TGGTTGGTCC 
TTATTTGTTA 
AATTGCTATT 
TTCAATATAT 



41 
I 

CTTTCATTAG 
TTTCTTTTAT 
TATGTTTTAG 
CTATTTTTGG 
AACAGTGATC 
AGTTTGAAAG 
TGCCTTTGTT 
ACTGTTCTTT 
ATGTATTTCT 



Seq ID NO: 283 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence * 564-1481 



1 
I 

GAGACTTTTA 
CAAAGAGGTC 
GAAGTGCTAC 
ACTTGGATCA 
GGACCGATGC 
CTTTCCCTGA 
GCAAACAAGG 
GGGGAGACAC 
AAGGGCTGGC 
CCTCCCGGCC 
GCCTGGCGCG 
ACTTGGCGGG 
CCGGTTTCTC 
CTATGACGGG 
CGCTGAGCTG 
AGTTTGACGA 
TGGATGAAGA 
AGGGGTGCGC 
GCTTACCAGA 
AACGTGACCG 
AAGAAATGGA 
TTCAGAAAAA 
AGGAAAAAGC 
ATCAAGAATC 
AAAACAACAG 
ATGGTTGGAA 
TGGAAAACTT 
AATTCCGTGG 
GAAGAGTAAA 
AGCCAGGAGC 
AATAACATGC 
CAGTCAATAA 
TGGAGTCCTT 
GTTGATCTTG 
ATTTAGCTTG 



11 
I 

ATCATCTATC 
AAAAATTCAC 
CAAAACACGC 
AGTTCCCTCC 
TCACAATTCT 
TTCAGTGGCA 
GACAGGCCCT 
CGTACTAAAC 
TAAGGGAGGC 
TCCACGCGTC 
CGCAGGGCCC 
CCAGATGTGG 
TGCTGACTGC 
CGCACGTGGG 
CAGCAATTCC 
GGACGACGGT 
TGCCCATGAT 
CAGCACTCAG 
AAGCCGCCTG 
GCTGCAACTG 
AGAACGTGAA 
GAATGAGCAA 
AGCAAAGGAA 
GTTAAAGAAA 
CAAGCTGAAA 
AATGCGAAAC 
ACAGGTTTTT 
AAACCAATTC 
AGACCTGTGA 
AATCTTTGCC 
TTTTATCTGG 
CTCAACACTT 
AGAGTTGAGG 
GCATGTTGTT 
TATTAAAAGT 



21 
I 

CCTTGTGCTT 
ATGTGTAGAC 
AATGACTGTC 
CCTCTCCTCA 
GACCTCGTAA 
GGTAACATAT 
CAAAGTTGTC 
AAGCTTGCAA 
CGGGGCGGAG 
GCCATGGCAA 
GCTGGGAAAG 
GCGGCGGGGC 
AGACCCAGGT 
CAGGGGCTGG 
ACCAGGTOGC 
GACGGGGAGG 
TCAGAGGCCA 
GTTGAATCAG 
ACACCATGGG 
AAAGCTCTAG 
AAAAGAAAGA 
AAAAGAAAAG 
CTGGAGAAAG 
AAAAATGCTG 
TACAGGAGAA 
ATAAACCTCG 
ACAGTGGAAA 
ATATGCCACC 
TAAGTCAGCC 
TTGGAACTCT 
AGCTATTTAA 
AATGTGATTA 
AAGATATTTT 
TTGCAGAATA 
ACACTGTAAT 




AGGTGGTGCG 
TGTTGTCTCC 
ATGAGGAAGA 
AAGTGGCGAG 
AAAATAACCA 
AGGTGTGGTT 
AGGAATTAAA 
TAATTGCTGA 
AAAGAGAACA 
AATACTTGCA 
AAGAATGTGA 
AAAGGAAATA 
TCCAGCTGCA 
TTCCTATCCA 
TCCCAAAGAA 
ACACAAGTCA 
GTGCAGAATA 
TTTAAAAATC 
TTGACAAATA 
CTGGATTTTG 
AGTGGCTGAA 
ACCAATAAAA 



41 
I 

CTACAATACA 
CCTTAAGATG 
GTTCTGGGAT 
ACTTGTGCTG 
GGTGGTTTTG 
AAATGAACTG 
CAGGACCCCG 
CCTTCCTGCC 
TCTGCAGGCC 
GTGCTGGCCG 
CCGCGGCTCC 
TACTTTTCCC 
CTCGGACTCC 
CTCGCCGTCG 
CCTTGGCCAC 
CGTGGATGAT 
CCTGAGAGGA 
AGAAGAACAG 
TATTGGCAAA 
TCAACAACTA 
AGAAAAGCAC 
AAAAATTAAT 
AGAAAAAGCA 
GAGGAAGAAG 
GCAGAAAAAA 
AAGAGCTATG 
GAACCAGCCT 
GCTAAGGATC 
TCATCTCTGG 
CAAAGATAGC 
AGAAATTGTT 
GCAATTTTTG 
GTTTTTATAA 
TATGTAAGAA 
CTAACAATTT 



Seq ID NO j 284 Protein sequence: 
Protein Accession #x Bos sequence 

1 11 21 

I I I 

MATRGLCWPG WVGLARAGPA GKARPRRGSA 
PRSRPSSDSC SVPMTGARGQ GLEWRSPSP 
GEDEEDVDDE EDVDEDAHDS EAKVASLRGM 
PWEVWFIGKE KEERDRLQbK ALEELNQQLE 
RKEREQKINK EMEEKAAKEL EKEYLQEKAK 



720 
780 
840 
900 
960 
1020 
1080 
1140 



PCT/US02/12476 



51 
I 

GGTTTCTTTC 
TATCTGTGCT 
CTGCCTACTC 
TTTTTTGTGA 
ATGTTCATGT 
CTCTTATTAG 
ATGTAGCAGA 
TTATAGTTGA 
TATT 



51 

I 

CTAGAGGCTT 
CCAGGCAAAC 
ACACCTGTAA 
AAAGAAATCA 
GTTTCTGCGT 
CAACACCACG 
CCAGTGGCGT • 
ACTGAGGAGG 
CTGACAAAGT 
GGATTGGCCG 
GCCAGTTTGA 
TCTTCCTACG 
TGCTCAGTCC 
CCGCCGCTGC 
CAGAGCTTCC 
GAGGAAGACG 
ATGGAGTTAC 
AAACAGGTGC 
GAAAAAGAAG 
GAAAAAAGAA 
AAGGAATGGG 
AAAGAAATGG 
AAAGAAAAAT 
AAAGAAAAGA 
AGTTTCAAGA 
GTTATGCCAA 
TTTATAATCC 
TATCAGGAAG 
TAATTCATAA 
GTATGTGGAA 
TTTTACTGCT 
CATTTGTATA 
ACTTTTTAAG 
TTGTGTTTGT 
TTCTTG 



31 41 51 

i I I 

SLNLiAGQMWA AGRWGPTPPS SYAGPSADCR 
PLPLSCSNST RSLLSPLGHQ SFQFDEDDGD 
ELQGCASTQV ESENNQEEQK QVRI*PESRLT 
KRKEMEEREK RKI IAEEKHK EWVQKKNEQK 
EKYQEWLKKK NAEECERKKK EKKNNSXLKY 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84 0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



60 
120 
180 
240 
300 



292 



WO 02/086443 

RRKRK 



PCTYUS02/12476 



Seq ID NO: 285 DNA sequence 
Nucleic Acid Accession 8: Eos sequence 
5 Coding sequence: 1-1746 

X 11 21 31 41 51 

I I I I I I 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 

10 GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 12 0 

GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 

GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 3 60 

15 TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420 

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 

GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 

GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 

20 GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 

TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 

CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 

CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 

TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 

25 GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020 

ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 

TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 

CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 

CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260 

30 TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320 

AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 

GGCCAGTCCC T CATT AT CAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 

TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 

35 ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620 

ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 

TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 

TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 

TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC I860 

40 CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 1920 

CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980 

GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2 040 

ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 

AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160 

45 CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220 

ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 

TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340 

CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400 

AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 

50 CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520 

TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 25 80 

AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 

CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 

GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 

55 CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820 

GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 

TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940 

TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000 

AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060 

60 GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120 

TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 

ACTTAGGGGA AQTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 

GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300 

TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCAC ACT GQ 3360 

65 GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 

TCCAGGGTTA TTCTGCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 

CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 

CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600 

AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660 

70 TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCOGA 3720 

TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780 

CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840 

CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCCAGAGGGC 3900 

TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960 

75 AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAQ GGTCTCCAAG GCATTTCCAC 4020 

ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 

CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 

GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 

GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 

80 GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 

GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380 

GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 

CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 

GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 

85 TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620 

GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 

CCTCTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



GTGTTGGAGA 
TGCACAGATA 
GGTAGGAGTG 
GGTGTTCAAT 
TTGTCTTGGG 
TTAGTCTTGG 
GGAAAAAATA 
TGGGCTGTAT 
AACTTTTCAT 
GAACTTCCAA 
AGTTGGTCGA 
GCCCCCAGAT 
GGAAGGAAGC 
CTCCTTCCGC 
CTTCATGCTG 
GCCCCAGTGC 
AGCCCTGGTG 
GGTGTACAGA 



AGAAACAACA 
CTCTTCAAGC 
CCGCCTCTAC 
AGGCTGGGAG 
CTTTCGTCAT 
TCATCAGAAC 
AACTCTTCCA 
GTATATTGTT 
GGACACAATT 
ACTCAGGAAG 
CAGATGTTAG 
CCCACAGTCA 
CATGGCTGTG 
CCCAGGTTTC 
CCTTCAAAGC 
TTGGCGATGC 
GGCAGGGTTG 
ATCAACAATA 



AAAGCCAATT 
ACTGGACGTG 
CCACTTGTGA 
TTTTATTTAT 
TAAACCAAAG 
CTCACTTGGT 
TCCCTTAAAG 
CTTCCTCCTT 
TCCACAACCT 
TTTGCAGAGA 
ATGTATCCTA 
GAACTGAATC 
GTTCAGAGAG 
TTCTTCTCTT 
TAGATCATGT 
ATTTACAGAT 

AATAATATAC 



AGAACCACTA 
GATTCTCTCT 
TGGGGTACAG 
CTCTTCAAAC 
GAAATGGAAG 
ACCATATAGA 
AATAGAATAG 
AGAATTTAGA 
TTCAGATGCT 
GCAGACAGCT 
GCTTTTAGCC 
TGCGTTGTTG 
GGTGGGCTGG 
AAGGAGAGAT 
TTGCCTTGCT 
TTCTAGGCCC 
CTTCTGCTGG 
ATGTAT 



TTTTTAAAAA 
CTAGCCCTCA 
AGGCACTTGC 
TTTGTACAAG 
CCATTCCCCT 
TCAAAAGCTT 
TTTGTCCCTC 
GATACAAGAG 
GATGTAGAGC 
AGAGATAACT 
ATAAACCACT 
GGAAGCCAGC 
CAAGCCACTT 
TGTTCTCACC 
TAGAGAATTA 
TCAGGGTTTT 
ATGCTGCTTG 



GTGCTTACTG 
GCACCCCTGC 
TCTTCTGCAT 
AGCTCATGGC 
GTTGCTCTCC 
TGTAACCACA 
TCATGGGAAT 
TTCTACTTAG 
TATTGGGAAA 
CGGGACCCAG 
CAAAGATTCA 
AGTGGCCTTG 
CCGGGGAAAA 
AACCCGCTGC 
CTGCAAATCA 
GTAGAGTGTG 
TAATCCATTT 



Seq ID NOt 286 Protein sequence: 
Protein Accesaion #: NP_570843.1 



MPLKHYLLLL 
LNTHITELNE 
FQGLDSLESL 
GKNSLTHISP 
FHNNHNLQRL 
YDNHISSLPD 
FRMLANLQNI 
YDNPWRCDSD 
VPSYPETPWY 
IAAIVIGIVA 



11 
I 

VGCQAWGAGL 
SPFLNISALI 
LLSSNQLLQI 
RVFQHLGNLQ 
YLSNNHISQL 
NVFSNLRQLQ 
SLQNNRLRQL 
I LPLRNWLLL 
PDTPSYPDTT 
LACSLAACVG 



21 

1 

AYHGCPSECT 
ALRIEKNELS 
QPAHFSQCSN 
VLRLYENRLT 
PPSIFMQLPQ 
VLILSRNQIS 
PGNIFANVNG 
NQPRLGTDTV 
SVSSTTELTS 
CCCCKKRSQA 



31 
I 

CSRASQVECT 
RITPGAFRNL 
LKELQLHGNH 
DIPMGTFDGL 
LNRLTLFGNS 
FISPGAFNGL 
LMAIQLQNNQ 
PVCFSPANVR 
PVEDYTDLTT 
VLMQMKAPNE 



41 

I 

GARIVAVPTP 
GSLRYIiSLAN 
LEYIPDGAFD 
VNLQELALQQ 
LKELSLGIFG 
TELRELSLHT 
LENLPLGIFD 
GQSLIIINVN 
IQVTDDRSVW 
C 



SI 

1 

LPWNAMSLQI 
NKLQVLPIGL 
HLVGLTKLNL 
NQIGLLSPGL 
PMPNLRELWL 
NAliQDLDGNV 
HLGKLCELRL 
VAVP9VHVPE 
GMTQAQSGLA 



Seq ID NO: 287 DNA sequence 
Nucleic Acid Accession #i NM_002362 
Coding sequence: 1..9S4 



ATGTCTTCTG 
GAGGCCCTGG 
TCCTCCTCCT 
GGTCCTCCCC 
TGGAGGCAAC 
GACGCAGAGT 
CTGCTCCGCA 
ATCAAAAATT 
ATGATCTTTG 
ACCTGCCTGG 
GGCCTTCTGA 
GAAATCTGGG 
GGGGAGCCCA 
CAGGTACCCG 
GAAACCAGCT 
GCCTACCCAT 



11 
I 

AGCAGAAGAG 
GCCTGGTGGG 
CTCCTCTGGT 
AGAGTCCTCA 
CCAATGAGGG 
CCTTGTTCCG 
AGTATCGAGC 
ACAAGCGCTG 
GCATTGACGT 
GCCTTTCCTA 
TAATCGTCCT 
AGGAGCTGGG 
GGAAACTGCT 
GCAGTAATCC 
ATGTGAAAGT 
CCCTGCGTGA 



21 

I 

TCAGCACTGC 
TGCACAGGCT 
CCCTGGCACC 
GGGAGCCTCT 
TTCCAGCAGC 
AGAAGCACTC 
CAAGGAGCTG 
CTTTCCTGTG 
GAAGGAAGTG 
TGATGGCCTG 
GGGCACAATT 
TGTGATGGGG 
CACCCAAGAT 
TGCGCGCTAT 
CCTGGAGCAT 
AGCAGCTTTG 



31 
I 

AAGCCTGAGG 
C CT ACTA CTG 
CTGGAGGAAG 
GCCTTACCCA 
CAAGAAGAGG 
AGTAACAAGG 
GTCACAAAGG 
ATCTTCGGCA 
GACCCCGCCA 
CTGGGTAATA 
GCAATGGAGG 
GTGTATGATG 
TGGGTGCAGG 
GAGTTCCTGT 
GTGGTCAGGG 
TTAGAGGAGG 



Seq ID NO j 288 Protein sequence: 
Protein Accession #: NP_002353.1 



1 

! 

MSSEQKSQHC 
GPPQSPQGAS 
LLRKYRAKEL 
TCLGLSYDGL 
GEPRKIiLTQD 
AYPSIiREAAL 



11 
I 

KPEEGVEAQE 
ALPTTISFTC 
VTKAEMLERV 
LGNNQIPPKT 
WVQENYLEYR 
LEEBEGV 



21 
I 

EALGLVGAQA 
WRQPNEGSSS 
IKNYKRCFPV 
GLLIIVLGTI 
QVPGSNPARY 



31 

I 

PTTEEQEAAV 



IFGKASESIiK 
AMEGDSASEB 
EFLWGPRALA 



41 

I 

AAGGCGTTGA 
AGGAGCAGGA 
TGCCTGCTGC 
CTACCATCAG 
AGGGGCCAAG 
TGGATGAGTT 
CAGAAATGCT 
AAGCCTCCGA 
GCAACACCTA 
ATCAGATCTT 
GCGACAGCGC 
GGAGGGAGCA 
AAAACTACCT 
GGGGTCCAAG 
TCAATGCAAG 
AAGAGGGAGT 



41 

I 

SSSSPLVPGT 
DASSLFREAL 
MIFGIDVKEV 
EIWEELGVMG 
ETSYVKVLEH 



Seq ID NO: 289 DNA sequence 
Nucleic Acid Accession #: NM_002362 
Coding sequence: 46.. 1344 



1 
I 

CGGCGGCCGC 
GGCGACCTGA 
CATCAGCGCG 
CTCAACAGAC 
TTGACCAGAA 
CAGCCCATCG 
GGCCCCAGCA 
GTTCTACCTG 
AAATCCCATC 



11 

I 

GCCCTGGTTG 
AGCAGGCGCT 
GCAGCAGCAC 
ATAATATTGT 
ATGTGCAGTC 
ATTTGAGTGC 
GTGAAAATCT 
CAGCTGAATT 
TCCTCGATTA 




TGCAAAGAAA 
GTTTGGTGAT 
TGTGTCTATT 
ATGCACTGTT 
GGAGGAAGAG 
CCATGGGCTT 
TGTGATGACA 



31 

I 

GCTCTCGGGG 
GCCGAGTCGC 
GAAGACATAA 
TACACATGGA 
ATTGACACAG 
GCACTTCACA 
ACAGAAAACA 
TGGGACAGCT 
ACTTTACTGT 



41 

I 

GCGCCATGGA 
CAACGGTCCA 
ACCTGAGTGT 
CTGAGTTTGA 
AATTAAAGGT 
TTTTCCAGCT 
TAATTGCAGC 
TGGTATACGA 
TTTCAGACAA 



51 

1 • 

CGAGGCCGTG 
CGTGGAGGTG 
TAGAAAGCTA 
TGAACCTTTT 
TAAAGACTCA 
GAATGAAGAT 
AAATCACTGG 
TGTGGAAGTC 
GAACGTCAAC 



4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 



51 
I 

GGCCCAAGAA 
GGCTGCTGTC 
TGAGTCAGCA 
CTTCACTTGC 
CACCTCGCCT 
GGCTCATTTT 
GGAGAGAGTC 
GTCCCTGAAG 
CACCCTTGTC 
TCCCAAGACA 
CTCTGAGGAG 
CACTGTCTAT 
GG AG T ACCGG 
GGCTCTGGCT 
AGTTCGCATT 
CTGA 



51 
I 

LEEVPAAESA 
SNKVDELAHF 
DPASNTYTLV 
VYDGREHTVY 
WRVNARVRI 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 



60 
120 
160 
240 
300 
360 
420 
480 
540 



294 



WO 02/086443 

AGCAACCTCA TCACCTGGAA CCGGGTGGTG CTQCTCCACG GTCCTCCTGG CACTGGAAAA 600 

ACATCCCTGT GTAAAGCGTT AGCCCAGAAA TTGACAATTA GACTTTCAAG CAGGTACCGA 660 

TATGGCCAAT TAATTGAAAT AAACAGCCAC AGCCTCTTTT CTAAGTGGTT TTCGGAAAGT 720 

GGCAAGCTGG TAACCAAGAT GTTTCAGAAG ATTCAGGATT TGATTGATGA TAAAGACGCC 780 

CTGGTGTTCG TGCTGATTGA TGAGGTGGAG AGTCTCACAG CCGCCCGAAA TGCCTGCAGG 840 

GCGGGCACCG AGCCATCAGA TGCCATCCGC GTGGTCAATG CTGTCTTGAC CCAAATTGAT 900 

CAGATTAAAA GGCATTCCAA TGTTGTGATT CTGACCACTT CTAACATCAC CGAGAAGATC 960 

GACGTGGCCT TCGTGGACAG GGCTGACATC AAGCAGTACA TTGGGCCACC CTCTGCAGCA 1020 

GCCATCTTCA AAATCTACCT CTCTTGTTTG GAAGAACTGA TGAAGTGTCA GATCATATAC 1080 

CCTCGCCAGC AGCTGCTGAC CCTCCGAGAG CTAGAGATGA TTGGCTTCAT TGAAAACAAC 1140 

GTGTCAAAAT TGAGCCTTCT TTTGAATGAC ATTTCAAGGA AGAGCGAGGG CCTCAGCGGC 1200 

CGGGTCCTGA GAAAACTCCC CTTTCTGGCT CATGCGCTGT ATGTCCAGGC CCCCACCGTC 1260 

ACCATAGAGG GGTTCCTCCA GGCCCTGTCT CTGGCAGTGG ACAAGCAGTT TGAAGAGAGA 1320 

AAGAAGCTTG CAGCTTACAT CTGATCCTGG GCTTCCCCAT CTGGTGCTTT TCCCATGGAG 1380 

AACACACAAC CAGTAAGTGA GGTTGCCCCA CACAGCCGTC TCCCAGGGAA TCCCTTCTGC 1440 

AAACCAAACG TTACTTAGAC TGCAAGCTAG AAAGCCACCA AGGCCAGGCT TTGTTAAAAG 1500 

AAGTGTATTC TATTTATGTT GTTTTAAAAT GCATACTGAG AGACAAACAT CTTGTCATTT 1560 

TCACTGTTTG TAAAAGATAA TTCAGATTGT TTGTCTCCTT GTGAAGAACC ATCGAAACCT 1620 

GT TTGTTCCC AGCCCACCCC CAGTGGATGG GATGCATAAT GCCAGCAAGT TTTGTTTAAC 1680 

AGCAAAAAAG GAAGATTAAT GCAGGTGTTA TAGAAGCCAG AAGAGAAACT GTGTCACCCT 1740 

AAAGAAGCAT ATAATCATAG CATTAAAAAT GCACACATTA CTCCAGGTGG AAGGTGGCAA 1800 

TTGCTTTCTG ATATCAGCTC GTTTGATTTA GTGCAAAAAT GTTTTCAAGA CTATTTAATG 1860 

GATGTAAAAA AGCCTATTTC TACATTATAC CAACTGAGAA AAAAATGGTC GGTAAAGTGT 1920 

TCTTTCATAA TAAATAATCA AGACATGGTC CCATTTGCAG GAAAAGTGCA GACTCTGAGT 1980 

GTTCCAGGGA AACACATGCT GGACATCCCT TGTAACCCGG TATGGGCGCC CCTGCATTGC 2040 

TGGGATGTTT CTGCCCAOGG TTTTGTTTGT GCAATAACGT TATCACATTT CTAATGAGGA 2100 

TTCACATTAA TATAATATAA AATAAATAGG TCAGTTACTG GTCTCTTTCT GCCGAATGTT 2160 
ATGTTTTGCT TTTATCTCAC AGTAAAATAA ATATAATTAA AAA 



Seq ID NO: 290 Protein sequence: 
Protein Accession #: NP_004228 

1 11 21 31 41 51 

I I I I I I 

MDEAVGDLKQ ALPCVAESPT VHVEVHQRGS STAKKEDINL SVRKLLNRHN IVFGDYTWTE 60 
PDEPFLTRNV QSVSIIDTEL KVKDSQPIDL SACTVALHIP QLNEDGPSSE NLEEETENII 120 
AANHWVLPAA EFHGLWDSLV YDVEVKSHLL DYVMTTLLFS DKNVNSNLIT WNRWLLHGP 180 
PGTGKTSLCK ALAQKLTIRL SSRYRYGQLI EINSHSLFSK WFSESGKLVT KMFQKIQDLI 240 
DDKDALVFVL IDEVESLTAA RNACRAGTEP SDAIRWNAV LTQIDQIKRH SNWILTTSN 300 
ITEKIDVAPV DRADIKQYIG PPSAAAIPKI YLSCLEELMK CQI1YPRQQL LTLRELEMIG 360 
FIENNVSKLS LLLNDISRKS EGLSGRVLRK LPPLAHAIiYV QAPTVTIEGP LQALSLAVDK 420 
QFEERKXLAA YI 



Seq ID NO: 291 DNA sequence 

Nucleic Acid Accession #i NM_002658.1 

Coding sequence: 77-1372 

1 11 21 31 41 51 

I I I I I I 

GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAG CGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240 

GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 300 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360 

CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840 

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 

TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140 

CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAQATGGAT TTGCCTGTGG 1500 

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 

CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 

AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860 

ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 

CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 
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ATCAATAAAA T6TGATTTTT CTOA 

Seq ID NO: 292 Protein sequence: 
Protein Accession #:NP_002649.1 



PCT/US02/12476 



i 
1 

MRALLARIiLL 
HCEIDKSKTC 
YCRNPDNRRR 
GGEFTTIENQ 
RSRLNSNTQG 
PSMYNDPQFG 
CAADPQWKTD 
SHTKEENGLA 



11 

1 

CVLWSDSKG 
YEQNQHFYRQ 
PWCYVQVGLK 
PWFAAIYRRH 
EMKPEVENLI 
TSCEITGFGK 
SCQGDSGGPL 
L 



21 
I 



KASTDTMGRP 
PLVQECMVHD 
RGGSVTYVCG 
LHKDYSADTL 
ENSTDYLYPB 
VCSLQGRMTL 



31 
I 

CDCLNGGTCV 
CLPWNSATVL 
CADGKKPSSP 
GSLISPCWV1 
AHHNDIALLK 
QLKMTWKIiI 
TGIVSWGRGC 



41 
I 

SNKYFSNIHW 
QQTYHAHRSD 
PEELKFQCGQ 
SATHCFIDYP 
IRSKEGRCAQ 
SHRECQQPHY 
ALKDKPGVYT 



SI 
I 

CNCPKKFGGQ 
ALQLGLGKHN 
KTLRPRFKII 
KKEDYIVYLG 
PSRTIQTICL 
YGSEVTTKML 
RVSHFLPWIR 



Seq ID NO: 293 DNA sequence 
Nucleic Acid Accession #: NMJ>01498 
Coding sequence: 93.. 2006 



1 
I 

GGCACGAGGC 
AGGAGGAGGA 
TGAGCTGGGA 
TCCTGCACAT 
AGGTGGAATA 
CTGGGGAGAA 
CTACCCTTTG 
ACGGAGGAAC 
AGGCTACTTC 
TAGGCTGTCC 
CCAAGTCCCT 
CAAGAAATAT 
ACAAGAATAC 
CTTCTAAGCC 
TCCAGGTGAC 
CTACTATCTG 
TGTCAGACAT 
AGGAGCGAGG 
ACTCAATAGA 
TAGATAAAGA 
ATGTTGCTCA 
ATGATGCTAA 
GATTTAAGCC 
AGGTGCAATT 
GAGTGATCCT 
TGAAGGTAGC 
TTTGCAAAGG 
TCGCTGCAGA 
TGTTTCCTGG 
ACACCAGATG 
TAATGACAGT 
ACAGTGTCAT 
ATGAATTATG 
GAAGTAAAAC 
ACTGGCTACA 
CTGTACTGTT 
TCTAGAGTTT 
AATAACATAT 
CAACCTACTG 
GAGTTAAAAT 
TGAAATTTTT 
TGTCTACATT 
GCACCTTATC 
GCTTTCCCCT 



11 
I 

TGAGTGTCCG 
GGAGGAGGAG 
GGAAACCAAG 
CTACCACGCC 
CATGTTGGTA 
AGTTCTTGAA 
GAGACCAGAG 
AATGTCCGAG 
TATATTAGAA 
TGGGTTCACA 
CTTCTTTCCA 
CCGACATAGG 
ACCATCTCCA 
GGATCATATT 
ATTCCAAGCC 
TCCAATTGTT 
TGATTGTCGC 
ACTGGAGCCA 
CAGCTATTTA 
GATCTACGAA 
TCTCTTTATT 
TGAGTCTGAC 
CCCTCCTCCA 
AACAGACTTT 
TTCCTACAAA 
ACAGAAAAGA 
TGGCAATGCA 
GGAGTACACC 
ACTGATCCCA 
TAGTATTCTG 
TGCCAGATGG 
AACTGATGAA 
TGAATGCCCA 
TGACTCATCC 
GTACCATGCC 
TTCTGGGCCA 
ATACAGTGTA 
CTAAAGTCAT 
TCTAAGCAGT 
GTTTACTGTA 
CTCTTTAAAA 
AAATCACTTG 
TATGATGTTT 
CTGAATAAAT 



21 

I 

TCTCGCGCCC 
GAGGGGGCGG 
CGCCATGCCG 
GTCAAGGACC 
TCTTTTGATC 
ACTCTGCAAG 
TATGGGAGTT 
TTCAATACAG 
GAAAATCAGG 
CTGCCCGAGG 
GATGAAGCAA 
AGAGGAGAAA 
TTTATAGAAA 
TACATGGATG 
TGCAGTATAT 
ATGGCTTTGA 
TGGGGAGTGA 
TTGAAGAACA 
TCTAAGTGTG 
CAGCTGTTGC 
AGAGACCCAC 
CATTTTGAGA 
AACTCAGACA 
GAGAACTCTG 
TTGGATTTTC 
GATGCTGTCT 
GTGGTGGATG 
CTCATGAGCA 
ATTCTGAACT 
AACTACCTAA 
ATGAGGGAGT 
ATGAATTATA 
GAGTTACTTG 
AACTAGACAT 
TCTCAGCCCG 
GTGAGCCAGA 
CATGTACATA 
CATGAACTGG 
TTTGTAAATG 
AATTTTTGTT 
ACATTTTCTC 
AATCCATTGA 
CTTTTGCAAT 
ACCCATTGAA 



31 

I 

GGAAGCGGGC 
CCATGGGGCT 
ACCACGTGCG 
GGCACAAGGA 
ATGAAAATAA 
AGAAGGGGGA 
ACATGATTGA 
TTGAGGCCAA 
CTCTTTGCAC 
TCAAACCCAA 
TAAACAAGCA 
AGGTTGTCAT 
CATTTACTGA 
CCATGGGATT 
CTGAGGCCAG 
GTGCTGCATC 
TTTCTGCATC 
ATAACTATAG 
GTGAGAAATA 
AGGAAGGCAT 
TGACACTGTT 
ATATTCAGTC 
TTGGATGGAG 
CCTATGTGGT 
TCATTCCACT 
TGCAGGGAAT 
GTTGTGGCAA 
TAGACACCAT 
CTTACCTTGA 
AGCTAATTAA 
TTATCGCAAA 
GCCTTATTTT 
GATCAGCATT 
TCTACAGAAA 
TGTGTATAAT 
AATTGATTAA 
GTAAAGTATT 
CTTGTACATT 
TACTGGTAAT 
CTTTTAAAGA 
TCGTTAATTT 
AAGTGCTTCA 
TGGAATAATC 
CTCTGAAAAA 



41 

I 

GACCGCCGTC 
GCTGTCCCAG 
GCGGCACGGG 
CGTTCTCAAG 
AAAAGTCCGG 
AAGGACAAAC 
AGGGACACCA 
CATGCGAAAA 
AATAACTTCA 
CCCAGTGGAA 
CCCTCGCTTC 
CAATGTACCA 
GGATGATGAA 
TGGAATGGGC 
ATACCTTTAT 
TCCCTTTTAC 
TGTAGATGAT 
GATCAGTAAA 
TAATGACATC 
TGATCATCTC 
TGAAGAGAAA 
CACAAATTGG 
AGTAGAATTT 
GTTTGTGGTA 
GTCAAAGGTT 
GTTTTATTTC 
GGCCCAGAAC 
CATCAATGGG 
AAACATGGAA 
GAAGAGAGCA 
CCATCCTGAC 
GAAGTGTAAC 
TAGGAAAGTA 
GAAAAATGCA 
ATGAAGACCA 
GGCTTTCTTT 
TTTGATTAAC 
TTTAAATTCT 
TGTACAATAC 
CTACCTGGGA 
TCCTTTGTCA 
AGGGTAATCT 
ACTTGGTCAC 
AAAAAAAAAA 



51 
I 

AGCCCGGAGG 
GGCTCGCCGC 
ATCCTCCAGT 
TGGGGCGATG 
TTGGTCCTGT 
CCAAACCATC 
GGACAGCCCT 
CGCCGGAAGG 
TTTCCCAGAT 
GGAGGAGCTT 
AGTACCTTAA 
ATATTTAAGG 
GCTTCAAGGG 
AATTGCTGTC 
GATCAGTTGG 
CGAGGCTATG 
AGAACTCGGG 
TCCCGATATG 
GACTTGACGA 
CTGGCCCAGC 
ATACACCTGG 
CAGACAATGA 
CGACCCATGG 
CTGCTCACCA 
GATGAGAACA 
AGGAAAGATA 
AGCACGGAGC 
AAGGAAGGTG 
GTGGATGTGG 
TCTGGAGAAC 
TACAAGCAAG 
CAAATTGCAA 
AAATATAGTG 
TTATTGACGA 
AATGATAGAA 
GGTAGGTAAA 
AATGTATTTT 
TACTCTGGAG 
TTGCATTCCA 
CCTGATTTAT 
TTTCCTTTGT 
TGGGTTT CT A 
CTTGCCCCAA 
AAAA 



60 
120 
180 
240 
300 
360 
420 



60 
120 

ieo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 



Seq ID NO: 294 Protein sequence: 
Protein Accession #: NP_001489 



75 
80 
85 



1 

I 

MGLLSQGSPL 
ENKKVRLVLS 
EANMRKRRKE 
NKHPRFSTLT 
MGFGMGNCCL 
SASVDDRTRE 
EGIDHLIAQH 
GWRVEFRPME 
QGMFYFRKDI 
YLENMEVDVD 
LILKCNQIAN 



11 
I 

SWEETKRHAD 
GEKVLETLQE 
ATSILEENQA 
RNIRHRRGEK 
QVTFQACSIS 
ERGLEPLKNN 
VAHLFIRDPL 
VQLTDFENSA 
CKGGNAWDG 
TRCSIIiNYUC 
ELCECPELLG 



21 

I 

HVRRHGILQF 
KGERTNPNHP 
LCTITSFPRL 
WINVPIFKD 
EARYLYDQIiA 
NYRISKSRYD 
TLFEEKIHLD 
YWFWLLTR 
CGKAQNSTEL 
LIKKRASGEL 
SAFRKVKYSG 



31 
I 

LHIYHAVKDR 
TtiWRE EYGSY 
GCPGFTIiPEV 
KNTPSPFIET 
TICPIVMALS 
SIDSYLSKCG 
DANE SDH FEN 
VILSYKLDFIt 
AAEEYTLMSI 
MTVARWMREF 
SKTDSSN 



41 
I 

HKDVLKWGDE 
MIEGTPGQPY 
KPNPVEGGAS 
FTEDDEASRA 
AASPFYRGYV 
EKYNDIDLTI 
IQSTNWQTMR 
IPLSKVDENM 
DTIINGKEGV 
IANHPDYKQD 



51 

I 

VEYMLVSFDH 
GGTMSEPNTV 
KSLFFPDEAI 
SKPDKIYMDA 
SDIDCRWGVI 
DKEIYEQLLQ 
FKPPPPNSDI 
KVAQKRDAVL 
FPGLIPILNS 
SVITDEMNYS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



296 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

Seq ID NOi 295 DNA sequence 

Nucleic Acid Accession #: Bos sequence 

Coding sequence: 247-816 



PCT/US02/12476 



AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
QAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
ATGACAGGAC 
TTCAGCAAAG 
ACCAGCAGTT 
CAACGAQAAA 
CAAAAATATG 
AAGCGATTTT 
AAGCACCTTA 
CACACCCCAA 
TTCTACAATG 
CTTCCAGAGG 
TTGAAAGGAT 
TAACCCATTA 



11 

I 

TGGGG CAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
ATGCTATTCC 
ATAGGATGAT 
TCTCTGGAGA 
TTAATGCTGA 
AAAAAATCTT 
TTGAATCCAT 
AGAAGAAACT 
ATGCATAATC 
GAGCAGGATA 
CTAAGAAATT 
AACTTGTGTT 
GGTAAATACT 



21 

I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
ACCCAGCCAA 
GCAGAAACCT 
TGACCTAGAA 
TATAAAACGT 
CGAAATGCTT 
CATCAAGGAA 
GAAAOGTATG 
TCGTTAATGA 
TTGCTGAAGT 
TCTGTTAGTA 
TTGGTTATTT 
ATTACAGTCG 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TTGGATTCTC 
GGTAGCAATG 
TGCAGAGAAA 
AAATTAGTGA 
GAAGGAGTGC 
GCAGCAAGAT 
ATTTGAGAAT 
TTGAGGAGAG 
CTCCTGGCAT 
AAAGATGTTC 
TGTATTCCCA 



41 

I 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGA 
AGATTGATGA 
CACCTGTGGG 
CAGCCTCCTC 
AGGAACTCCG 
AAGGACCTAC 
GTATGAGACG 
ACTTGTCCCT 
AAAAGGATCA 
ATGTTACCGA 
TTTTTCCCAA 
CCTGTGCTGG 
A 



SI 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAAGCTT 
CTTCACTGGT 
AGGAAACGTT 
TCCCAAAAGC 
ATGCGTTGGA 
TGCAGTCAGG 
AGACTTTGTT 
GGAGGATTAT 
GATTGCTGTT 
ATCAAATAGC 
AGCATTTTAT 
TAGATATTAT 



Seq ID NO: 296 Protein sequence: 
Protein Accession #: Eos sequence 

11 



51 



1 11 21 31 41 

I I I I I 1 

MTDKTEKVAV DPETVPKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 
GHAIPPSQLD SQIDDPTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQR 
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
LKKKLKRMI 

Seq ID NO: 2 97 DNA sequence 

Nucleic Acid Accession #i Eos sequence 

Coding sequence: 247-815 



AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
ATGACAGGAC 
TTCAGCAAAG 
ACCAGCAGTT 
CAACAAGAAA 
CAAAAATATG 
AAACGATTTT 
AAGCACCTTA 
CACACCCCAA 
TTCTACAATG 
CTTCCAGAGG 
TTGAAAGGAT 
TAACCCATTA 



11 
I 

TGGGG CAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
ATGCTATTCC 
ATAGGATGAT 
TCTCTGGAGA 
TTAATGCTGA 
AAAAAATCTT 
TTGAATCCAT 
AGAAGAAACT 
ATGCATAATC 
GAGCAGGATA 
CTAAGAAATT 
AACTTGTGTT 
GGTAAATACT 



21 
I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
ACCCAGCCAA 
GCAGAAACCT 
TGACCTAGAA 
TATAAAACGT 
CGAAATGCTT 
CATCAAGGAA 
GAAAOGTATG 
TCATTAATGA 
TTGCTGAAGT 
TCTGTTAGTA 
TTGGTTATTT 
ATTACAGTCG 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TTGGATTCTC 
GGTAGCAATG 
TGCAGAGAAA 
AAATTAGTGA 
GAAGGAGTGC 
GCAGCAAGAT 
ATTTGAGAAT 
TTGAGGAGAG 
CTCCTGGCAT 
AAAGATGTTC 
TGTATTCCCA 
TGGTTT CTGC 



41 
I 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGA 
AGATTGATGA 
CACCTGTGGG 
CAGCCTCCTC 
AGGAACTCCG 
AAGGACCTAC 
GTATGAGACG 
ACTTGTCCCT 
AAAAGGATCA 
ATGTTACCGA 
TTTTTCCCAA 
CCTGTGCTGG 
A 



51 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAAGCTT 
CTTCACTGGT 
AGGAAACGTT 
TCCCAAAAGC 
ATGCGTTGGA 
TGCAGTCAGG 
AGACTTTGTT 
GGAGGATTAT 
GATTGCTGTT 
ATCAACTGGC 
AGCGTTTTAT 
TAGATATTAT 



Seq ID NO: 2 98 Protein sequence: 
Protein Accession #» Eos sequence 



51 



1 11 21 31 41 

I I I I I I 

MTDKTEKVAV DPETVPKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 
GHAIPPSQLD SQIDDPTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
LKKKLKRMI 

Seq ID NO: 2 99 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-615 



AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
TGACAGGACA 



11 
I 

TGGGG CAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
TGCTATTCCA 



21 
I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
CCCAGCCAAT 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TGGATTCTCA 



41 
I 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGC 
GATTGATGAC 



51 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAGCTTA 
TTCACTGGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 



297 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

TCAGCAAAGA TAGGATQATG CAGAAACCTG GTAGCAATGC 
CCAG CAGTTT CTCTGGAGAT GACCTAGAAT GCAGAGAAAC 
AACAAGAAAT TAATGCTGAT ATAAAACGTA AATTAGTGAA 
AAAAATATGA AAAAATCTTC GAAATGCTTG AAGGAGTGCA 
AACGATTTTT TGAATCCATC ATCAAGGAAG CAGCAAGATG 
AGCACCTTAA GAAGAAACTG AAACGTATGA TTTGAGAATA 
ACACCCCAAA TGCATAATCT CATTAATGAT TGAGGAGAGA 
TCTACAATGG AGCAGGATAT TGCTGAAGTC TCCTGGCATA 
TTCCAGAGGC TAAGAAATTT CTGTTAGTAA AAGATGTTCT 
TGAAAGGATA ACTTGTGTTT TGGTTATTTT GTATTCCCAC 
AACCCATTAG GTAAATACTA TTACAGTCGT GGTTTCTGCA 

Seg ZD NO i 300 Protein sequence t 
Protein Accession #» Eos sequence 



ACCTGTGGGA 
AGCCTCCTCT 
GGAACTCCGA 
AGGACCTACT 
TATGAGACGA 
CTTGTCCCTG 
AAAGGATCAG 
TGTTACCGAA 
TTTTCCCAAA 
CTGTGCTGGT 



GGAAACGTTA 
CCCAAAAGCC 
TGCGTTGGAC 
GCAGTCAGGA 
GACTTTGTTA 
GAGGATTATC 
ATTGCTGTTT 
TCAACTGGCC 
GCGTTTTATT 
AGATATTATT 



11 



21 



51 



31 41 

I I i 

MTDKTEKVAV DPETVPKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT 
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDUSCR ETASSPKSQQ 
EINADIKRKL VKELRCVGQK YEKI FEMLEG VQGPTAVRKR FPESIIKEAA RCMRRDFVKH 
LKKKLKRMI 

Seq ID NO: 301 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 247-812 



1 
I 

AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
TGACAGGACA 
TCAGCAAAGA 
CCAGCAATTT 
AACAAGAAAT 
AATATGAAAA 
GATTTTTTGA 
ACCTTAAGAA 
CCCCAAATGC 
ACAATGGAGC 
CAGAGGCTAA 
AAGGATAACT 
CCATTAGGTA 



11 

I 

TGGGGCAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
TGCTATTCCA 
TGGGATGATG 
CTCTGGAGAT 
TAATGCTGAT 
AATCTTCGAA 
ATCCATCATC 
GAAACTGAAA 
ATAATCTCAT 
AGGATATTGC 
GAAATTTCTG 
TGTGTTTTGG 
AATACTATTA 



21 
I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
CCCAGCCAAT 
CAGAAACCTG 
GACCTAGAAT 
ATAAAATGTC 
ATG CTTGAAG 
AAGGAAGCAG 
CGTATGATTT 
TAATGATTGA 
TGAAGTCTCC 
TTAGTAAAAG 
TTATTTTGTA 
CAGTCGTGGT 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TGGATTCTCA 
GTAGCAATGC 
GCAGAGGAAT 
AAGTAGTGAA 
GAGTGCAAGG 
CAAGATGTAT 
GAGAATACTT 
GGAGAGAAAA 
TGGCATATGT 
ATGTTCTTTT 
TTCCCACCTG 
TTCTGCA 



41 
I 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGA 
GATTGATGAC 
ACCTGTGGGA 
AGCCTCCTCT 
GGAAATCCGA 
ACCTACTGCA 
GAGACGAGAC 
GTCCCTGGAG 
GGATCAGATT 
TACCGAATCA 
TCCCAAAGCG 
TGCTGGTAGA 



Seq ID NO: 302 Protein sequence: 
Protein Accession #: Eos sequence 

21 



11 21 31 41 51 

1 I I I I I 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 
GHAIPPSQLD SQIDDFTGFS KDGMMQKPGS NAPVGGNVTS NFSGDDLECR GIASSPKSQQ 
EINADIKCQV VKEIRCLGQY EKI FEMLEGV QGPTAVRKRF FESIIKEAAR CMRRDFVKHL 
KKKLKRMI 

Seq ID NO: 303 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-815 



1 
I 

AGTGTTCGGC 
GGCCAAACAG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
TGACAGGACA 
TCAGCAAAGA 
CCAG CAGTTT 
AACAAGAAAT 
AAAAATATGA 
AACGATTTTT 
AGCACCTTAA 
ACACCCCAAA 
TCTACAATGG 
TTCCAGAGGC 
TGAAAGGATA 
AACCCATTAG 



11 
I 

TGGGACAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
TGCTATTCCA 
TAGGATGATG 
CTCTGGAGAT 
TAATGCTGAT 
AAAAATCTTC 
TGAATCCATC 
GAAGAAACTG 
TGCATAATCT 
AGCAGGATAT 
TAAGAAATTT 
ACTTGTGTTT 
GTAAATACTA 



21 
I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
CCCAGCCAAT 
CAGAAACCTG 
GACCTAGAAT 
ATAAAACGTA 
GAAATGCTTG 
ATCAAGGAAG 
AAACGTATGA 
CGTTAATGAT 
TGCTGAAGTC 
CTGTTAGTAA 
TGGTTATTTT 
TTACAGTCGT 



31 
I 

TGGCTACTTC 
CGTCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TGGATTCTCA 
GTAGCAATGC 
GCAGAGAAAC 
AATTAGTGAA 
AAGGAGTGCA 
CAGCAAGATG 
TTTGAGAATA 
TGAGGAGAGA 
TCCTGGCATA 
AAGATGTTCT 
GTATTCCCAC 
GGTTTCTGCA 



41 
I 

CCTTCCTTCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGC 
GATTGATGAC 
ACCTGTGGGA 
AGCCTCCTCT 
GGAACTCCGA 
AGGACCTACT 
TATGAGACGA 
CTTGTCCCTG 
AAAGGATCAG 
TGTTACCGAA 
TTTTCCCAAA 
CTGTGCTGGT 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



PCT/US02/12476 



51 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAGCTTA 
TTCACTGGTT 
GGAAATGTTA 
CCCAAAAGCC 
TGCCTTGGAC 
GTCAGGAAAC 
TTTGTTAAGC 
GATTATCACA 
GCTGTTTTCT 
ACTGGCCTTC 
TTTTATTTGA 
TATTATTAAC 



51 

! 

ATCCCCCTTG 
TCCCAGATGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAGCTTA 
TTCACTGGTT 
GGAAACGTTA 
CCCAAAAGCC 
TGCGTTGGAC 
GCAGTCAGGA 
GACTTTGTTA 
GAGGATTATC 
ATTGCTGTTT 
TCAACTGGCC 
GCGTTTTATT 
AGATATTATT 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 



Seq ID NO: 304 Protein sequence: 
Protein Accession # : Eos sequence 



298 



WO 02/086443 

1 11 21 31 41 51 

I I I I I I 

MTDKTEKVAV DPETVFKRPR ECD8P9YQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT 60 
GHAIPPSQLD SQIDDPTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 120 
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180 
LKKKLKRMI 



Seq ID NO: 305 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence t 67-689 

1 11 21 31 41 51 

I I I I I I 

CGTGGAGGCA GCTAGCGCGA GGCTGGGGAG CGCTGAGCCG CGCGTCGTGC CCTGCGCTGC 60 

CCAGACTAGC GAACAATACA GTCAGGATGG CTAAAGGTGA CCCCAAGAAA CCAAAGGGCA 120 

AGATGTCCGC TTATGCCTTC TTTGTGCAGA CATGCAGAGA AGAACATAAG AAGAAAAACC 180 

CAGAGGTCCC TGTCAATTTT GCGGAATTTT CCAAGAAGTG CTCTGAGAGG TGGAAGACGA 240 

TGTCCGGGAA AGAGAAATCT AAATTTGATG AAATGGCAAA GGCAGATAAA GTGCGCTATG 300 

ATCGGGAAAT GAAGGATTAT GGACCAGCTA AGGGAGGCAA GAAGAAGAAG GATCCTAATG 360 

CTCCCAAAAG GCCACCGTCT GGATTCTTCC TGTTCTGTTC AGAATTCCGC CCCAAGATCA 420 

AATCCACAAA CCCCGGCATC TCTATTGGAG ACGTGGCAAA AAAGCTGGGT GAGATGTGGA 480 

ATAATTTAAA TGACAGTGAA AAGCAGCCTT ACATCACTAA GGCGGCAAAG CTGAAGGAGA 540 

AGTATGAGAA GGATGTTGCT GACTATAAGT CGAAAGGAAA GTTTGATGGT GCAAAGGGTC 600 

CTGCTAAAGT TGCCCGGAAA AAGGTGGAAG AGGAAGATGA AGAAGAGGAG GAGGAAGAAG 660 

AGGAGGAGGA GGAGGAGGAG GATGAATAAA GAAACTGTTT ATCTGTCTCC TTGTGAATAC 720 

TTAGAGTAGG GGAGCGCCGT AATTGACACA TCTCTTATTT GAGAAGTGTC TGTTGCCCTC 780 

ATTAGGTTTA ATTACAAAAT TTGATCACGA TCATATTGTA GTCTCTCAAA GTGCTCTAGA 840 

AATTGTCAGT GGTTTACATG AAGTGGCCAT GGGTGTCTGG AGCACCCTGA AACTGTATCA 900 

AAGTTGTACA TATTTCCAAA CATTTTTAAA ATGAAAAGGC ACTCTCGTGT TCTCCTCACT 960 

CTGTGCACTT TGCTGTTGGT GTGACAAGGC ATTTAAAGAT GTTTCTGGCA TTTTCTTTTT 1020 

ATTTGTAAGG TGGTGGTAAC TATGGTTATT GGCTAGAAAT CCTGAGTTTT CAACTGTATA 1080 

TATCTATAGT TTGTAAAAAG AACAAAACAA CCGAGACAAA CCCTTGATGC TCCTTGCTCG 1140 

GCGTTGAGGC TGTGGGGAAG ATGCCTTTTG GGAGAGGCTG TAGCTCAGGG CGTGCACTGT 1200 

GAGGCTGGAC CTGTTGACTC TGCAGGGGGC ATCCATTTAG CTTCAGGTTG TCTTGTTTCT 1260 

GTATATAGTG ACATAGCATT CTGCTGCCAT CTTAGCTGTG GACAAAGGGG GGTCAGCTGG 1320 

CATGAGAATA TTTTTTTTTT TAAGTGCGGT AGTTTTTAAA CTGTTTGTTT TTAAACAAAC 1380 

TATAGAACTC TTCATTGTCA GCAAAGCAAA GAGTCACTGC ATCAATGAAA GTTCAAGAAC 1440 

CTCCTGTACT TAAACACGAT TCGCAACGTT CTGTTATTTT TTTTGTATGT TTAGAATGCT 1500 

GAAATGTTTT TGAAGTTAAA TAAACAGTAT TACATTTTTA AAACTCTTCT CTATTATAAC 1560 

AGTCAATTTC TGACTCACAG CAGTGAACAA ACCCCCACTC CATTGTATTT GGAGACTGGC 1620 

CTCCCTATAA ATGTGGTAGC TTCTTTTATT ACTCAGTGGC CAGCTCACTT AGGGCTGAGA 1680 

TGAAGGAGAG GGCTACTTGA AGCTACTGTG TGATTTTGTT TGTGTCTGAG TGGCATTCAG 1740 

ATGAAGTCTG GAGGAGTTAG GAGAACGACA TAGGCAAGGT TCAGCAGCCT TCCAAGGTAT 1800 

AGGAAGGTGG GTGATTAGGA CTGAGGCTAT CTAGGTTTAA CTTTTGTCCC ACCTCCACCC 1860 

CCTATTTTGT GGGGCCAAAT GCATTGCTAA ACAGCAATTT CAGAGTGTAT GGTGTGTCAA 1920 

AAATTAAGGC CTTATTGTTT TTCTCTTTCA CCCCTACCCC CCGTGCTCCT GGCACATATC 1980 

ACATTATTTG TGGTGCCCAA CATTTGGGGT CTTGAGCCTG CTGCTGGTCT CCTGGATGCC 2040 

AGTGAGGGTA TGTGGGATGG GGTGGTGGGG TAGGGGACGG TATCCTTTTT TTGCTCCTAC 2100 

TTGGAAACAC CAAACACCCC AAGGAAGATG ATAGGCTCCA TCTTGGGCCA CCTGAGCTAT 2160 

AGGGCAGGCT AATGGAATCA ACCATTTCTG AGCACTAAAT GTATCATGAA AAGTTGAATG 2220 

GCCTGCTCAT AAGTTTAGCT CATTCACTGG AAATGTAGAT TGATGTTCAA TGTTAAACTG 2280 

GAAGGAGCTT GGTTTGTGTG TCAGTGGTTA TATTAGTGGG TAGTGTAACA T TTT ATCCAG 2340 

GTTGGGGTGA GGGGAGATGG CCACAGTAGC AAGTGGTGAC ACTAAATACC ATTTTGAAGG 2400 

CTGATGTGTA TATACATCAT TACTGTCCGT AGCAATGAAG GATACAGTAC TGTGTTGTGG 2460 

GTGAGTGTTG CTATTGCCCA GCATTAATAT TTGGGTGTGT ATGTTTGAGG CTATGAAACA 2520 

CGCAGGAGTG TTTTTGTGCT ATTAATTTTA AGAGAAAGCA GCTTTTTCTT AAAATTCACT 2580 

GTTGAGAAAC TTGCATGTCT GGAGGCGGTG TCCTCTCCGC CCTGTCGGGT CCTGGATGAG 2640 

TACGAGTTAT GGTCACGGTC ACAGCCTGAT CTCTTATGTG TTCATAGCCA TTOGCTCTCC 2700 

CATCAGAACT GTTTGTCCTG AATGTGTTCC TCTAGTTCTA GAAAATGACC ACTAATTTAA 2760 

AAAACTCGGT TGTGAGGTTT GCCCAGAGGC ACTTGTTCCA GAATTTCCCC TCCTGCTTCA 2820 

GCCATGTCCT TGTCACTTGG CATTCTAAGC TAAAGCTTTA GCTTCCCAAT TCGTGATGTG 2880 

CTAGGCCAAG ATTCGGGAGC TGTTGCCAGC CTCGTCAAAT ATGGAAGAGA AACAACCTGC 2940 

GGTCAAAAGG GAGTGATTTG TTAAGTGGTG CGCGTCTATC TCATAACTAG ATGTACCAAC 3000 

CAGGGAAGGG CCAAGGATGG AAAGGGGTAA CTTTTGTGCT TCCAAAGTAG CTAAGCAGAA 3060 

GTGGGGGAGC AGTTTAGCCA GATGATCTTT GATTAGGCAA ACATTGAGTT TTAAAGAGGC 3120 

TGTCAAGTTG AGGCCACTTG GTCCATTAGC TGGGGCAGCA AGATCACTAC TCAACGTTTT 3180 

CACACTGTGG CAAGATTGCT CTTCTAGTGG AATAATGCCC TAGTTTCTCT GAGATGATGT 3240 

AAGTGGCATG ATGTTACCTA AGGCTTAGGC TTAGCTTGAT TTCTGGGCCC ACTGTCTGTG 3300 

TTCTTAAGAT GCCAACCTGT TGCTTTTTTT TTTTTTTTCC CCCATTTAAA AGGATAGTAC 3360 

CTACTCCCTC TAACCACCTC ACCCCATTCT TGAATGACAT TTTATCCTTC GGAAAGAACA 3420 

AGGCTGTGAT GTAGTGACTA TTGTCTGTGT CTCCTGTGTG TGTCTGTTCT TGTCACAAAT 3480 
GTATTTGGGG ACGTTGGATG CATTCATTTT CTGTAATAAA G 



Seq ID NO: 306 Protein sequence: 
Protein Accession ft: NP_005333.1 

1 11 21 31 41 51 

I I I i I 1 

MAKGDPKKPK GKMSAYAFFV QTCREEHKKK NPEVPVNFAE FSKKCSEHWK TMSGKEKSKF 60 
DEMAKADKVR YDREMKDYGP AKGGKKKKDP NAPKRPPSGF FLFCSEFRPK IKSTNPGISI 120 
GDVAKKLGEM WNNLNDSEKQ P Y I TKAAKLK EKYEKDVADY KSKGKFDGAK GPAKVARKKV 180 
EEEDEEEEEE EEEEEEEEDE 



Seq ID NO: 307 DNA sequence 
Nucleic Acid Accession #: NM_022342 
Coding sequence: 1..2178 



299 



WO 02/086443 

1 XI 21 3X 4X 51 

I I I I I I 

ATGGGTACTA GGAAAAAAGT TCATGCATTT GTCCGTGTCA AACCCACCGA TGACTTTGCT 60 

CATGAAATGA TCAGATACGG AGATGACAAA AGAAGCATTG ATATTCACTT AAAAAAAGAC X20 

ATTCGGAGAG GAGTTGTCAA TAACCAACAG ACAGACTGGT CGTTTAAGTT GGATGGAGTT ISO 

TTCACGATG CCTCCCAGGA CTTGGTTTAT GAGACAGTTG CAAAGGATGT GGTTTCTCAG 240 

CCCTCGATG GCTATAATGG CACCATCATG TGTTATGGGC AGACGGGAGC TGGCAAGACA 300 

ACACCATGA TGGGGGCAAC TGAGAATTAC AAGCACCGGG GGATCCTCCC TCGTGCCCTG 360 

AGCAGGTTT TTAGGATGAT CGAAGAACGC CCCACACATG CCATCACTGT GCGTGTTTCC 420 

ACTTGGAAA TCTATAATGA GAGCCTGTTT GATCTCCTGT CCACTCTGCC CTATGTTGGA 4 BO 

CCTCAGTCA CACCAATGAC CATCGTGGAA AACCCTCAAG GAGTCTTCAT TAAGGGCTTG 540 

CAGTTCACC TCACAAGTCA GGAGGAGGAT GCATTCAGCC TCCTTTTTGA GGGTGAGACC 600 

ACAGGATTA TAGCCTCCCA CACTATGAAC AAAAACTCTT CCAGATCACA CTGCATTTTC 660 

CCATCTACT TAGAGGCCCA TTCCCGGACC TTATCAGAGG AAAAGTACAT CACTTCCAAA 720 

TTAACTTGG TGGATCTGGC AGGCTCAGAG AGGCTGGGGA AGTCTGGGTC TGAGGGCCAA 780 

TCCTGAAGG AAGCCACCTA CATCAACAAA TCGCTCTCAT TCCTGGAGCA GGCCATCATT 840 

CCCTTGGGG ACCAGAAGCG GGACCACATC CCCTTTCGGC AGTGCAAGCT CACCCACGCT 900 

TGAAGGACT CGTTAGGGGG AAACTGCAAT ATGGTCCTOG TGACAAACAT CTATGGAGAA 960 

CTGCCCAGT TAGAAGAAAC GCTATCTTGA CTGAGATTTG CCAGCAGGAT GAAGCTAGTC 1020 

CCACTGAGC CTGCCATCAA TGAAAAGTAT GATGCTGAGA GAATGGTCAA GAACCTGGAG 1080 

AGGAACTAG CACTACTCAA GCAGGAGCTG GCTATCCATG ACAGCCTGAC CAACCGCACC 1140 

TTGTGACCT ATGACCCCAT GGATGAAATC CAGATTGCTG AGATCAACTC CCAGGTGCGG 1200 

GGTACCTGG AGGGGACACT GGACGAGATC GACATAATCA GCCTTAGACA GATCAAGGAG 1260 

TGTTCAACC AGTTCCGGGT GGTTCTGAGC CAACAGGAAC AGGAAGTGGA GTCCACTTTG 1320 

GCAGGAAGT ACACCCTCAT TGACAGGAAT GACTTTGCAG CCATTTCTGC TATCCAGAAG 1380 

CGGGGCTTG TGGATGTTGA TGGCCACCTA GTGGGTGAGC CTGAAGGACA AAACTTTGGA 1440 

TCGGAGTCG CCCCTTTCTC TACCAAACCT GGGAAGAAAG CCAAGTCCAA GAAGACATTC 1500 

AAGAGCCAC TCAGGCCCGA CACCCCACCC TCCAAACCAG TGGCCTTTGA GGAGTTTAAG 1560 

ATGAGCAAG GTAGTGAGAT CAACCGAATT TTCAAAGAAA ACAAATCCAT CTTGAATGAA 1620 

GGAGGAAAA GGGCCAGCGA GACCACACAG CACATCAATG CCATCAAGCG GGAGATTGAT 1680 

TGACCAAGG AGGCCCTGAA TTTCCAGAAG TCACTACGGG AGAAGCAAGG CAAGTACGAA 1740 

ACAAGGGGC TGATGATCAT CGATGAGGAA GAATTCCTGC TGATCCTCAA GCTCAAAGAC 1800 

TCAAGAAGC AGTACCGCAG CGAGTACCAG GACCTGCGTG ACCTCAGGGC TGAGATCCAG I860 

ATTGCCAGC ACCTAGTGGA TCAGTGTCGC CACCGCCTGC TCATGGAATT TGACATCTGG 1920 

ACAATGAGT CCTTTGTCAT CCCTGAGGAC ATGCAGATGG CACTGAAGCC AGGCGGCAGC 1980 

TCCGGCCAG GCATGGTCCC TGTGAACAGG ATTGTGTCTC TGGGAGAAGA TGACCAGGAC 2040 

AATTCAGCC AGCTGCAGCA GAGGGTGCTT CCTGAGGGCC CTGATTCCAT CTCCTTCTAC 2100 

ATGCCAAAG TCAAGATAGA GCAGAAGCAT AATTACTTGA AAACCATGAT GGGCCTCGAG 2160 
AGGCACATA. GAAAATAG 

Seq ID NO: 308 Protein sequence: 
Protein Accession ft: NP_071737 

1 11 21 

1 I I 

MGTRKKVHAF VRVKPTDDPA HEMIRYGDDK 
ZiHDASQDliVY ETVAKDWSQ ALDGYNGTXM 
QQVFRMIEER PTHA1TVRVS YLEIYNESLF 
SVHIjTSQEED AFSLLFEGET NRIIASHTMN 
INLVDLAGSE RLGKSGSEGQ VLKEATYINK 
LKDSLGGNCN MVLVTNIYGE AAQLEETLSS 
KBLALLKQELi AIHDSliTNRT PVTYDPMDEI 
VPNQFRWLS QQEQEVESTL RRKYTLIDRN 
IiGVAPPSTKP GKKAKSKKTP KEPLRPDTPP 
RRKRASETTQ HINAIKREID VTKEALNFQK 
LKKQYRSEYQ DLRDLRAEIQ YCQHLVDQCR 
IRPGMVPVNR IVSLGEDDQD KFSQLQQRVL 
QAHRK 

Seq ID NO: 309 DNA sequence 
Nucleic Acid Accession fti CAT cluster 

1 11 21 31 41 51 

I I I I I I 

TTTTTTTTTT TTTTTTTTAA TGCCTGCTGT CATGCTCTGT CTACCAGGGT GAATTTCCAA 60 
AAATTTCTGC ATAGCAATTT TAGCCAAAAC TATATATGTT CTGGGGAGGA TAGGCATAGG 120 
CACATTGAAG ACCAAAGGAA AGAGTGAAGA AGTGTAGTTG GGTCATTGTG AATGGATGTT 180 
TAGATTGTCA AGAAAAGTGG GCCAGAGGCC CCACCTCACA CTAGGACGGC AATTGCCTCT 240 
CATTAGTATC TCAGGGACCA TGGGTCTTAT TTGGTGTCAT AAGAAACACC CTCAACAAAG 300 
TAATGAACCC TCAGCCTCCA GCTTCTCTTC TTCGGGATTC TTCTTAGGGC CT CCTTTTTC 360 
CTTTTATGTT TCCAGTACCC TGAATTTCTT ATTCCCATCC CCCATTAAAA TCTGCTTCAA 420 
AGAAAAAACA AGAAGGACAC ATTCACTTTA AGATCCAAAT GAATGATAAG AGCTTAAAAC 480 
ATTATACTTA TCAGTATTAT TTGCATTTTT ATAGAAACCA AAACCATATT TCAACAAC 

Seq ID NO i 310 DNA sequence 

Nucleic Acid Accession ft i NM_018622.2 

Coding sequence i 1-1140 

1 11 21 31 41 51 

I I I I I 1 

ATGGCGTGGC GAGGCTGGGC GCAGAGAGGC TGGGGCTGCG GCCAGGCGTG GGGTGCGTCG 60 
GTGGGCGGCC GCAGCTGCGA GGAGCTCACT GCGGTCCTAA CCCCGCCGCA GCTCCTCGGA 120 
CGCAGGTTTA ACTTCTTTAT TCAACAAAAA TGCGGATTCA GAAAAGCACC CAGGAAGGTT 180 
GAACCTCGAA GATCAGACCC AGGGACAAGT GGTGAAGCAT ACAAGAGAAG TGCTTTGATT 240 
CCTCCTGTGG AAGAAACAGT CTTTTATCCT TCTCCCTATC CTATAAGGAG TCTCATAAAA 300 
CCTTTATTTT TTACTGTTGG GTTTACAGGC TGTGCATTTG GATCAGCTGC TATTTGGCAA 360 



31 41 51 

1 I I 

RSIDIHLKKD IRRGWNNQQ TDWSFKLDGV 60 

CYGQTGAGKT YTMMGATENY KHRGI LPRAI* 120 

DLLSTIiPYVG PSVTPMTIVE NPQGVPIKGI* 180 

KNSSRSHCIP TIYLEAHSRT LSEEKYITSK 240 

SLSFLEQAII ALGDQKRDHI PFRQCKLTHA 300 

LRFASRMKLV TTEPAINEKY DAERMVKNLE 360 

QIAEINSQVR RYLEGTLDEI DIISLRQIKE 420 

DPAAISAIQK AGLVDVDGHL VGEPEGQNFG 480 

SKPVAFEEFK NEQGSEINRI FKENKSILNE 540 

SLREKQGKYE NKGLMI IDEE EPIiLILKLKD 600 

HRLLMEFDIW YNESFVIPED MQMALKPGGS 660 

PEGPDSISFY NAKVKIEQKH NYLKTMMGLQ 720 



300 



WO 02/086443 

TATGAATCAC TGAAATCCAG GGTCCAGAGT TATTTTGATG GTATAAAAGC TGATTGGTTG 420 

GATAGCATAA GACCACAAAA AGAAGGAGAC TTCAGAAAGG AGATTAACAA GTGGTGGAAT 480 

AACCTAAGTG ATGGCCAGCG GACTGTGACA GGTATTATAG CTGCAAATGT CCTTGTATTC 540 

TGTTTATGGA GAGTACCTTC TCTGCAGCGG ACAATGATCA GATATTTCAC ATCGAATCCA 600 

GCCTCAAAGG TCCTTTGTTC TCCAATGTTG CTGTCAACAT TCAGTCACTT CTCCTTATTT 660 

CACATGGCAG CAAATATGTA TGTTTTGTGG AGCTTCTCTT CCAGCATAGT GAACATTCTG 720 

GGTCAAGAGC AGTTCATGGC AGTGTACCTA TCTGCAGGTG TTATTTCCAA TTTTGTCAGT 780 

TACCTGGGTA AAGTTGCCAC AGGAAGATAT GGACCATCAC TTGGTGCATC TGGTGCCATC 840 

ATGACAGTCC TCGCAGCTGT CTGCACTAAG ATCCCAGAAQ GGAGGCTTGC CATTATTTTC 900 

CTTCCGATGT TCACGTTCAC AGCAGGGAAT GCCCTGAAAG CCATTATCGC CATGGATACA 960 

GCAGGAATGA TCCTGGGATG GAAATTTTTT GATCATGCGG CACATCTTGG GGGAGCTCTT 1020 

TTTGGAATAT GGTATGTTAC TTACGGTCAT GAACTGATTT GGAAGAACAG GGAGCCGCTA 1080 
GTGAAAATCT GGCATOAAAT AAGGACTAAT GGCCCCAAAA AAGGAGGTGG CTCTAAGTAA 



Seq ID NO i 311 Protein sequence t 
Protein Accession 8: NP_061092.2 

1 11 21 31 41 

I I I I I 

MAWRGWAQRG WGCGQAWGA3 VGGRSCEELT AVLTPPQLLG RRFNFFIQQK 
EPRRSDPGT3 GEAYKRSALI PPVEETVPYP SPYPIRSLIX PLFFTVGFTG 
YESLKSRVQS YFDGIKADWL DSIRPQKEGD FRKEINKWWN NLSDGQRTVT 
CLWRVPSLQR TMIRYFTSNP ASKVLCSPML LSTFSHFSIiF HMAANMYVLW 
GQEQFMAVYL SAGVISNFVS YLGKVATORY GPSLGASGAI MTVLAAVCTK 
LPMFTFTAGN ALKAIIAMDT AGMILGWKFF DHAAHLGGAL FGI WYVTYGH 
VKIWHEIRTN GPKKGGGSK 

Seq ID NO: 312 DNA sequence 
Nucleic Acid Accession #: NM_000625 
Coding sequence: 195.. 3656 



1 11 21 31 41 51 

I I I 1 1 I 

CTCTCGGCCA CCTTTGATGA GGGGACTGGG CAGTTCTAGA CAGTCCCGAA GTTCTCAAGG 60 

CACAGGTCTC TTCCTGGTTT GACTGTCCTT ACCCCGGGGA GGCAGTG CAG CCAGCTGCAA 120 

GCCCCACAGT GAAGAACATC TGAGCTCAAA TCCAGATAAG TGACATAAGT GACCTGCTTT 180 

GTAAAGCCAT AGAGATGGCC TGTCCTTGGA AATTTCTGTT CAAGACCAAA TTCCACCAGT 240 

ATGCAATGAA TGGGGAAAAA GGCATCAACA ACAATGTGGA GAAAGCCCCC TGTGCCACCT 300 

CCAGTCCAGT GACACAGGAT GACCTTCAGT ATCACAACCT CAGCAAGCAG CAGAATGAGT 360 

CCCCGCAGCC CCTCGTGGAG ACGGGAAAGA AGTCTCCAGA ATCTCTGGTC AAGCTGGATG 420 

CAACCCCATT GTCCTCCCCA CGGCATGTGA GGATCAAAAA CTGGGGCAGC GGGATGACTT 480 

TCCAAGACAC ACTTCACCAT AAGGCCAAAG GGATTTTAAC TTGCAGGTCC AAATCTTGCC 540 

TGGGGTCCAT TATGACTCCC AAAAGTTTGA CCAGAGGACC CAGGGACAAG CCTACCCCTC 600 

CAGATGAGCT TCTACCTCAA GCTATCGAAT TTGTCAACCA ATATTACGGC TCCCTCAAAG 660 

AGGCAAAAAT AGAGGAACAT CTGGCCAGGG TGGAAGCGGT AACAAAGGAG ATAGAAACAA 720 

CAGTAACCTA CCAACTGACG GGAGATGAGC TCATCTTCGC CACCAAGCAG GCCTGGCGCA 780 

ATGCCCCACG CTGCATTGGG AGGATCCAGT GGTCCAACCT GCAGGTCTTC GATGCCCGCA 840 

GCTGTTCCAC TGCCCGGGAA ATGTTTGAAC ACATCTGCAG ACACGTGCGT TACTCCACCA 900 

ACAATGGCAA CATCAGGTCG GCCATCACCG TGTTCCCCCA GCGGAGTGAT GGCAAGCACG 960 

ACTTCCGGGT GTGGAATGCT CAGCTCATCC GCTATGCTGG CTACCAGATG CCAGATGGCA 1020 

GCATCAGAGG GGACCCTGCC AACGTGGAAr TCRCTCAGCT GTGCATCGAC CTGGQCTGQA 1080 

AGCCCAAGTA CGGCCGCTTC GATGTGGTCC CCCTGGTCCT GCAGGCCAAT GGCCGTGACC 1140 

CTGAGCTCTT CGAAATCCCA CCTGACCTTG TGCTTGAGGT GGCCATGGAA CATCCCAAAT . 1200 

ACGAGTGGTT TCGGGAACTG GAGCTAAAGT GGTACGCCCT GCCTGCAGTG GCCAACATGC 1260 

TGCTTGAGGT GGGCGGCCTG GAGTTCCCAG GGTGCCCCTT CAATGGCTGG TACATGGGCA 1320 

CAGAGATCGG AGTCCGGGAC TTCTGTGATG TCCAGCGCTA CAACATCCTG GAGGAAGTGG 1380 

GCAGGAGAAT GGGCCTGGAA ACGCACAAGC TGGCCTCGCT CTGGAAAGAC CAGGCTGTCG 1440 

TTGAGATCAA CATTGCTGTG CTCCATAGTT TCCAGAAGCA GAATGTGACC ATCATGGACC 1500 

ACCACTCGGC TGCAGAATCC TTCATGAAGT ACATGCAGAA TGAATACCGG TCCCGTGGGG 1560 

GCTGCCCGGC AGACTGGATT TGGCTGGTCC CTCCCATGTC TGGGAGCATC ACCCCCGTGT 1620 

TTCACCAGGA GATGCTGAAC TACGTCCTGT CCCCTTTCTA CTACTATCAG GTAGAGGCCT 1680 

GGAAAACCCA TGTCTGGCAG GACGAGAAGC GGAGACCCAA GAGAAGAGAG ATTCCATTGA 1740 

AAGTCTTGGT CAAAGCTGTG CTCTTTGCCT GTATGCTGAT GCGCAAGACA ATGGCGTCCC 1800 

GAGTCAGAGT CACCATCCTC TTTGCGACAG AGACAGGAAA ATCAGAGGCG CTGGCCTGGG 1860 

ACCTGGGGGC CTTATTCAGC TGTGCCTTCA ACCCCAAGGT TGTCTGCATG GATAAGTACA 1920 

GGCTGAGCTG CCTGGAGGAG GAACGGCTGC TGTTGGTGGT GACCAGTACG TTTGGCAATG 1980 

GAGACTGCCC TGGCAATGGA GAGAAACTGA AGAAATCGCT CTTCATGCTG AAAGAGCTCA 2040 

ACAACAAATT CAGGTACGCT GTGTT TGGCC TCGGCTCCAG CATGTACCCT CGGTTCTGCG 2100 

CCTTTGCTCA TGACATTGAT CAGAAGCTGT CCCACCTGGG GGCCTCTCAG CTCACCCCGA 2160 

TGGGAGAAGG GGATGAGCTC AGTGGG CAGG AGGACGCCTT CCGCAGCTGG GCCGTGCAAA 2220 

CCTTCAAGGC AGCCTGTGAG ACGTTTGATG TCCGAGGCAA ACAGCACATT CAGATCCCCA 2280 

AGCTCTACAC CTCCAATGTG ACCTGGGACC CGCACCACTA CAGGCTCGTG CAGG ACT CAC 2340 

AGCCTTTGGA CCTCAGCAAA GCCCTCAGCA GCATGCATGC CAAGAACGTG TTCACCATGA 2400 

GGCTCAAATC TOGGCAGAAT CTACAAAGTC CGACATCCAG CCGTGCCACC ATCCTGGTGG 2460 

AACTCTCCTG TGAGGATGGC CAAGGCCTGA ACTACCTGCC GGGGGAGCAC CTTGGGGTTT 2520 

GCCCAGGCAA CCAGCCGGCC CTGGTCCAAG GCATCCTGGA GCGAGTGGTG GATGGCCCCA 2580 

CACCCCACCA GGCAGTG CGC CTGGAGGCCC TGGATGAGAG TGGCAGCTAC TGGGTCAGTG 2640 

ACAAGAGGCT GCCCCCCTGC TCACTCAGCC AGGCCCTCAC CTACTTCCTG GACATCACCA 2700 

CACCCCCAAC CCAGCTGCTG CTCCAAAAGC TGGCCCAGGT GGCCACAGAA GAGCCTGAGA 2760 

GACAGAGGCT GGAGGCCCTG TGCCAGCCCT CAGAGTACAG CAAGTGGAAG TTCACCAACA 2820 

GCCCCACATT CCTGGAGGTG CTAGAGGAGT TCCCGTCCCT GCGGGTGTCT GCTGGCTTCC 2880 

TGCTTTCCCA GCTCCCCATT CTGAAGCCCA GGTTCTACTC CATCAGCTCC CCCCGGGATC 2 940 

ACACGCCCAC GGAGATCCAC CTGACTGTGG CCGTGGTCAC CTACCACACC CGAGATGGCC 3000 

AGGGTCCCCT GCACCACGGC GTCTGCAGCA CATGGCTCAA CAGCCTGAAG CCCCAAGACC 3060 

CAGTGCCCTG CTTTGTGCGG AATGCCAGCG GCTTCCACCT CCCCGAGGAT CCCTCCCATC 3120 



51 
I 

CGFRKAPRKV 60 

CAFGSAAIWQ 120 

GIIAANVLVP 180 

SFSSSIVNIL 240 

IPEGRLAIIF 300 

ELIWKNREPL 360 



301 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

CTTGCATCCT CATCGGGCCT GGCACAGGCA 
GGCTCCATGA CTCCCASCAC AAGGGAGTGC 
OCCOCCGCCC AGATGAGGAC CACATCTACC 
GGGTGCTGCA TGCGGTGCAC ACAGCCTATT 
TTCAGGACAT CCTGCGGCAG CAGCTGGCCA 
CAGGCCACCT CTATGTTTGC GGGGATGTGC 
AGCAGCTGGT GGCTGCCAAG CTGAAATTGA 
AGCTCAAGAG CCAGAAGCGC TATCACGAAG 
CGAAGAAGGA CAGGGTGGCG GTGCAGCCCA 
CTACAGGAGQ GGTTAAAGCT GCCGGCACAG 
CTGAGGTCAC AGGGCCTGGG GAGATGGAGG 
TTCCTCAACG TTGCTCCCCA TCAAGCCCTT 
ATTGATCGGA GCCTC 

Seq ID NO: 313 Protein sequence: 
Protein Accession ft: NP_000616 



PCT/US02/12476 



MACPWKFLFK 
VETGKKSPES 
TPKSLTRGPR 
LTGDELIFAT 
RSAITVPPQR 
RFDWPLVLQ 
GLEFPGCPPN 
AVLHSFQKQN 
LNYVLSPFYY 
ILFATETGKS 
NGEKLKKSLF 
ELSGQEDAFR 
' SKALSSMHAK 
PALVQGILER 
LLLQKLAQVA 
PILKPRFYSI 
VRMASGFHLP 
EDHIYQEEML 
VCGDVRMARD 
VAVQPSSLEM 



11 
I 

TKFHQYAMNG 
LVKLDATPLS 
DKPTPPDELL 
KQAWRNAPRC 
9DGKHDFRVW 
ANGRDPELFE 
GWYMGTEIGV 
VTIMDHHSAA 
YQVEAWKTHV 
EAIjAWDLGAL 
MLKELNNKFR 
SWAVQTFKAA 
NVFTMRLKSR 

wdgptphqa 
teeperqrle 
ssprdhtpte 

EDPSHPCILI 
EMAQKGVLHA 
VAHTLKQLVA 
SAL 



21 

I 

EKGINNNVEK 

sprhvriknw 

PQAIEFVNQY 
IGRIQWSNLQ 

naqliryagy 

IPPDLVIjEVA 
RDFCDVQRYN 
RSFMKYMQNE 
WQDEKRRPKR 
FSCAFNPKW 
YAVFGLGSSM 
CETFDVRGKQ 
QNLQSPTSSR 
VRLEALDESG 
ALCQPSEYSK 
IHLTVAWTY 
GPGTGIAPFR 
VHTAYSRLPG 
AKLKLNEEQV 



TCGCGCCCTT 
GGGGAGGCCG 
AGGAGGAGAT 
CCCGCCTGCC 
GCGAGGTGCT 
GCATGGCCCG 
ATGAGGAGCA 
ATATCTTTGG 
GCAGCCTGGA 
AACTTAAGGA 
AAAGTGATAT 
TACTTGACCT 



31 

I 

APCATSSPVT 
GSGMTFQDTL 
YGSLKEAKIE 
VFDARSCSTA 
QMPDGSIRGD 
MEHPKYEWFR 
ILEEVGRRMG 
YRSRGGCPAD 
RE I PLKVLiVK 
CMDKYRLSCL 
YPRFCAFAHD 
HIQIPKLYTS 
ATILVELSCE 
SYWVSDKRLP 
WKFTNSPTFL 
HTRDGQGPLH 
SFWQQRLHDS 
KPKVYVQDIL 
EDYFFQLKSQ 



CCGCAGTTTC 
CATGACCTTG 
GCTGGAGATG 
TGGCAAGCCC 
CCGTGTGCTC 
GGACGTGGCC 
GGTCGAGGAC 
TGCTGTATTT 
GATGTCAGCG 
TGGAGCCAGC 
CCCCCAGCCT 
CCTAACAAGT 



41 

I 

QDDLQYHNLS 
HHKAKGILTC 
EELARVEAVT 
REMFEHICRH 
PANVEFTQLC 
ELELKWYALP 
LETHKLASLW 
WIWLVPPMSG 
AVLFACMLMR 
EEERLIiLWT 
IDQKLSHLGA 
NVTWDPHHYR 
DGQGLNYLPG 
PCSLSQALTY 
EVLEEFPSLR 
HGVCSTWLNS 
QHKGVRGGRM 
RQQLASEVLR 
KRYHEDIFGA 



TGGCAGCAAC 
GTGTTTGGGT 
GCCCAGAAGG 
AAGGTCTATG 
CACAAGGAGC 
CACACCCTGA 
TATTTCTTTC 
CCTTACGAGG 
CTCTGAGGGC 
TCTGCATTAT 
CAAGTCTTAT 
AGCACCCTGG 



51 

I 

KQQNESPQPIi 
RSKSCLGSIM 
KEIETTVTYQ 
VRYSTNNGNI 
IDLGWKPKYG 
AVANMLLEVG 
KDQAWEINI 
SITPVFHQEM 
KTMASRVRVT 
STFGNGDCPG 
SQLTPMGEGD 
IiVQDSQPLDL 
EHLGVCPGNQ 
FLDITTPPTQ 
VSAGFLLSQIi 
LKPQDPVPCF 
TLVFGCRRPD 
VLHKEPGHLY 
VFPYEAKKDR 



Seq ID NO: 314 DNA sequence 
Nucleic Acid Accession #i XM_087254 
Coding sequence: 47.. 2332 



1 

I 

AGAGTACGTG 
ATGTTCAATT 
AACACCAGAC 
CTTATCCCAT 
AATTAAAGAA 
CAATGTTCAA 
GTTGGAGTAC 
TGGTATTGTG 
GGAACGGTAC 
AATTGTTCAG 
TCTCCCTAAA 
TTTGAAAGGG 
GGAAATAGAT 
GGCAGCTGTT 
AGACAGACTA 
AGTATGGGTA 
CCATTTTCAT 
TGCTGAACAA 
GCTGGTAGTG 
GGAAGTTTGC 
AAAAGTAATA 
TGGTGCTAAT 
AGAAGGAAGA 
CAAATTGCTT 
TTTTTTTTAT 
GTTTTCTCAG 
TTCCCTACCT 
AAATAAGCCC 
TCTTTATTGG 
ACTAATAGGG 
TGGCACTTTG 
TCATTTTTGG 
ATTTTCCTTG 
TGTGTTTATT 
TACATG TCT A 
TACTGAAAAG 
CTGTTTCCCG 
AGGAAGATGT 
CAACGACAGG 



11 
I 

TTTACAGATA 
AATGGCATGA 
TCTTCAGAAG 
CTTACAACCA 
CATGATCTCT 
ACTGACTGCA 
TATGCATCTT 
TTTATTGGCA 
AAACTGCTTC 
GCACCTTCAG 
TGTATAGGTG 
CTAAGAACTC 
AAACGCATAT 
TTCCAGTTCA 
CAAGATAAAG 
CTTACTGGGG 
AGAACCATGA 
TTGAGGCAGC 
GATGGGACCA 
AGAAATTGTT 
AGACTAATAA 
GACGTAAGCA 
CAGG CTGCAA 
TTTGTTCATG 
AAGAATGTGT 
CAAACATTGT 
ATTCTGATAT 
ACCCTTTATC 
ACCATCCTGG 
AAAGATACAT 
GTCTTCACAG 
ACTTGGATCA 
TTTTATGGAG 
CAGCTCCTGT 
TTTCTTGATA 
GCACAGCTTA 



AGTCCAACCC 
AGCATCTTGA 



21 
I 

AAACTGGTAC 
AATACCAAGA 
GAAACTTATC 
GTTCCTCTTT 
TCTTTAAAGC 
CTGGTGATGG 
CACCAGATGA 
ATTCTGAAGA 
ATATTCTGGA 
GTGAGAAGTT 
GAGAAATAGA 
TGTGTATAGC 
TTGAAGCCAG 
TAGAGAAAGA 
TTCGAGAAAC 
ATAAACATGA 
ACATCCTTGA 
TTGCCAGAAG 
GCCTATCTCT 
CAGCTGTATT 
AAATATCACC 
TGATACAAGA 
GAAACAGTGA 
GTCATTTTTA 
GCTTTATCAC 
ATGACAGCGT 
ATAGTCTTTT 
GAGACATTAG 
GCTTCAGTCA 
CTCTGCTTGG 
TCATGGTTAT 
ACCATCTCGT 
GGATTCTCTG 
CAAGTGGTTC 
TCATAAAGAA 
CTGAAACAAA 
CAGCGTGTGC 
ACATCAG CAG 
CTCTCTCCAC 



31 

1 

ACTGACAGAA 
AATTAATGGT 
TTATCTTAGT 
CAGAACCAGT 
AGTCAGTCTC 
TCCCTGGCAA 
AAAGGCTCTA 
AACTATGGAG 
ATTTGATTCA 
ATTATTTGCT 
AAAAACCAGA 
ATATAGAAAA 
GACTGCCTTG 
CCTGATATTA 
TATTGAAGCA 
AACAGCTGTT 
ACTTATAAAC 
AATTACAGAG 
TGCACTCAGG 
ATGCTGTCGT 
TGAGAAACCT 
AGCCCATGTT 
CTATGCAATA 
TTATATTAGA 
ACCCCAGTTT 
GTACCTGACT 
GGAACAGCAT 
TAAAAACCGC 
TGCCTTTATT 
AAATGGCCAG 
TACAGTCACA 
TACCTGGGGA 
GCCATTTTTG 
TGCTTGGTTT 
GGTCTTTGAC 
TGCAGGTATC 
ATCTGTTGGA 
ATCATGGAGT 
AATGGACTCA 



41 

I 

AATGAGATGC 
AGACTTGTAC 
AGTTTATCCC 
CCTGAAAATG 
TGTCACACTG 
TCCAACCTGG 
GTAGAAG CTG 
GTTAAAACTC 
GATCGTAGGA 
AAAGGAG CTG 
ATTCATGTAG 
TTTACATCAA 
CAGCAGCGGG 
CTTGGAGCCA 
TTGAGAATGG 
AGTGTGAGTT 
CAGAAATCAG 
GATCATGTGA 
GAGCATGAAA 
ATGGCTCCAC 
ATAACATTGG 
GGCATAGGAA 
GCCAGATTTA 
ATAGCTACCC 
TTATATCAGT 
TTATACAATA 
GTAGACCCTC 
CTCTTAAGTA 

ATGTTTGGAA 
GTAAAGATGG 
TCTATTATAT 
GGCTCCCAGA 
GCCATAATCC 
CGACACCTCC 
AAGTGCTTGG 
AGAATGCTGG 
GCATCGGATC 
TCTACTTGTT 



51 
I 

AGTTTCGGGA 
CCGAAGGACC 
ATCTTAACAA 
AAACTGAACT 
TACAGATTAG 
CACCATCGCA 
CTGCAAGGAT 
TTGGAAAACT 
GAATGAGTGT 
AGTCATCAAT 
ATGAATTTGC 
AAGAGTATGA 
AAGAGAAATT 
CAGCAGTAGA 
CTGGTATCAA 
TATCATGTGG 
ACAGCGAGTG 
TTCAGCATGG 
AACTATTTAT 
TGCAGAAAGC 
CTGTTGGTGA 
TCATGGGTAA 
AGTTCCTCTC 
TTGTACAGTA 
TCTACTGTTT 
TTTGTTTTAC 
ATGTGTTACA 
TTAAAACATT 
GATCCTATTT 
ACTGGACATT 
CTCTGGAAAC 
TTTATTTTGT 
ATATGTATTT 
TCATGGTTGT 
ACCCTACAAG 
ACTCCATGTG 
AACGAGTTAT 
CTTTCTATAC 
AAAGGGGCAG 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



302 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

TAGTACTTTG TGGGAGCCAG TTCACCTCCT 
ATGGCCACAC TAGCTCTGAA ATTAATTTCC 
QAQTTATAAT GGCAAACAAA CAGAAAG CAT 
TGAATCTGAA CATGTTAAAA TTTGAGAATA 
TGTCCCTTGT GCTTATGGGA CTCCTAATGG 
TTTAATATAA ATGTAGAAAA AAGAGAGAAA 
TTGATTATTG ACTCTTCTAT TTAAATCTGC 
AGAACTCTAT TTTTTTATTA GAGTTATATT 
ATACTGAGGA ATTTTGGTCC CTCAGTGACC 
TTCACAGAGC AAATTAGGAG AATCATTTCC 
ATTTATACCA ATTCCTCTAA CTGTACTGTA 
CAAGGGTATA TCATATATAC AAATCAGGAA 
GTTTACTAAT ATTTTTGTGA CAGAGTATAA 
TAGCATATTA TTAATTTAAT GTCTTTATCA 
ACATATTTAA ATTTGCTTTT TTTCTCTTTA 
CATCGTTGTA CAGTTTAACT AT AT CAAT AA 
TATGTTTAAT TATACAAATC AGAATAGTAT 
CTCTTTCTGC AGCOGACTTA GACATGCTCT 
AGGGTTTCAG TTAATAATCT TATTTTCAGG 
CAATACAGTG AGTTCTGCCA GTGTCCCAGT 
TGTAAAAATG CTCAACTTGT ATCAGGTAAT 
TAATCGGGTA CATGTTACTG TAATTAACTC 
AATTTATCAA GTAGTTCAGT ATTGT CATTT 
TTAAGATTTA GAAGTGATTA TTAGCTTGAG 
TTGTATACAT ATTAAGATAA TGGTTAAATG 
AATTCCTTTA TGGAGATTTA TTGTGCAGCC 
GGCTTCTAGA ATTGGACTGG CAGGGGAAAG 
CTTGTTTGCT TGTAAACTAT TATTTTCTTG 
TAAGGATATT AAGTTATTAA GCTAAATATT 
GATATTTCAT AGCTGGATTT AGGAAGATCT 
CAACGTACAA TGTCTGCATT CACTAATTCA 
CTCAGTAGAG TACTAGGTGG GAGGATATGG 
TGCATATAAC AAAATGACAC CCAGTAGGCC 
GCCATCAAAT AAACTGAGTA CTGACACCAG 
TGACCAACTG CAGCAAGACA GGAGGTCAGC 
ATGTAATTTT CTGTACTCAC CATTTGAAGT 
AAGTAAATGG CAACCACTAG TGTGCTCATC 
TTAAAGCAAA ATTATCTTGT GATTTTAAGA 
CAATGCAGTC TGCAAGCTTT CAGTAGTTTT 
ACTACGTAAC CAGTAATCAC AAGGAAAGTG 
CTTTGGAAAG TATGATGTTG ATAATTAACT 
GCTAAATACG TTATTGCTAA TCAGTGGTCT 
GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG 
GCTGCTTTAA GTGACTCAGC ACCCTGCCTC 
GGAGCAAAGT ATGGGCCAGG GAGAACTACA 
AGGGGAGAAT TTATGGTCTG AATTTTCTAA 
ATACACAAAG GCTTCCAGAC CTGAGCCACA 
CAGAGGCAAA TCAACCCTAG GAAATACTTG 
AGGTCATTTC TACTGGAAAA GATTGTGAGA 
ATAGGCAGGA GTCAAGGCCA CTAGAAAATT 
TGCTACTCTG AAAAATCTCG TGAAGGCTGT 
TTTTCCTGTA AAGATCAGTT TGGGGTATGA 
CAAAGAGTTA CGTAAAACAT GTTTTATTAA 
CTATTTTGAA ATGAGTTATC TATTTTCATA 
TGTGAAATAA CTTGAATGTT GTTCCTATAA 
AATCATGGTA ATTTAGATTT TTATGAGGAA 
GGTTTAAAAT TTTGGACCTG AGACACTGTG 
TGCATTGTCA GTAAATGTAG TATATTATTG 
GAAGTTATAT TTATCAAATA AAAACTTTCC 

Seq ID NO: 31S Protein sequence: 
Protein Accession #: XP_087254 



TTCCTAAAAT 
AAAATCTTTG 
TAGTACAAGC 
AAGAGACATT 
CATTTCAGTC 
TCTTAGTAAA 
TTCTGTAAAT 
TAAAGCTTTT 
TGTGTTGTTA 
AACCATTATT 
ACACAGCCTG 
TCAGGTCCGT 
AGACCCTATA 
TTGGATCTTT 
CCTGAAGGCT 
AAAGTTTGGA 
GGGTAATTAA 
TCCCTTTCTA 
TTATGTCAT C 
ACAAGG CAT A 
GTTAGCAATA 
ATTGCACTTC 
GTTTTTGTTT 
AACTATTACC 
CGGTTTTACC 
CTAAGCTTCC 
AATGGTAGAG 
CTAATGTAAC 
AATTTTCAAA 
GTTATTCTGG 
TGTTCCAGAA 
AAATTTGCTC 
TGCATTACAT 
ACAAAGACTC 
TCGCCTATAA 
TAGTTAAGGA 
CTGAACTGTT 
AAAGAGTTTT 
CTAGTGCTAT 
TCCCCTTTGC 
TACCCTTATC 
CAAATCGATT 
CACCAAGTCA 
AGCTTCAGCA 
GCTACGAAGA 
CTGTCCTCTT 
CCCAGGCCCT 
CATTCTGCCC 
TTGAACTTAT 
GACAGTTAAG 
AGGAAAAGGG 
TATAAGCAGG 
TTTTGGTCCC 
AAAGTAAAAC 
AAAATAGATC 
TGAGTATCTG 
GCTGTCTAAT 
TACAGCTACT 
TATAT 



TCAGTGTGAT 
TAGTAGTTCA 
CCCTCCCAAC 
TTTCATCTCT 
TGTTGCTGAG 
GAGTATTTTT 
TATGCTGAAA 
CATGGGAAAA 
ATTCATTAAT 
TACTGCAGTA 
TAAAGTTAGC 
TCACCGAACT 
GTGGGTAAAT 
TGCATGCTTT 
CTGTGTATAG 
CAGTATTTAA 
ATGAATACAA 
TAAGCTAGAT 
TAACTTATAG 
TTTCAGGTGT 
AATTAAATGC 
AAAACCTAAC 
TATTGAAAAG 
CAGCTCTAAG 
AAGTTTTCCC 
TTCCCATTTC 
ACAGAAATTA 
ATTTGTCTGT 
AATAGTCCTT 
AAGTACTAAA 
GA6GAAATAA 
ATAAAATCTC 
TTACATGACC 
CAAAGTCATA 
TGGTGCTTAA 
GAACTTTATT 
ACTCCAAATC 
CTATTTATTT 
ATTCATCCTG 
ATATTTCTTT 
TGCCAAAACC 
TGCCTCCCTT 
GTTTCCAAAA 
GGCGTAGGCT 
CCTGCTGTCG 
TCTTGGGTCT 
ATCCTGAACA 
TACGGTTAGT 
CTGATCGCTT 
AGCCAAAAGT 
AGAATCTTCC 
TATTAATAAA 
CACGTACAGA 
ACTATTAAAG 
ATAACTCATG 
GAAATATTGT 
GTAATCCTTT 
CATAATTTTT 



CACCCTGTTA 
TACCCACTCA 
ACCCTTAATT 

GC CATT AT AT 
TAGTATTAGC 
GTTTGCCTTG 
GTTAATGTGA 
GCATTCTGAG 
TGGGGAGTAA 
CATATAAATG 
TCAAATTGAT 
TAGATACTAT 
AATCTGGTTA 
TATTTCATGA 
ATATTGCAAA 
AAAGAAGAGC 
TTTAGAATAA 
CAAACTACCA 
GGCTGTGGAA 
TAAGAATGAT 
TTCCATCCTG 
TAATGTTGTC 
CAAATAATGA 
TTGAAAATGT 
ATGAATATAA 
AGACTTTATC 
TCCAGTGATG 
CTTTAACTTA 
AAGAATAATA 
TGAAGATATA 
TTATAAAACG 
GTGTTTATTT 
AAATAGCCTA 
AGTGTGATTG 
TTTTTAAAAA 
CACTCCGTTT 
AAGAAAGTAA 
TAAAACTCTT 
AAAATTCTTT 
AGAGCAAAAT 
TGCCTCGTCT 
TTGCCCCTCA 
CACCCTGGGC 
AGTTGAGAAA 
AAAGCTCATA 
GGAGACTAAA 
ACCAGGACTG 
GAGACTCCTA 
TTTTAAAATA 
ATGTTGGTGT 
AATAACACAC 
CATTTTATTT 
TGCTGTTTTA 
ATATGTTTGT 
AGCAATACTT 
AAAAATTCTC 
TAAAGTTTAT 



MQFRSCSING 
NETELIKEHD 
AAARIGIVFI 
AESSILPKCI 
REEKLAAVFQ 
SLSCGHFHRT 
EKLPMEVCRN 
GIMGKEGRQA 
QFYCLFSQQT 
SIKTFLYWTI 
MAliETHFWTW 
ILMWTCIiFL 
LERVIGRCSP 



11 

I 

MKYQEINGRL 
LFFKAVSLCH 
GNSEETMEVX 
GGEIEKTRIH 
FIEKDLILLG 
MNIIxEIiINQK 
CSAVLCCRMA 
ARNSDYAIAR 
LYDSVYLTLY 
LGFSHAFIFP 
INHLVTWGSI 
DIIKKVFDRH 
THISRSWSAS 



21 
I 

VPEGPTPDSS 
TVQISNVQTD 
TLGKLERYKL 
VDEFALKGLR 
ATAVEDRLQD 
SDSECAEQLR 
PLQKAKVIRL 
FKFLSKLLFV 
NICFTSLPIL 
FGSYLLXGKD 
IFYFVFSLFY 
LHPTSTEKAQ 
DPFYTNDRSI 



31 
I 

EGNIiSYLSSI* 
CTGDGPWQSN 
LHILEFDSDR 
TLCIAYRKFT 
KVRETIEALR 
QLARRITEDH 
IKISPEKPIT 
HGHFYYIRIA 
IYSLLEQHVD 
TSLLGNGQMF 
GGILWPFLGS 
LTETNAGIKC 
LTLSTMDSST 



41 
I 

SHLNNLSHLT 
LAPSQLEYYA 
RRMSVIVQAP 
SKEYEEIDKR 
MAGIKVWVLT 
VIQHGLWDG 
LAVGDGANDV 
TLVQYFFYKN 
PHVLQNKPTL 
GNWTFGTLVF 
QNMYFVFICL 
LDSMCCFPEG 
C 



2400 
2460 
2520 
25B0 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
52B0 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
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51 
I 

TSSSFRTSPE 
SSPDEKALVE 
SGEKLLFAKG 
IFEARTALQQ 
GDKHETAVSV 
TSLSLALREH 
SMIQEAHVGI 
VCFITPQFLY 
YRDISKNRLL 
TVMVITVTVK 
LSSGSAWFAl 
EAACASVGRM 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 316 DNA sequence 
Nucleic Acid Accession »: NMJ304473 
Coding sequence: 661.. 1791 



51 



1 11 21 31 41 

I i I I I I 

CTCGCCAGCG GTCCGCGGGG CTGGAGACCC ACGCCGTGGA GAGGACCAGC CTCAGGTCGC 



60 



303 



WO 02/086443 

CCCGCCTGGG CCCGCGCCCC GACCTCGCTG CCCCCGCCTC GCCTCTCTGC CCGTGGCGCT 120 

TACCGCCACC TTGGCCTCGG GGGCAGGGCA TGGGCGGCCC CCGCCAGATC GCCCAGCGCC 180 

AGTACTAACT GCCCTCGCTC TGGCCTTCGA GCCCGAAGCC TCTTCTGCGC GCACAACCTA 240 

GGCAGTAATC CTAAACTAGC GGGCACCACA GACCAGCTGC AGCCACCCCA ACCCAGGGAT 300 

CACTTCOGGA CCCCTCGACC GCCCGGCACC AGCGCGCAAG GGACCCTTCA GCCGGAGACC 360 

AGAGTCCAGT CCCGGTCGCG AGGCCACCGC CGCTGCCCGC CTCGAGAAGC ACAACGCGGG 420 

CTGAGCCGTC GGCTAGCGGG TCACTCCCGA GCCTCTGTCT GCACCGCGCC AGCCCCAGAC 480 

CACGGACGCT GAGCCTCCAG CGCGCGCCAG CCTGGGCCGC TGGGCTCTCC GGGCCAGCCC 540 

GCGACGATCC CCTGAGCTCT CCGCAGAAGG GCCGAGCGTC CGTTCCGGGG ACGCCAGGCC 600 

CGCCCCCGCC CCCCGACAGC CGCGGGGATC CAGAGCCCGG GGGTGCGGGA CGCCCGCGCC 660 

ATGACTGCCG AGAGCGGGCC GCCGCCGCCG CAGCCGGAGG TGCTGGCTAC CGTGAAGGAA 720 

GAGCGCGGCG AGACGGCAGC AGGGGCCGGG GTCCCAGGGG AGGCCACGGG CCGCGGGGCG 780 

GGCGGGCGGC GCCGCAAGCG CCCCCTGCAG CGCGGGAAGC CGCCCTACAG CTACATCGCG 840 

CTCATCGCCA TGGCCATCGC GCACGCGCCC GAGGGCCGCC TCACGCTGGG CGGCATCTAC 900 

AAGTTCATCA CCGAGCGCTT CCCCTTCTAC CGCGACAACC CCAAAAAGTG GCAGAACAGC 960 

ATCOGCCACA ACCTCACACT CAACGACTGC TTCCTCAAGA TCCCGCGCGA GGCCGGCCGC 1020 

CCGGGTAAGG GCAACTACTG GGCGCTCGAC CCCAACGCGG AGGACATGTT CGAGAGCGGC 1080 

AGCTTCCTGC GCCGCCGCAA GCGCTTCAAG CGCTCGGACC TCTCCACCTA CCCGGCTTAC 1140 

ATGCACGACG CGGCGGCTGC CGCAGCCGCC GCTGCCGCAG CCGCCGCCGC CGCCGCCGCC 1200 

GCCGCCATCT TCCCAGGCGC GGTGCCCGCC GCGCGCCCCC CCTACCCGGG CGCCGTCTAT 1260 

GCAGGCTACG CGCCGCCGTC GCTGGCOGCG CCGCCTCCAG TCTACTACCC CGCGGCGTCG 1320 

CCCGGCCCTT GCCGCGTCTT CGGCCTGGTT CCTGAGCGGC CGCTCAGCCC AGAGCTGGGG 1380 

CCCGCACCGT CGGGGCCCGG CGGCTCTTGC GCCTTTGCCT CCGCCGGCGC CCCCGCTACC 1440 

ACCACCGGCT ACCAGCCCGC AGGCTGCACC GGGGCCCGGC CGGCCAACCC CT CTGCCT AT 1500 

GCGGCTGCCT ACGCGGGCCC CGACGGCGCG TACCCGCAGG GCGCCGGCAG TGCGATCTTT 1560 

GCCGCTGCTG GCCGCCTGGC GGGACCCGCT TCGCCCCCAG CGGGCGGCAG CAGTGGCGGC 1620 

GTGGAGACCA CGGTGGACTT CTACGGGCGC ACGTCGCCCG GCCAGTTCGG AGCGCTGGGA 1680 

GCCTGCTACA ACCCTGGCGG GCAGCTCGGA GGGGCCAGTG CAGGCGCCTA CCATGCTCGC ' 1740 

CATGCTGCCG CTTATCCCGG TGGGATAGAT CGGTTCGTGT CCGCCATGTG AGCCAGCGTA 1800 

GGGACGAAAA CTCATAGACA CATCGGCTGT TCACACGTTC CCCGCAACCT GAGAACGAAC 1860 

AGGAATGGAG AGAGGACTCA ACTGGGACCC ACGTGGAAAA GACCGAGCAG GCCACAGAGG 1920 

CTCGGTCTCC CCGCGCACAG CGTAGGCACC CTGTGTACTC TGTAAACGGG AGGAGGTGGG 1980 

GCGAGGCAGC CAGAGCCCTT GGACTGGCAC AGGGACCCTC GATGGAGCGA AGCCCTCAAA 2040 

CGGGATGCTT TCTGGCATTC TATCGGGGAG GGTCCTTGGC GGTAACCAGA GGGCAGCGTA 2100 

GTGTCAACAC CAGAGACCAG GATCCAAATT GTGGGGAATC AGTTTCAGCC TTCCATGTGC 2160 

TGCCGGAACT CGGGCCTTTT TACGCGGTTC GTCCTCTAGT GCCTTTAACT GCGTTACTAC 2220 

AATAAAAGGC TGCGGCAGCG CCTTTCTTCT TAAAGTGAGG AGGACAAATT TGCAAAAGAA 2280 

ATAGGCTTTT CTTCTTTTTT AAATTGGAGA AATCTCTGCT CTGGTTGACC TGGGCTGGTT 2340 

TTCCCTGTCT CTGAGAACTT GAGACCTAGC TCCGAGTTGA ACTGTGCGTC AGCACTCCAG 2400 

TCCCATCACC TGAACCTTCA GTCTCCCCCA TCTGTTACAC TAGAGGGCTG CAGGACTCTA 2460 

TCCACCGCCC CCGGGTTATC ATTCAGGGCC CCATCATCTT GGATGCTGCC CTGCGTATTT 2520 

GGCAGCAATG GTGGGCCACC CAGGGCCTCT GAGTAGCCAC CCAAAGCCTA GCCGCTGTTC 2580 

TAGGGAACGG AAAAGAGTTC ATGGCCAAGC GTCTAACCTA AAGTCCCAGG ATTGGCTCCA 2640 

GGCAGCAATT ATATCATAAC TTATTGAACT TTTGAGCAGG ACGTGCTGGT AATTTCATGG 2700 

CTGTTACTGC CCAGTCATAA ATCTGCTTTT CCATTATAAG GCAGAGAGAA GTACATTCGT 2760 

TCATTTGTCC ACTGTTTCTT GTCATCACGC AGCCCTGGAC CCAAAGGGTG AACTAAAGTT 2820 

TAAGGAGATG AGAGGATTCA AGGAGCCCGT TGGTGACGCC TTTCAGTAGC TGGGGAGGGC 2 880 

TCTTCCATCC CCAGCACCCC CTGCTACACC TCAGCAGCCT CCCCCATGCA AAAAGGAAAG 2940 

AGAAAAATTA AGTTAGGGCA GTCAGTAAAG TGAGCTTTAG AAAGAAACTG GAATTTTAAC 30 00 

TTCATTTTGT ATCTTGCTTA AGTAGCAGGC TCACTAAAAT TAGAGAAAGT CCAATAACTC 3 0 60 

TCCCCCTTTC CCTTGAGAAA TCTTTAAGTT TCGATTCTGG AGCAAAAACT TTCAGCATTA 3120 

AATATTTCAG AGGCTCCATT CACAGCTTTC AGATAAACTG GAGTGTTCAG ATGGACTGTT 3180 

TTAATAAAAA TCTTTGAGCA AGTGAGTTAT GGCAAGAGAA ACTCAGCCTC TTTCTGTATA 3240 

AACTTAACAG GGAAGGGCTG GGGTGTGAAA AAGAAGATTG TATGAAAACC ATTGGTAATT 33 00 

TTTATTTTTT ATTTTTGGGA CTGCACTATC CTGTTCACGA AGACATGTGA ACTTGGTTCA 3360 

GTCCAAATGG GGATTTGTAT AAACCAGTGC TCTCCATTAG AAATATGGTG CAAGCCACAT 3420 

ATGTAATTTT AAATATTCTA GTAGCCACAT TAATAAAGTN AAAAGAAACA AAAAAAAAAA 3480 
AA 

Seq ID NO: 317 Protein sequence: 
Protein Accession #: NP_004464 

1 11 21 31 41 

I I I I 1 

FKHLiTHYROI DTRANSCRIP TIONF ACTOR TTFMTAESGP PPPQPEVLAT 
GAGVPGEATG RGAGGRRRKR PLQRGKPPYS YIALIAMAIA HAPERRLTLG 
PFYRDNPKKW QNSIRHNLTL NDCFLKIPRE AGRPGKGNYW ALDPNAEDMF 
RPKRSDLSTY PAYMHDAAAA AAAAAAAAAA AAAAAIPPGA VPAARPPYPG 
LAAPPPVYYP AASPGPCRVF GLVPERPLSP ELGPAPSGPG GSCAFASAGA 
GCTGARPANP SAYAAAYAGP DGAYPQGAGS AIFAAAGRIA GPASPPAGGS 
YGRTSPGQPG ALGACYNPGG QLGGASAGAY HARHAAAYPG GIDRFVSAM 

Seq ID NO: 318 DNA sequence 
Nucleic Acid Accession ft: NM_005688 
Coding sequence: 126.. 4439 



1 11 21 31 

I I I- I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC 
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA 
AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT 
GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA 
GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC 
TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT 
GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG 
ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT 



51 
I 

VKEERGETAA 60 

GIYKFITERP 120 

ESGSFLRRRX 180 

AVYAGYAPPS 240 

PATTTGYQPA 300 

SGGVETTVDF 360 



41 51 
I I 

TGGCG CGGTT GTCCTGGAGC 60 

GCCCTGGAAC CTCCGCTCAG 12 0 

CCCCAGTCCT GGGTATAGAA 180 

CCGTGAAGAT TCCAAGTTCA 240 

AGCAGCCCGA GCCGAGGGCC 300 

GGATGAGGAG CATCCCAAGG 360 

GACTACTTCC AAACACCAGC 420 

TTCGTGGCTT TCTTCTCTGG 480 



304 



WO 02/086443 

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGOGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC I860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GAC CTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAG CG AGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

6CTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820 

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTOG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CGGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGCA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3 960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAAGA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG AOGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTOTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTG CACAG A GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 
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ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 



Seq ID NO: 319 Protein sequence: 
Protein Accession #: NP_005679 

1 11 21 31 41 51 

I I I I I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKPRR TRPLECQDAIi ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGIiFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEl VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAPK KILKLKNIKE KSLGELINIC SNDGQRMPEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVPILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 

HMTIX3FDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EBEGKHIHLG HLRLQRTIjHS IDLEIQEGKfc VGICGSVGSG 600 

KTSLISAILG QMTLIiEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSAIiDAHVG 720 

NHIFNSAIRK HTJCSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNOAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRIA SET EAR FT SV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

I GLAD L R S KL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLIjIQE TIREAFADCT 13 80 
MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLL8NDSS RFYAMFAAAE NKVAVKG 



Seq ID NO: 320 DNA sequence 

Nucleic Acid Accession »: AK022089.1 

Coding sequence i 181-1488 

1 11 21 31 -41 51 

I I I I I I 

AGCAGTTGCA CAACTTCCAG CAACTTTCTC AGCCGGCTAC TAATGAGCTG AAAGCCAGGA 60 

ACATCCGAGG AGAAGAGAAA GCTTCCAGCC CTCCTCCCTT CACCCTGGAA ATCCAGACAC 120 

CCCCACCCCC ACCCTCAGAT CACTTTAAGA TAATTTCTTT ATTCGTTTGC CCGACAGACC 180 

ATGGCTCCCT TTGGAAGAAA CTTGCTAAAG ACTCGGCATA AAAACAGATC TCCAACTAAA 240 

GACATGGATT CAGAAGAGAA GGAAATTGTG GTTTGGGTTT GCCAAGAAGA GAAGCTTGTC 300 

TGTGGGCTGA CTAAACGCAC CACCTCTGCT GATGTCATCC AGGCTTTGCT TGAGGAACAT 360 

GAGGCTACGT TTGGAGAGAA ACGATTTCTT CTGGGGAAGC CCAGTGATTA CTGCATCATA 420 

GAGAAGTGGA GAGGCTCCGA AAGGGTTCTT CCTCCACTAA CTAGAATCCT GAAGCTTTGG 480 

AAAGCGTGGG GAGATGAGCA GCCCAATATG CAATTTGTTT TGGTTAAAGC AGATGCTTTT 540 

CTTCCAGTTC CTTTGTGGCG GACAGCTGAA GCCAAATTAG TGCAAAACAC AGAAAAATTG 600 

TGGGAGCTCA GCCCAGCAAA CTACATGAAG ACTTTACCAC CAGATAAACA AAAAAGAATA 660 

GTCAGGAAAA CTTTCCGGAA ACTGGCTAAA ATTAAGCAGG ACACAGTTTC TCATG AT CGA 720 

GATAATATGG AGACATTAGT TCATCTGATC ATTTCCCAGG ACCATACTAT TCATCAGCAA 780 

GTCAAGAGAA TGAAAGAGCT GGATCTGGAA ATTGAAAAGT GTGAAGCTAA GTTCCATCTT 840 

GAT CGAGTAG AAAATGATGG AGAAAACTAT GTTCAGGATG CATATTTAAT GCCCAGTTTC 900 

AGTGAAGTTG AGCAAAATCT AGACTTGCAG TATGAGGAAA ACCAGACTCT GGAGGACCTG 960 

AGCGAAAGTG ATGGAATTGA ACAGCTGGAA GAACGACTGA AATATTACCG AATACTCATT 1020 

GATAAGCTCT CTGCTGAAAT AGAAAAAGAG GTAAAAAGTG TTTGCATTGA TATAAATGAA 1080 

GATGCGGAAG GGGAAGCTGC AAGTGAACTG GAAAGCTCTA ATTTAGAGAG TGTTAAGTGT 1140 

GATTTGGAGA AAAGCATJ3AA AGCTGGTTTG AAAATTCACT CTCATTTGAG TGGCATCCAG 1200 

AAAGAGATTA AATACAGTGA CTCATTGCTT CAGATGAAAG CAAAAGAATA TGAACTCCTG 1260 

GCCAAGGAAT TCAATT CACT TCACATTAGC AACAAAGATG GGTGCCAGTT AAAGGAAAAC 1320 

AGAGCGAAGG AATCTGAGGT TCCCAGTAGC AATGGGGAGA TTCCTCCCTT TACT CAAAGA 13 80 

GTATTTAGCA ATTACACAAA TGACACAGAC TCGGACACTG GTATCAGTTC TAACCACAGT 1440 

CAGGACTCCG AAACAACAGT AGGAGATGTG GTGCTGTTGT CAACATAGTT CCAATGGCTC 1500 

CTTTCTGACC TGCTTTCATG TTTTAATGTT TGTTTAATTT AATAGGAAAC CTCATTTTAA 1560 

ATATAACACT CAAAAAAATG TAAATCATAT TGTAGTATTC AATAGTTAAT AAAAACTCGA 1620 
GAAATGTGTT GTTTCTG 

Seq ID NO: 321 Protein sequence: 
Protein Accession #: NP_00543 8.1 

1 11 21 31 41 51 

MAPFGRNLLK TRHKNRSPTK DMDSEEKBIV VWVCQEEKLV CGLTKRTTSA DVIQALLEEH 60 

EATFGEKRFL LGKPSDYCII EKWRGSERVL PPLTRILKLW KAWGDEQPNM QFVLVKADAF 120 

LPVPLWRTAE AKLVQNTEKL WELSPANYMK TLPPDKQKRI VRKTFRKLAK IKQDTVSHDR 180 

DNMETLVHLI ISQDHTIHQQ VKRMKELDLE IEKCEAKFHL DRVENDGENY VQDAYLMPSF 240 

SEVEQNLDLQ YEENQTLEDL SESDGIEQLE ERLKYYRILI DKLSAEIEKE VKSVCIDINE 300 

DAEGBAASEL ESSNLESVKC DLEKSMKAGL KIHSHLSGIQ KEIKYSDSLL QMKAKEYELL 360 

AKEFNSLHIS NKDGCQIjKEN RAKESEVPSS NGEIPPFTQR VFSNYTNDTD SDTGISSNHS 420 
QDSETTVGDV VLLST 

Seq ID NO: 322 DNA sequence 

Nucleic Acid Accession #: NM_030920.1 
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Coding sequence: 317-1123 



1 11 21 31 41 51 

I I I I I I 

AGCATTGAAG GGGAAGGAAC TGCGGGTGTG GTGTGTGTAT GTGTGTGTGT ATGTGTGTGC 60 

GGCGCGTGCG TGCGTGTGTG TGCGCGCGCT AGTGTGTGGA CAAGGAGGTG GGGGCAGCTG 120 

AGTTAGAGTC CCAACTCTTG GACTCCATTT GCTATTCTCT TCTTTCTCCC CCACACCTAT 180 

CTGGTGGTGG TAGTGGGCGT TTATATTTGC GTTCCTTTTC ATTCATTTCT AAATCTCTTA 240 

AAAATTTTGG GTTGGGGGTA TTGGGGAAGG CAGGAAAGGG AAAAGGAGAG TAGTAGCTGA 300 

AGAGCAAGAG GAGGACATGG AGATGAAGAA GAAGATTAAC CTGGAGTTAA GGAACAGATC 360 

CCCGGAGGAG GTGACAGAGT TAGTCCTTGA TAATTGCCTG TGTGTCAATG GGGAAATTGA 420 

AGGCCTGAAT GATACTTTCA AAGAACTAGA ATTTCTGAGT ATGGCTAATG TGGAACTAAG 480 

TTCGCTGGCC CGGCTTCCCA GCTTAAATAA ACTTCGAAAA TTGGAGCTTA GTGATAATAT 540 

AATTTCTGGA GGCTTGGAAG TCCTGGCAGA GAAATGTCCA AATCTTACCT ACCTCAATCT 600 

GAGTGGAAAC AAAATAAAAG ATCTCAGTAC AGTAGAAGCT CTGCAAAATC TTAAAAATTT 660 

GAAAAGTCTT GACCTGTTTA ACTGTGAGAT CACAAACCTG GAAGATTATA GAGAAAGTAT 720 

TTTTGAACTA CTGCAGCAAA TCACATACTT AGATGGATTT GATCAGGAGG ATAATGAAGC 780 

GCCGGACTCT GAAGAGGAGG ATGATGAGGA TGGAGATGAA GATGATGAAG AGGAAGAGGA 840 

AAATGAAGCT GGTCCACCGG AAGGATATGA GGAAGAGGAG GAGGAAGAGG AAGAGGAGGA 900 

TGAGGATGAG GATGAAGATG AAGATGAAGC AGGTTCAGAG TTGGGAGAGG GAGAAGAGGA 960 

AGTGGGCCTC TCATACTTAA TGAAAGAAGA AATTCAGGAT GAAGAAGATG ATGATGACTA 1020 

TGTTGAAGAA GGGGAAGAAG AGGAAGAAGA GGAAGAAGGA GGTCTTCGAG GGGAGAAGAG 1080 

GAAACGAGAT GCTGAAGACG ATGGAGAGGA AGAAGATGAC TAGATCATTC TAAGACCAGA 1140 

TTCTCTAATG TTTCTGGGTG TGCAATAGAG TGATCACATC TTTGTTTCTT CATGTACGAT 1200 

AGCTATCCCT ACAGAAGATA ATGTGTAACT TTTTATAGGA AAAGTGTGGT TTTACTATTT 1260 

TTGCCTTATC ATTCCAAATA AGAACTAGTC TGTTAATGAT CATATTGTAT GTAGAGAAAA 1320 

ATTTTCATTG ACTCCCATTG TGGAATTCCC TAGCAATTTA TTTAGACTTA ATTTTTTAAA 1380 

TTCAAGCTTA CTGTATTAGT CATTTTTAGC CCATAATTAA AACATGATCA CTTTTAAACA 1440 

GGTGTAGTAT GGTGCATTTC ATTCCTTATT TATAGATTAA CTGAAATTAC AGTTTGCTAT 1500 

AATATAAAAT GACAATAGTC TCTTGAGTGG TAAGTTGGTT ATTTTTTTAG AGGTGATCCA 1560 

GGAATCTTTA GTTTGAAGGC AGTTACCTTT TTTTTTTTTT TTTTTTTTTG ACTAAGAGTG 1620 

TTTGGTTGCT TTTTTGTCAC AAGTAACTTG GAAAATAGAA GCAGAATAGT AAAGGTTCTA 1680 

TTCAGCAACA TAGTTCATGG ATTTTGTGGA GGTTCTATTC AGTAATATGG TTCATGGATT 1740 

TAGTGGTGAC TGATAAGATT TTATTTTTGA AGGAAAAATT GCTTATACTA AGTCCAGAGA 1800 

CATGCAGGTG AGCCCTTTTG TCAGGCTGCA AATCATGACA TGCCGATGGT TGTTTATTTT 1860 

GTTTTTAGGT GTGCATTCTT TTTCTTCTTA GCAATTCCTT TATGATCACC TTCCCTTCTT 1920 

GTTTCACTCC CTCCCGCTCT CTCAAAAGGA ACTTGGGAAA CTTGTGAAAC CCAGGAAAAC 1980 

CTTTAGTCTT ATACCTCAAC TACGTTTCAG TCCTGTCTGG GTTTTAAATA AGTGAAGTAG 2040 

AAGAAATTGA GTATTTTCTG ACATAAGAAT ATATTATCAA TACAGTTTTA TGCAGTAAGC 2100 

TCTCCTTACC ATAAATGTTT CTTGGTTGAC AACATCTAAG ACAATATTAG TGGGATGAAG 2160 

AAAGAAAAGC AGGGGTGCTT TTGGAAGCAG TGTTAGTGTT CCTCAAAAGT CGGAACAATT 2220 

G CCTGTTG AT ATATTAATAA GACATTAAAG TCAAATTTTA ATGTTGGCCT CTCAAATGAT 2280 

TTGGATACCA CTCTGCAAAG TATTTCTAAC CTTTAATTCC CAGTTTTAAA ACAGATATAA 2340 

TAATAGCATT TAATTGGAAT ATACTAGGCA GCTGGAAAAG TATTTGAAAC TAAATTGACA 2400 

TTAAAATTAA GATTTGTTTT CAAGTGGATG TCCATTAAAA GTAGAAAAAT ATTTGGGATA 2460 

AGTGAGTGTG TGTTTCCTTA CATGGCTACT AAATAAAATA TAATGAGTAT ACAAGTATAT 2520 

CTCCTCTTTT GCTATGGAGG CTCCATGTTC AAGGCAATGG CTTTTTAAAT CTTGGCTATC 2580 

TAAAATTTTT TCCCTTTGTT TTGAATATTT GTAAGTTTTT AAGAAGTTAG TGTCAGCAAA 2 640 

TTAATTGAAG TTATGCTTCT ATACTGGGAC ATATTTAAAT ACTGAGTATA GTACTGCTGC 2700 

TACTGCTTCT ACAATGTAAA ATGTATGACT TGGTGTTTTA AAGTAAAAAT TATGATGTTA 2 760 

CTTGTGGAGA AACTAAAAAT GTTGTACAAC TGACCGAAAG AAAACCCTTG GGGATAAGTT 2820 

TAGTGAGGGG ATTGGAATCC CCAAAAAGAT AACATTTTTC TTCTGCTTTT AAAAACTGAA 2880 

ATTCCCTGTT CTAGTTCCTA ACAATTCTCA TTACATACTA TGCCAGATTA CAAAATACTT 2940 

ATTTTTAAAA TGAAATCTAT ATATTGACTT TCTTATCAAT CATCTTACTG TGCAATCAAA 3 000 

ATTAGAGTAC TTTGGTTTGA AAACAACACT TAGAGCCTCC AGATAACTTT TAAGACTTAT 3 060 

TTAGCTTTGT GGGTGGTATT TTCATGCAAA TAAGTAAGGG TGGGTTTTAT ATTTTGTAGA 3120 

AGTTTTCGGT CCTATTTTAA TGCTCTTTGT ATGGCAGTAT GTATATATTG TGTTAAGTTC 3180 

CTCAAGAATC TCCTTAAAAA CTTTGAAGTT AATACTTTTG TGCAACTGTG TTTTGAATAA 3240 
AGCCATGACA GTGTTAAAAA CAAAC 

Seq ID NO: 323 Protein sequence: 
Protein Accession #: NP_112182.1 

1 li 21 31 41 51 

I I I I I I 

MEMKKKINLE LRNRSPEEVT ELVLDNCLCV NGEIEGLNDT FKELEFLSMA NVBLSSLARL 60 

PSLNKLRKLE LSDNIISGGL. EVLAEKCPNli TYLNLSGNKI KDLSTVEALQ NIjKNLKSLDL 120 

FNCEITNLED YRESIFELLQ QITYLDGFDQ EDNEAPDSEE EDDEDGDEDD EEEEENEAGP 180 

PEGYEEEEEE EEEEDEDEDE DEDEAGSELG EGEEEVGLSY LMKEEIQDEE DDDDYVEEGE 240 
EEEEEEEGGL RGEKRXRDAE DDGEEEDD 

Seq ZD NO: 324 DNA sequence 
Nucleic Acid Accession #: NMJJ03812 
Coding sequence: 224.. 2722 

1 11 21 31 41 51 

I I I I I I 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCCGGGAA AGGGGGCGCC 60 

GCCCAGCCCC GAGCCCCGCG CCCCGTGCCC CGAGCCCGGA GCCCCCTGCC CGCGGCGGCA 120 

CCATGCGCGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCCGGC 180 

GCTCTCGCCG GCCACACGGA GCGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 240 

CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC GGCGCTTCCT GCGGCCCCCA 300 

ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 360 

GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCCGCCTCG TCCCGGCCCC GCGCCTGGGG 420 

GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 480 

GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 540 

AATGCAGAAA GAAATCACAC TGCCTTCAAG ACTCATATAT TACATCAACC AAGACTCGGA 600 
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AAGCCCTTAT CACGTTCTTQ ACACAAAGGC AAGACACCAG 
CCATCTGGCC CAGGCAAGCT TCCAGATTGA AGCCTTCGGC 
CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG 
ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT 
AGACTCCAAG GTGGCTCTGT CAACCTGCAA TGGACTTCAT 
CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT 
ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG 
GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG 
AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA 
TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT 
AAAGTCCGTG GTCAACCTTG TGGATTCTAT TTACAAGGAG 
CCTGGTGGCT GTAGAGACCT GGACTGAGAA GGATCAGATT 
GCAGATGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT 
GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC 
TGTCTGTTCT CGCACAAGAG GAGTTGGTGT GAATGAGTAT 
ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC 
AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC 
GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG 
AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA 
AAATGGATAC GTGGAAGCTG GGX3AGGAGTG TGATTGTGGT 
ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC 
TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA 
GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG 
GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC 
CAGAGACAAC CAGTGTCAGT ACATCTGGGG AACAAAGGCT 
CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC 
GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC 
TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT 
AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA 
CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG 
ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT 
GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT 
AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC 
GGGTCCTAGT GCCACCAATC TCATAATAGG CTCCATCGCT 
TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC 
TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG 
ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT 
CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC 
CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC 
GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA 
ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA 

Seq ID NO: 325 Protein sequence t 
Protein Accession ft: NP 003803 



MKPPGSSSRQ 
RPRAWGAAAP 
INQDSESPYH 
HYENGKPQYS 
KSTGRPHIIQ 
LELMIVNDKK 
ITTNPVQMLH 
LPMAVAQVLS 
RDFLQRGGGA 
SDGPCCNNTS 
NGECKTRDNQ 
LCTNLTRAPR 
LDRKCLQIQA 
KDEGPKGPSA 



11 
I 

PPLAGCSLAG 
SAPHWNETAE 
VLDTKARHQQ 
KGGBHCYYHG 
KTLAGQYSKQ 
TYKKHRSSHA 
EFSKYRQRIK 
QSLAQNLGIQ 
CLFNRPTKLP 
CLFQPRGYEC 
CQYIWGTKAA 
IGQLQGEIIP 
LNMSSCPLDS 
TNLIIGSIAG 



21 
I 

ASCGPQRGPA 
KNLGVIiADED 
KKNKAVHLAQ 
SIRGVKDSKV 
MKNLTMERGD 
HTNNFAKSW 
QHADAVHMS 
WEPSSRKPKC 
EPTECGNGYV 
RDAVNECDIT 
GSDKFCYEKL 
TSFYHQGRVI 
KGKVCSGHGV 
AILVAAIVIiG 



31 
I 

GSVPASAPAR 
NTLQQNSSSN 
ASFQIEAFGS 
ALSTCNGLHG 
QWPFLSELQW 
NLVDSIYKEQ 
RVTFHYKRSS 
DCTESWGGCZ 
EAGEECDCGF 
EYCTGDSGQC 
NTEGTEKGNC 
DCSGAHWLD 
CSNEATCICD 
GTGWGFKNVK 



Seq ID NO: 326 DNA sequence 
Nucleic Acid Accession &: AK074418 . 
Coding sequence: 244-1515 



1 

I 

CTTTCTCCAA 
ACCTGAGAGC 
AACAGATGGA 
AAATGACAGA 
GTCATGGCGT 
GACTTTACCA 
TTCCCCGCAG 
GTGATATGGA 
ATAAGCAGAT 
GGATCCTTGA 
CACCAGTATG 
GTGATTGATG 
CAAAACCAAG 
TATTCCGATC 
ATCACCAACA 
ACCAAGGCAG 
GCGATGGAGA 
CAATACCGAA 
ACCGAATGGA 



11 
I 

GACGG CCGGC 
CTGTGGAACC 
AAAAGTGAGG 
GCTGGAGTTC 
ATTACCAGGA 
CCTTGCGGGA 
CAGATTCTTC 
AGCGGCCACA 
TTGACATCCA 
CTCAGAACCC 
CTGGCATTTT 
ACCGCCTACC 
AGTTCTGGCC 
TGCACTATGG 
TCCATCTGCA 
GCTCCCTGAT 
ATGGGCTGGT 
GGGGCTGGGA 
GAGGGCGCTG 



21 
i 

CATGCTCTCC 
TGCCCGTCTC 
GACCGGTGAG 
TGCTGTGCCT 
GCCTTCAGTG 
TCACTGCCTG 
CATAGGCCAG 
GGATCTACCA 
ACAAGGAGGC 
ACAGTACAGG 
CCGTTTCCGG 
TGTCCAGGGA 
CTGCCTGCTG 
CTTCCTCGAG 
CTCTTCCCCT 
AACCTGTGCC 
GAGTCTCCAT 
AGAAATTATC 
GAGTGATGGG 



31 

I 

TCCT CTGCCA 
CCCTCCTCCA 
TGACTTGCTG 
GGAAAGGACC 
GAGACCTCCA 
AGCATGGGCC 
AAGCTGCTCC 
GGGGGTCCTC 
GCAGCTGACT 
CAGAAGATCC 
TTCTGGCAAT 
GATAAATGCC 
GAGAAGGCCT 
GATG CCCTGG 
GTGGACCTGG 
ACTCCAAGTG 
GCCTACACTG 
TCCCTGTGGA 
TCTCAGGAGT 



CAAAAACATA 
TCCAAATTCA 
ATTCACTACG 
GGAAGCATCA 
GGCATGTTTG 
GAGAAAAGCA 
CAAATGAAGA 
TGGTTGAAAA 
TATTTGGAAC 
GCACATACCA 
CAGCTCAACA 
GACATCACCA 
AAGCAGCATG 
AGTCTGAGTT 
GGTCTTCCAA 
CAATGGGAAC 
ATCATGGAGG 
TATAGAGACT 
TTTGAGCCCA 
TTTCATGTGG 
TGCAGCGACG 
TGCCGGGATG 
TGCCCACCAA 
TACAATGGCG 
GCAGGGTCTG 
TGCGGGAAGG 
TTACTCTGTA 
CCAACTTCCT 
GATGATGATA 
TGTTTAGATC 
TCCAAGGGTA 
GATTTCACCT 
CCCAAGGATG 
GGTGCCATCC 
AAGAAGAGAA 
ACACCGCCTT 
ATTAAGTTTG 
AAGGATGGGG 
ACCTGTCAGT 
TCTTTTTTTT 
GGAATCATTA 



41 

I 

TPPCRLIiLVL 
ISYSNAMQKE 
KFILDLILNN 
MFEDDTFVYM 
LKRRKRAVNP 
LNTRWLVAV 
LSYFGGVCSR 
MEETGVSHSR 
HVECYGLCCK 
PPNLHKQDGY 
GKDGDRWIQC 
DDTDVGYVED 
FTWAGTDCSI 
KRRFDPTQQG 



41 
I 

GTCTCCTCCA 
TCAGACACAC 
CTAAAGTTTA 
TCGGAAGTCT 
TCATCAAGTT 
GGACGTTTAA 
AGGAAAAACG 
CT CACTTCAT 
GCTGGTTCCT 
TGATGGTCCA 
GTGGCCAGTG 
TCTTTGTGCG 
ATGCCAAGCT 
TGGACCTCAC 
TGAAGGCAGT 
GGCCAACAGA 
TGACTGGGGC 
ACCCCTGGGG 
GGGAGGAAAC 



ATAAGGCTGT 
TTCTTGACCT 
AAAATGGGAA 
GAGGCGTCAA 
AAGATGATAC 
CAGGTCGACC 
ATCTCACTAT 
GAAGGAAGAG 
TTATGATTGT 
ACAACTTTGC 
CCAGGGTTGT 
CCAACCCTGT 
CTGATGCTGT 
ACTTTGGAGG 
TGGCAGTGGC 
CTTCTAGCAG 
AAACAGGGGT 
TTTTACAGAG 
CGGAATGTGG 
AATGCTATGG 
GGCCCTGCTG 
CTGTGAACGA 
ATCTTCATAA 
AGTGCAAGAC 
ACAAGTTCTG 
ATGGAGACCG 
CCAATCTTAC 
TCTACCATCA 
CGGATGTGGG 
GGAAGTGCCT 
AAGTCTGTTC 
GGGCAGGGAC 
AAGGACCCAA 
TGGTAGCAGC 
GGTTCGATCC 
GCACTGTTGG 
TAAACAAAAC 
TAAAAGAAAA 
AAACGGGGGA 
TCCCTAATGG 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
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51 

I 

LLLPPLAASS 
ITIiPSRLIYY 
GLLSSDYVEI 
IEPIiELVHDE 
SRGIFEEMKY 
ETWTEKDQID 
TRGVGVNEYG 
KFSKCSIIiEY 
KCSLSNGAHC 
ACNQNQGRCY 
SKHDVFCGFL 
GTPCGPSMMC 
RDPVRNLHPP 
PI 



51 
I 

CCACTCTCTA 
CTGCCTAGGA 
TACCAGATGC 
TCTAAGGAGA 
CAAAGACCAG 
GGATGAGACA 
CCTCTCCAAT 
CCTGGAT GAT 
GGCAGCACTG 
AAGCTTTTCA 
GGTGGAAGTG 
TCCTCGCCAC 
GCTCGGATCC 
AGGAGGCGTG 
GAAGACAGCG 
TACAGCACAG 
TGAGCAGATT 
CTGGGGCGAG 
CTGTGATCCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



308 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

CGGAAAAGCC AGCTACATAA 0AAAC6GGAA GATGGCGAGT 
TTCCAACAGA AATTCATCGC CATGTTTATA TGTAGCGAAA 
GGAAACACAC TCCACGAAGG ATQGTCCCAA ATAATGTTTA 
AACACTGCAG GAGGACCTCG GAATGATGCT CAATTCAACT 
GAAGGCACCA ATGTTGTCGT GTGCGTCACA GTTGCTGTCA 
GAAGATGCAA AATTTCCACT CGATTT CCAA GTGATTCTGG 
CCAAAGCTCA AATAATAAAT TCCGCCGCAA CTTCACCATG 
QAACTATGTT GTGGTTGCAC AGACACGGAG AAAATCAGCG 
CCTGAAAATG CCAGACAGTG ACAGGCACCT GAGCAGCCAT 
AAGCCCTTCA GAACATGGCT CCCAACAAAG CATTTTCAAC 
GTACCTAGCA CCCAGGGGCC TTACGTGGGA TTGGAGAAAG 
CCCTCACAGG CCCTTACTGG GATGCAGAGA GGAGAAGTGA 
GCCTCTCTTC CTGGATCGTC TCCAGAACTG CTGTGGCTGC 
CGCCCCACCC AGTCTCATCC GGGGGACTTC AAGCTGGAAT 
GGATAATTAT GGGGTGTGAG GTGCATTGCC CTCTAAATCT 
ACCCCGTGAA ACCTTTCCTT CTCCTACTCG- GCCACCTCCC 
CCCGGGAGCT AGCCAGCTTC AGAAAGCACA TACAGCATCC 
TGCACACAGG ATTTCCTTAA TGGCTTAATA AACTGTTATA 
GAATAAAATA GCTGCCAGGG GCTCTGCACA ATGAGCCTCT 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 327 Protein sequence: 
Protein Accession #: BAB85075.1 



1 
I 

MAYYQEPSVE 
IWKRPQDIiPG 
QYAGIPRPRF 
SDLHYGFLED 
MENGLVSLHA 
KSQLHKKRED 
TAGGPRNDAQ 
KLK 



11 
\ 

TSIIKFKDQD 
GPPHFILDDI 
WQCGQWVEW 
ALVDLTGGVI 
YTVTGAEQIQ 
GBFWMSCQDP 
FNPSVQEPME 



21 
1 

FTTLRDHCLS 
SRFDIQQGGA 
IDDRLPVQGD 
TNIHLHSSPV 
YRRGWEBIIS 
QQKPIAMFIC 
GTNVWCVTV 



31 
I 

MGRTFKDETP 
ADCWFLAALG 
KCLFVRPRHQ 
DLVKAVKTAT 
LWNPWGWGET 
SEIPITLDHG 
AVTPSNLKAE 



TTTGGATGTC 
TTCCAATTAC 
GGAAGCAAGT 
TCTCTGTGCA 
CACCATCAAA 
CTGGCTCACA 
ACTTACCATC 
GAGTTCTTGC 
TTCAACCTCA 
AGATATGCTC 
GGGACCTGAG 
CTTGATGGAC 
CAAGCTCGGT 
GCAGAGCTTA 
TTAAACAAGC 
ACCAACCTGG 
TTGCTGCCAA 
AAGAACTCCT 
TACCGTTAAA 



41 
I 

PAAOSSXGQK 
SLTQNPQYRQ 
NQEFWPCLLE 
KAGSLITCAT 



NTLHEGWSQI 
DAKFPLDFQV 



GTGTCAAGAT 
CCTGGACCAT 
GATTCTAGGA 
AGAGCCAATG 
TTTGAAAGCA 
GAAACACTGT 
TGAGCCCTGG 
TCCGAATCTT 
GAATGAAGGG 
AGCAGGTATG 
GGAGGGACAG 
TATTTTACCT 
AGAGACGTGG 
GAAAGGGAGG 
AATTGGCAGT 
CAT CGTT CCT 
ACCACCTATG 
TGACTTGTCA 
AAAAAAAAAA 



51 
I 

LLQEKRLSNV 
KILMVQSPSH 
KAYAKLLGSY 
PSGPTDTAQA 
QEWEETCDPR 
MFRKQVIbGN 
ILAGSQKHCP 



Seq ID NO i 328 DNA sequence 

Nucleic Acid Acceesion ttt BC017490.1 

Coding sequence: 74-2788 



1 

1 

GTGGGTCACG 
TCGTGGTACT 
GCGTCGGCGA 
TGCCCTCACC 
CCTAGGCACA 
CATGGAAAGG 
TCTGGATGAT 
CATGCGGCAG 
GTATGACAGC 
GGCCACGGAG 
TCTCAAAGGC 
CCACCGCTTC 
GGAGCGCATC 
CTTGGCAGCC 
GCAGATCTTT 
CATCACCAAC 
GCTGAGGCAG 
TGGCGTCCTG 
GGGTCCTTTC 
GTCGGCCGGC 
CCGAATCCAG 
CATTCTCCTC 
CATCTATCAC 
CACTGTCATC 
GACCGATGAA 
GATCTTTGCC 
TCTGGCCCTG 
TATCAACGTG 
TGAGAAAGTG 
CACGGCGTAT 



CAGAACCAGC 
CGTCACCTCC 
CTACGACCCC 
CTTTGACATC 
CCGCTTCGTG 
GGCCAATGGC 
CCAGGAGGTC 
CCAGATGGAC 
GACAGGCAGC 
CCACGCGCGC 
CGTGATGCTG 
GACTTTTGCC 
GAAGCAGTTA 
CACTATTGAG 



11 
I 

TGAACCACTT 
GCTATGGCGG 
GGCAATGATC 
TCCAGCCCTG 
GAGGGGCCCC 
GACTACCGCG 
GAGGACGTAG 
CGTGACCGGG 
GATGAGGAGG 
GACGGOGAGG 
CACTCTGTGC 
AAGAACTTCC 
AGCGACATGT 
AGGGAGCACG 
GATGAGGCTG 
CACATCCATG 
CTGCATCTGA 
CCCCAGCTCA 
TGCCAGTCCC 
CCCTTTGAGG 
GAGAGTCCAG 
GCAGATCTGG 
AACAA CTATG 
CTAGCCAACC 
GATGTGAAGA 
AGCATTGCTC' 
TTCGGAGGGG 
CTCTTGTGCG 
TCCAGCCGAG 
GTCCAGCGGC 
GACCGAGGAG 
ATCCATGAGG 
CTGCAGGCTC 
TCGCTGACTT 
CTGTGTGTGG 
GTGGGCAGCC 
AGCGCTGCTG 
CTGAAGAAGT 
CAGGACAAGG 
ATCCCCATTA 
ATCCATCTGC 
GAGAGCTTCA 
CGCTACCTTT 
GTGGCAGAGC 
GTCCCTGAGA 



21 
1 

TTCGCGCGAA 
AATCATCGGA 
CTCTCACCTC 
GCCGTGACCT 
TGGAGGAAGA 
CCATCCCAGA 
AGGAGCTGAC 
AGGCTGGCCG 
ACGAGGAGCG 
AGGACGAGGA 
GCGAGTGGGT 
TGCGCACTCA 
GCAAAGAGAA 
TGCTGGCCTA 
CCCTGGAGGT 
TCCGCATCTC 
ACCAGCTGAT 
GCATGGTCAA 
AGAACCAGGA 
TCAACATGGA 
GCAAAGTGGC 
TGGACAGCTG 
ATGGCTCCCT 
ACGTGGCCAA 
TGATCACTAG 
CTTCCATCTA 
AGCCCAAAAA 
GAGACCCTGG 
CCATCTTCAC 
ACCCTGTCAG 
TGTGTCTCAT 
CCATGGAGCA 
GCTGCACGGT 
TCTCTGAGAA 
TGAGGGACAC 
ACGTCAGACA 
AGCCCGCCAT 
ACATCATCTA 
TGGCCAAGAT 
CGGTGCGGCA 
GGGACTATGT 
TAGACACACA 
CATTCCGGCG 
AGGTGACATA 
AGGACTTGGT 



31 
\ 

ACCTGGTTGT 
ATCCTTCACC 
CAGCCCTGGC 
TCCACCATTT 
AGAGGATGGA 
GCTGGACGCC 
GGCCAGTCAG 
GGGCCTGGGC 
CCCTGCCCGC 
GATGATCGAG 
GAGCATGGCG 
CGTCGACAGC 
CCGTGAGAGC 
CTTCCTGCCT 
GGTACTGGCC 
CCACCTGCCT 
CCGCACCAGT 
GTACAACTGC 
GGTGAAACCA 
GGAGACCATC 
GGCTGGCCGG 
CAAGCCAGGA 
CAACACTGCC 
GAAGGACAAC 
CCTCTCCAAG 
TGGTCATGAA 
CCCAGGTGGC 
CACAGCGAAG 
CACTGGCCAG 
CAGGGAGTGG 
TGATGAATTT 
ACAGAGCATC 
CATTGCTGCC 
CGTG GACCTC 
CGTGGACCCA 
CCACCCCAGC 
GCCCAACACG 
CGC CAAGGAG 
GTACAGTGAC 
CATCGAGTCC 
GATCGAAGAC 
GAAGTTCAGC 
TGACAACAAT 
TCAGCGCAAC 
GGATAAGGCT 



41 
I 

TGCTGTAGTG 
ATGGCATCCA 
CGAAGCTCCC 
GAGGATGAGT 
GAGGAGCTCA 
TATGAGGCCG 
AGGGAGGCAG 
CGCATGCGCC 
AAGCGCCGCC 
AGCATCGAGA 
GGCCCCCGGC 
CACGGCCACA 
CTGGTGGTGA 
GAGGCACCGG 
ATGTACCCCA 
CTGGTGGAGG 
GGGGTGGTGA 
AACAAGTGCA 
GGCTCCTGTC 
TATCAGAACT 
CTGCCCCGCT 
GACGAGATAG 
AATGGCTTCC 
AAGGTTGCTG 
GATCAGCAGA 
GACATCAAGA 
AAGCACAAGG 
TCGCAGTTTC 
GGGGCGTCGG 
ACCTTGGAGG 
GACAAGATGA 
TCCATCTCGA 
GCCAACCCCA 
ACAGAGCCCA 
GTCCAGGACG 
AACAAGGAGG 
TATGGCGTGG 
AGGGTCCACC 
CTGAGGAAAG 
ATGATCCGCA 
GACGTCAACA 
GTCATGCGCA 
GAGCTGTTGC 
CGCTTTGGGG 
CGTCAGATCA 



51 
I 

GCGGAGAGGA 
GCCCGGCCCA 
GGCGTACTGA 
CCGAGGGGCT 
TTGGAGATGG 
AGGGACTGGC 
CAGAGCGGGC 
GTGGGCTCCT 
AGGTGGAGCG 
ACCTGGAGGA 
TGGAGATCCA 
ACGTCTTCAA 
ACTATGAGGA 
CGGAGCTGCT 
AGTACGACCG 
AGCTGCGCTC 
CCAGCTGCAC 

CTGAGTGCCA 
ACCAGCGTAT 
CCAAGGACGC 
AG CTGACTGG 
CTGTCTTTGC 
TAGGGGAACT 
TCGGAGAGAA 
GAGGCCTGGC 
TACGTGGTGA 
TCAAGTATAT 
CTGTGGGCCT 
CTGGGGCCCT 
ATGACCAGGA 
AGGCTGGCAT 
TAGGAGGGCG 
TCATCTCACG 
AGATGCTGGC 
AGGAGGGGCT 
AGCCCCTGCC 
CGAAGCTCAA 
AATCTATGGC 
TGGCGGAGGC 
TGGCCATCCG 
GCATGCGCAA 
TCTTCATACT 
CCCAGCAGGA 
ACATCCACAA 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



309 



WO 02/086443 

CCTCTCTGCA TTTTATGACA GTQAGCTCTT CAGGATGAAC AAGTTCAGCC ACGACCTGAA 2760 

AAGGAAAATG ATCCTGCAGC AGTTCTGAGG CCCTATGCCA TCCATAAGGA TTCCTTGGGA 2 820 

TTCTGGTTTG GGGTGGTCAG TGCCCTCTGT GCTTTATGGA CACAAAACCA GAGCACTTGA 2880 

TGAACTCGGG GTACTAGGGT CAGGGCTTAT AGCAGGATGT CTGGCTGCAC CTGGCATGAC 2940 

TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT CTCACCTTTG GGTGGGATGC CTTGCCAGTG 3000 

TGTCTTACTT GGTTGCTGAA CATCTTGCCA CCTCCGAGTG CTTTGTCTCC ACTCAGTACC 3060 

TTGGATCAGA GCTGCTGAGT TCAGGATGCC TGCGTGTGGT TTAGGTGTTA GCCTTCTTAC 3120 

ATGQATGTCA GGAGAGCTGC TGCCCTCTTG GCGTGAGTTG CGTATTCAGG CTGCTTTTGC 3180 

TGCCTTTGGC CAGAGAGCTG GTTGAAGATG TTTGTAATCG TTTTCAGTCT CCTGCAGGTT 3240 

TCTGTGCCCC TGTGGTGGAA GAGGGCACGA CAGTGCCAGC GCAGCGTTCT GGGCTCCTCA 3300 

GTCGCAGGGG TGGGATGTGA GTCATGCGGA TTATCCACTC GCCACAGTTA TCAGCTGCCA 3360 

TTGCTCCCTG TCTGTTTCCC CACTCTCTTA TTTGTGCATT CGGTTTGGTT TCTGTAGTTT 3420 
TAATTTTTAA TAAAGTTGAA TAAAATATAA AAAAAAAAAA AAAAAA 

Seq ID NO: 329 Protein sequence: 
Protein Accession #: AAH17490.1 

1 11 21 31 41 SI 

I I I I I I 

MAESSESPTM ASSPAQRRRG NDPLTSSPGR SSRRTDALTS SPGRDLPPFE DBSEGLLGTE 60 

GPLEEEEDGE ELIGDGMERD YRAIPELDAY EAEGLALDDE DVEELTASQR EAAERAMRQR 120 

DREAGRGLGR MRRGLLYDSD EEDEERPARK RRQVE RATED GEEDEEMIES IENLEDLKGH 180 

SVREWVSMAG PRLEIHHRFK NFLRTHVDSH GHNVFKERIS DMCKENRESL WNYEDLAAR 240 

EHVLAYFLPE APAELLQIPD EAALEWLAM YPKYDRITNH IHVRISHLPL VEELRSLRQL 300 

HLNQLIRTSG WTSCTGVLP QLSMVKYNCN KCNPVLGPFC QSQNQEVKPG SCPECQSAGP 360 

PEVNMEETIY QNYQRIRIQE SPGKVAAGRL PRSKD AI LLA. DLVDSCKPGD EIELTGIYHN 420 

NYDGSLNTAN GFPVFATVIL ANHVAKKDNK VAVGELTDED VKMITSLSKD QQIGEKIFAS 480 

IAPSIYGHED IKRGLAliALF GGEPKNPGGK HKVRGDINVli LCGDPGTAKS QFLKYIEKV8 540 

SRAIFTTGQG ASAVGLTAYV QRHPVSREWT LEAGALVLAD RGVCLIDEFD KMNDQDRTSI 600 

HEAMEQQSIS ISKAGIVTSL QARCTVIAAA NPIGGRYDPS LTFSENVDljT EPIISRFDIL 660 

CWRDTVDPV QDEMLARFW GSHVRHH P SN KEEEGLANGS AAEPAMPNTY GVEPLPQEVL 72 0 

KKYIIYAKER VHPKLNQMDQ DKVAKMYSDL RKESMATGSI PITVRHIESM IRMAEAHARI 780 

HLRDYVIEDD VNMAIRVMIiE SFIDTQKFSV MRSMRXTFAR YLSFRRDNNE LLLFILKQLV 840 

AEQVTYQRNR FGAQQDTIEV PEKDLVDKAR QINIHNLSAF YDSELFRMNK FSHDLKRKMI 900 
LQQF 

Seq ID NO: 330 DMA sequence 
Nucleic Acid Accession #: Ml 7254 
Coding sequence: 257-1645 

1 11 21 31 41 51 

I 1 I I I i 

GTCCGCGCGT GTCCGCGCCC GCGTGTGCCA GCGCGCGTGC CTTGGCCGTG CGCGCCGAGC 60 

CGGGTCGCAC TAACTCCCTC GGCGCCGACG GCGGCGCTAA CCTCTCGGTT ATTCCAGGAT 120 

CTTTGGAGAC CCGAGGAAAG CCGTGTTGAC CAAAAGCAAG ACAAATGACT CACAGAGAAA ISO 

AAAGATGGCA GAACCAAGGG CAACTAAAGC CGTCAGGTTC TGAACAGCTG GTAGATGGGC 240 

TGGCTTACTG AAGGACATGA TTCAGACTGT CCCGGACCCA GCAGCTCATA TCAAGGAAGC 300 

CTTATCAGTT GTGAGTGAGG ACCAGTCGTT GTTTGAGTGT GCCTACGGAA CGCCACACCT 360 

GGCTAAGACA GAGATGACCG CGTCCTCCTC CAGCGACTAT GGACAGACTT CCAAGATGAG 420 

CCCACGCGTC dCTCAGCAGG ATTGGCTGTC TCAACCCCCA GCCAGGGTCA CCATCAAAAT 480 

GGAATGTAAC CCTAGCCAGG TGAATGGCTC AAGGAACTCT CCTGATGAAT GCAGTGTGGC 540 

CAAAGGCGGG AAGATGGTGG GCAGCCCAGA CACCGTTGGG ATGAACTACG GCAGCTACAT 600 

GGAGGAGAAG CACATGCCAC CCCCAAACAT GACCACGAAC GAGCGCAGAG TTATCGTGCC 660 

AGCAGATCCT ACGCTATGGA GTACAGACCA TGTGCGGCAG TGGCTGGAGT GGGCGGTGAA 720 

AGAATATGGC CTTCCAGACG TCAACATCTT GTTATTCCAG AACATCGATG GGAAGGAACT 780 

GTGCAAGATG ACCAAGGACG ACTTCCAGAG GCTCACCCCC AGCTACAACG CCGACATCCT 840 

TCTCTCACAT CTCCACTACC TCAGAGAGAC TCCTCTTCCA CATTTGACTT CAGATGATGT 900 

TGATAAAGCC TTACAAAACT CTCCACGGTT AATGCATGCT AGAAACACAG ATTTACCATA 960 

TGAGCCCCCC AGGAGATCAG CCTGGACCGG TCACGGCCAC CCCACGCCCC AGTCGAAAGC 102 0 

TGCTCAACCA TCTCCTTCCA CAGTGCCCAA AACTGAAGAC CAGCGTCCTC AGTTAG AT CC 1080 

TTATCAGATT CTTGGACCAA CAAGTAGCCG CCTTGCAAAT CCAGGCAGTG GCCAGATCCA 114 0 

GCTTTGGCAG TTCCTCCTGG AGCTCCTGTC GGACAGCTCC AACTCCAGCT GCATCACCTG 1200 

GGAAGGCACC AACGGGGAGT TCAAGATGAC GGATCCCGAC GAGGTGGCCC GGCGCTGGGG 1260 

AGAGCGGAAG AGCAAACCCA ACATGAACTA CGATAAGCTC AGCCGCGCCC T CCGTT ACT A 1320 

CTATGACAAG AACATCATGA CCAAGGTCCA TGGGAAGCGC TACGCCTACA AGTTCGACTT 13 8 0 

CCACGGGATC GCCCAGGCCC TCCAGCCCCA CCCCCCGGAG TCATCTCTGT ACAAGTACCC 1440 

CTCAGACCTC CCGTACATGG GCTCCTATCA CGCCCACCCA CAGAAGATGA ACTTTGTGGC IS 00 

GCCCCACCCT CCAGCCCTCC CCGTGACATC TTCCAGTTTT TTTGCTGCCC CAAACCCATA 1560 

CTGGAATTCA CCAACTGGGG GTATATACCC CAACACTAGG CTCCCCACCA GCCATATGCC 1620 

TTCTCATCTG GGCACTTACT ACTAAAGACC TGGCGGAGGC TTTTCCCATC AGCGTGCATT 1680 

CACCAGCCCA TCGCCACAAA CTCTATCGGA GAACATGAAT CAAAAGTGCC TCAAGAGGAA 1740 

TGAAAAAAGC TTTACTGGGG CTGGGGAAGG AAGCCGGGGA AGAGATCCAA AGACTCTTGG 1800 

GAGGGAGTTA CTGAAGTCTT ACTACAGAAA TGAGGAGGAT GCTAAAAATG TCACGAATAT 1860 

GGACATATCA TCTGTGGACT GACCTTGTAA AAGACAGTGT ATGTAGAAGC ATGAAGTCTT 1920 

AAGGACAAAG TGCCAAAGAA AGTGGTCTTA AGAAATGTAT AAACTTTAGA GTAGAGTTTG 1980 

AATCCCACTA ATGCAAACTG GGATGAAACT AAAGCAATAG AAACAACACA GTTTTGACCT 2040 

AACATACCGT TTATAATGCC ATTTTAAGGA AAACTACCTG TATTTAAAAA TAGTTTCATA 2100 

TCAAAAACAA GAGAAAAGAC ACGAGAGAGA CTGTGGCCCA TCAACAGACG TTGATATGCA 2160 

ACTGCATGGC ATGTGCTGTT TTGGTTGAAA TCAAATACAT TCCGTTTGAT GGACAGCTGT 2220 

CAGCTTTCTC AAACTGTGAA GATGACCCAA AGTTTCCAAC TCCTTTACAG TATTACCGGG 2280 

ACTATGAACT AAAAGGTGGG ACTGAGGATG TGTATAGAGT GAGCGTGTGA TTGTAGACAG 23 40 

AGGGGTGAAG AAGGAGGAGG AAGAGGCAGA GAAGGAGGAG ACCAGGCTGG GAAAGAAACT 2400 

TCTCAAGCAA TGAAGACTGG ACTCAGGACA TTTGGGGACT GTGTACAATG AGTTATGGAG 2460 

ACT CGAGGGT TCATGCAGTC AGTGTTATAC CAAACCCAGT GTTAGGAGAA AGGACACAGC 2520 

GTAATGGAGA AAGGGAAGTA GTAGAATTCA GAAACAAAAA TGCGCATCTC TTTCTTTGTT 2580 

TGTCAAATGA AAATTTTAAC TGGAATTGTC TGATATTTAA GAGAAACATT CAGGACCTCA 2640 

TCATTATGTG GGGGCTTTGT TCTCCACAGG GTCAGGTAAG AGATGGCCTT CTTGGCTGCC 2700 



310 



WO 02/086443 

ACAATCAGAA ATCACGCAGG CATTTTGGGT AGGCGGCCTC CAGTTTTCCT TTGAGTCGCG 2760 

AACGCTGTGC GTTTGTCAGA ATGAAGTATA CAAGTCAATG TTTTTCCCCC TTTTTATATA 2820 

ATAATTATAT AACTTATGCA TTTATACACT ACGAGTTGAT CTCGGCCAGC CAAAGACACA 2880 

CGACAAAAGA GACAATCGAT ATAATGTGGC CTTGAATTTT AACTCTGTAT GCTTAATGTT 2940 

TACAATATGA AGTTATTAGT TCTTAGAATG CAGAATGTAT GTAATAAAAT AAGCTTGGCC 3000 

TAGCATGGCA AATCAGATTT ATACAGGAGT CTGCATTTGC ACTTTTTTTA GTGACTAAAG 3060 

TTGCTTAATG AAAACATGTG CTGAATGTTG TGGATTTTGT GTTATAATTT ACTTTGTCCA 3120 
GGAACTTGTG CAAGGGAGAG CCAAGGAAAT AGGATGTTTG GCACCC 

Seq ID NO: 331 Protein sequence 
Protein Accession #s AAA52398 

X 11 21 31 41 SI 

I I I I I I 

MIQTVPDPAA HIKBALSWS EDQSLFECAY GTPHLAKTEM TASSSSDYGQ TSKMSPRVPQ 60 

QDWLSQPPAR VTIKMECNPS QVNGSRNSPD ECSVAKGGKM VGSPDTVGMN YGSYMEEKHM 120 

PPPNMTTNBR RVIVPADPTL WSTDHVRQWL EWAVKEYGLP DVNILLFQNI DGKELCKMTK 180 

DDFQRLTPSY NADILLSHLH YLRETPLPHL TSDDVDKALQ NSPRLMHARN TDLPYEPPRR 240 

SAWTGHGHPT PQSKAAQPSP STVPKTEDQR PQLDPYQILG PTSSRLANPG SGQIQLWQPL 300 

LELLSDSSNS SCITWEGTNG EFKMTDPDEV ARRWGERKSK PNMNYDKLSR ALRYYYDKNI 360 

MTKVHGKRYA YKFDPHGIAQ ALQPHPPESS LYKYPSDLPY MGSYHAHPQK MNFVAPHPPA 420 

LPVTSSSFPA APNPYWNSPT GGIYPNTRLP TSHMPSHLGT YY 462 

Seq ID NO: 332 DNA sequence 
Nucleic Acid Accession #: NM_000020 
Coding sequence: 283-1794 

1 11 21 31 41 51 

111)11 

AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 

CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240 

AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CCATGACCTT GGGCTCCCCC 300 

AGGAAAGGCC TTCTGATGCT GCTGATGGCC TTGGTGACCC AGGGAGACCC TGTGAAGCCG 360 

TCTCGGGGCC CGCTGGTGAC CTGCACGTGT GAGAGCCCAC ATTGCAAGGG GCCTACCTGC 420 

CGGGGGGCCT GGTGCACAGT AGTGCTGGTG CGGGAGGAGG GGAGGCACCC CCAGGAACAT 480 

CGGGGCTGCG GGAACTTGCA CAGGGAGCTC TGCAGGGGGC GCCCCACCGA GTTCGTCAAC 540 

CACTACTGCT GCGACAGCCA CCTCTGCAAC CACAACGTGT CCCTGGTGCT GGAGGCCACC 600 

CAACCTCCTT CGGAGCAGCC GGGAACAGAT GGCCAGCTGG CCCTGATCCT GGGCCCCGTG 660 

CTGGCCTTGC TGGCCCTGGT GGCCCTGGGT GTCCTGGGCC TGTGGCATGT CCGACGGAGG 720 

CAGGAGAAGC AGCGTGGCCT GCACAGCGAG CTGGGAGAGT CCAGTCTCAT CCTGAAAGCA 780 

TCTGAGCAGG GCGACACGAT GTTGGGGGAC CTCCTGGACA GTGACTGCAC CACAGGGAGT 840 

GGCTCAGGGC TCCCCTTCCT GGTGCAGAGG ACAGTGGCAC GGCAGGTTGC CTTGGTGGAG 900 

TGTGTGGGAA AAGGCCGCTA TGGCGAAGTG TGGCGGGGCT TGTGGCACGG TGAGAGTGTG 960 

GCCGTCAAGA TCTTCTCCTC GAGGGATGAA CAGTCCTGGT TCCGGGAGAC TGAGATCTAT 1020 

AACACAGTAT TGCTCAGACA CGACAACATC CTAGGCTTCA TCGCCTCAGA CATGACCTCC 1080 

CGCAACTCGA GCACGCAGCT GTGGCTCATC ACGCACTACC ACGAGCACGG CTCCCTCTAC 1140 

GACTTTCTGC AGAGACAGAC GCTGGAGCCC CATCTGGCTC TGAGGCTAGC TGTGTCCGCG 1200 

GCATGCGGCC TGGCGCACCT GCACGTGGAG ATCTTCGGTA CACAGGGCAA ACCAGCCATT 1260 

GCCCACCGCG ACTTCAAGAG CCGCAATGTG CTGGTCAAGA GCAACCTGCA GTGTTGCATC 1320 

GCCGACCTGG GCCTGGCTGT GATGCACTCA CAGGGCAGCG ATTACCTGGA CATCGGCAAC 1380 

AACCCGAGAG TGGGCACCAA GCGGTACATG GCACCCGAGG TGCTGGACGA GCAGATCCGC 1440 

ACGGACTGCT TTGAGTCCTA CAAGTGGACT GACATCTGGG CCTTTGGCCT GGTGCTGTGG 1500 

GAGATTGCCC GCCX3GACCAT CGTGAATGGC ATCGTGGAGG ACTATAGACC ACCCTTCTAT 1560 

GATGTGGTGC CCAATGACCC CAGCTTTGAG GACATGAAGA AGGTGGTGTG TGTGGATCAG 1620 

CAGACCCCCA CCATCCCTAA CCGGCTGGCT GCAGACCCGG TCCTCTCAGG CCTAGCTCAG 1680 

AT GATGCGGG AGTGCTGGTA CCCAAACCCC TCTGCCCGAC TCACCGCGCT GCGGATCAAG 1740 

AAGACACTAC AAAAAATTAG CAACAGTCCA GAGAAGCCTA AAGTGATTCA ATAGCCCAGG 1800 

AG CACCTGAT TCCTTTCTGC CTGCAGGGGG CTGGGGGGGT GGGGGGCAGT GGATGGTGCC 1860 

CTATCTGGGT AGAGGTAGTG TGAGTGTGGT GTGTGCTGGG GATGGGCAGC TGCGCCTGCC 1920 

TGCTCGGCCC CCAGCCCACC CAGCCAAAAA TACAGCTGGG CTGAAACCTG ATCCCCTGCT 19B0 

GTCTGGCCTG CTCAAAGCGG CAGGCTCCCT GACGCCTGGC TCTCTCCCCA CCCCTATGGC 2040 

CAGCATGGTG CACCCCCTAC CACTCCCGGG ACAGGATGCA AAAGAGGCTC CAGAGTCAGA 2100 

GTGCCAAGCC AGGGAATCCC AGTCCCAGAC TCAGAGCCCG GGCCTGCACT TTGCCCCCTG 2 ISO 

CCCTTGATCA ACCCCACTGC CCCACCAGAG CTGCCAGGGT GGCACAGGGC CCTGTCCAGC 2220 

CCCTGGCACA CACTTCCCTG CCAGGCCTCA GCCTCTAGCA TAAGCTCCAG AGAGCCAGGG 2280 

CCCATCAGTT TCTCTCTGTG GATTTGTATC TCAGCTCCAT GATGCCTTGG GCTTTCTGTC 2340 

TCCTCAACAA GAGTGCAGCT TGCTGAATGT CAGCTGCCTG AGAGAGCTGG GGCCTGACTT 2400 

ACTAGGGCAT TAAATCCTAA GAGGTCCTAC TGAGGTGTGG CAGGATCACA GGCCAGTGGA 2460 

AAAAGGGCAG GTCAGATGGG CAAGGCCCAG GACTTTCAGA TTAACTGAGA GGATATCGAG 2520 

GCCAAGCATG GCAGGGGGAA GGTCAGTGGG TGTCAAGAGA CCCAGGTCTG ACCCCGGATG 2580 

TTTGCTCCAT GTGACAAAAG CAGGCCTGTC TCAGGACCTT TTCTTTTCTT TTTTCCTTCT 2640 

TTTTTTTTTT GACACGGAGT TTCGCTCTTG TTGTCCAGGC TAGAGTGCAA TGGCATGATC 2700 

CCAGCTCACC GCAACGTCTA CCTCCCAGGT TCAAATCATT CTCTTGCCTC AGACTCCCGA 2760 

GTAGCTGGGA TTACAGGCAC ATGCCACCAT GCCTGGCTAA TTTTGTATAT TTAGTAGAAA 2820 

CAGGGTTTCA CCATGCTGGC CATGCTGGTT CTCGAACTCC TGACCTCAGG TGTTCCACCT 2880 

ACCTCAGCCT CCCAAAGTGC TGGGGTTACA GGTGTGAGCC ATCGCGCCTG GCCAGGACCT 2 940 

TTGTTTCTTA TCTACATATT GGAAGATTTG GTCCTGATGT CCTTTGAGGC TTCTTTAGCT 3000 

CTAGTTCTCT GACACTTCAG CCTATATCAC AGCTAACTTC YTCAGTCTCA TCTATTCCTT 3060 

ATGCTCCAGC CCCTGGCAAT TTGCCTCAAG ATGGGGGTTT GAAAATAACT TTACCTGACT 3120 

CAAGGAGTGT CTGGAGCACC TCCTAGTCTA AGTCTGCAAG CTCCAGTTCT TGCCTAAAAC 3180 

CATGCCAGTG GCCACCCTTG GGCTCAGACA GCTCTGGGCC TTTTGACCAC AAGCCAGCCC 3240 

CTCGCCCTCT CTGTGGCATA GTCTTCTCTG CCCCAGGACT GCAGGGCGGC TTCCTCCAAG 3300 

GCTTCCAAGG CTCAAAAGAA ATTTGGCTCC ATCCAAGAAG GCTCCAGCTC CCCTACTGGC 3360 

CCCTGGCTTC AGGCCCACAC CCCTGGGCCA GGSCCAGAGA GTGTGTCTCA GGAGAATTCA 3420 

ATGGGCTCTA GAGAGACACA CAGAAAGTTT GGGCATTTGG GAAATTTTCA AGGRTGTATG 3480 



311 
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TATGGYTCAC GTATGGWGCA GGTTGTCCTG GTCCYKGGGT GCAGGGAAGT GGGCTGCAGG 3540 

GAAGTGGATT GGAGGGGAGC TTGAGGAATA TAAGGAGCGG GGGTGGAGAC TCAGGCTATG 3600 

GACAAGGACA GCCCCAAGGT TGGGAAGACC TGGCCTTAGT CGTCCTCAGC CTAGGGCAGG 3660 

GCAGTGAAGA AAGCTCTCCC CGCTCCTGCT GTAATGACCC AGAGTAGCCT CCCCAGGCCG 3720 

GCATCTTATG TGTGTCTTCC ACCATCCTCA TGGTGGCACT TTTCTAGGCC TGTCTCCCAG 3780 

CATTGTGCAA GGCTCGGAAG AGAACCAGGA AGTGAAACTG GGTGAAAACA GAAAGCTCAA 3 840 

TGGATGGGCT AGGTTCCCAG ATCATTAGGG CAGAGTTTGC ACGTCCTCTG GTTCACTGGG 3900 

AATCCACCCA GCCCACGAAT CATCTCCCTC TTTGAAGGAT TTTWATTTCT ACTGGGTTTT 3960 

GGAACAAACT CCTGCTGAGA CCCCACAGCC AGAAACTGAA AGCAGCAGCT CCCCAAAGCC 4020 

TGGAAAATCC CTAAGAGAAG GCCTGGGGGA MAGGAAKTGG AGTGACAGGG GACAGGTAGA 4080 

GAGAAGGGGG CCCAATGGCC AGGGAGTGAA GGAGGTGGCG TTGCTGAGAG CAGTCTGCAC 4140 

ATGCTTCTGT CTGAGTGCAG GAAGGTGTTC CAGGGTCGAA ATTACACTTC TCGTACCTGG 4200 

AGACGCTGTT TGTGGGAGCA CTGGGCTCAT GCCTGGCACA CAATAGGTCT GCAATAAACC 4260 
ATGGTTAAAT CCTGAAAAAA AAAAAAAAA 

Seq ID NO: 333 Protein sequence 
Protein Accession #: NP_000011 



1 11 21 31 41 51 

MTLGSPRKGL LMLLMAXiVTQ GDPVKPSRGP LVTCTCESPH CKGPTCRGAW CTWLVREEG 60 

RHPQEHRGCG NLKRELCRGR PTEFVNHYCC DSHLCNHNVS LVLEATQPPS EQPGTDGQLA 120 

LILGPVLALL ALVALGVLGL WHVRRRQEKQ RGLHSELGES SLILKASEQG DTMLGDLLDS 180 

DCTTGSGSGli PFLVQRTVAR QVALVECVGK GRYGEVWRGL WHGESVAVKI PSSRDEQSWF 240 

RETEIYNTVL LRHDNILGFI ASDMTSRNSS TQLWLITHYH EHGSLYDFLQ RQTLEPKLAL 300 

RLAVSAACGL AHLHVE I PGT QGKPAIAHRD FKSRNVLVKS NLQCCIADLG LAVMHSQGSD 360 

YLDIGNNPRV GTKRYMAPEV LDEQIRTDCP ESYKWTDIWA FGLVLWE IAR RTIVNGIVED 420 

YRPPFYDWP NDPSFEDMKK WCVDQQTPT IPNRLAADPV LSGLAQMMRE CWYPNPSARL 480 
TALRIKKTIiQ KISNSPEKPK VIQ 



Seq ID NO: 334 DNA sequence 

Nucleic Acid Accession #i NM_004126.l 

Coding sequence: 108-329 

1 11 21 -31 41 51 

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCGAAAATG CCTGCCCTTC 120 

ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 

AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240 

AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 

AAGAAAAAGG CAGCTGTGTT ATT T CAT AAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 

AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 

TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAG CAT AT 480 

GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 

ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT 



Seq ID NO i 335 Protein sequence 
Protein Accession #: NP_004117.1 

1 11 21 31 41 51 

I I I I I I 

MPALKIEDLP EKEKLKMEVE QLRKEVKLQR QQVSKCSEEI KNYIEERSGE DPLVKGIPED 

KNPFKEKGSC VIS 



Seq ID NO: 336 DNA sequence 
Nucleic Acid Accession #: NM_00579S 
Coding sequence: 555-1940 

1 11 21 31 41 51 

GCACGAGGGA ACAACCTCTC TCTCTSCAGC AGAGAGTGTC ACCTCCTGCT TTAGGACCAT 60 

CAAGCTCTGC TAACTGAATC TCATCCTAAT TGCAGGATCA CATTGCAAAG CTTTCACTCT 120 

TTCCCACCTT GCTTGTGGGT AAATCTCTTC TGCGGAATCT CAGAAAGTAA AGTTCCATCC 180 

TGAGAATATT TCACAAAGAA TTTCCTTAAG AGCTGGACTG GGTCTTGACC CCTGGAATTT 240 

AAGAAATTCT TAAAGACAAT GTCAAATATG AT C CAAGAG A AAATGTGATT TGAGTCTGGA 300 

GACAATTGTG CATATCGTCT AATAATAAAA ACCCATACTA GCCTATAGAA AACAATATTT 3 60 

GAATAATAAA AACCCATACT AG CC TAT AG A AAACAATATT TGAAAGATTG CTACCACTAA 420 

AAAGAAAACT ACTACAACTT GACAAGACTG CTGCAAACTT CAATTGGTCA CCACAACTTG 480 

ACAAGGTTGC TATAAAACAA GATTGCTACA ACTTCTAGTT TATGTTATAC AGCATATTTC 540 

ATTTGGGCTT AATGATGGAG AAAAAGTGTA CCCTGTATTT TCTGGTTCTC TTGCCTTTTT 600 

TTATGATTCT TGTTACAGCA GAATTAGAAG AGAGTCCTGA GGACTCAATT CAGTTGGGAG 660 

TT AC TAG AAA TAAAATCATG ACAGCTCAAT ATGAATGTTA CCAAAAGATT ATGCAAGACC 720 

CCATTCAACA AGCAGAAGGC GTTTACTGCA ACAGAACCTG GGATGGATGG CTCTGCTGGA 780 

ACGATGTTGC AGCAGGAACT GAATCAATGC AGCTCTGCCC TGATTACTTT CAGGACTTTG 840 

ATCCATCAGA AAAAGTTACA AAGATCTGTG ACCAAGATGG AAACTGGTTT AGACATCCAG 900 

CAAGCAACAG AACATGGACA AATTATACCC AGTGTAATGT TAACACCCAC GAGAAAGTGA 960 

AGACTGCACT AAATTTGTTT TACCTGACCA TAATTGGACA CGGATTGTCT ATTGCATCAC 1020 

TGCTTATCTC G CTTGG CAT A TTCTTTTATT TCAAGAGCCT AAGTTGCCAA AGG ATT AC CT 1080 

TACACAAAAA TCTGTTCTTC TCATTTGTTT GTAACTCTGT TGTAACAATC ATTCACCTCA 1140 

CTGCAGTGGC CAACAACCAG GCCTTAGTAG CCACAAATCC TGTTAGTTGC AAAGTGTCCC 1200 

AGTTCATTCA TCTTTACCTG ATGGGCTGTA ATTACTTTTG GATGCTCTGT GAAGGCATTT 1260 

ACCTACACAC ACTCATTGTG GTGGCCGTGT TTGCAGAGAA GCAACATTTA ATGTGGTATT 1320 

ATTTTCTTGG CTGGGGATTT CCACTGATTC CTGCTTGTAT ACATG CCATT GCTAGAAGCT 1380 
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TATATTACAA TGACAATTGC TGGATCAGTT CTGATACCCA 
GCCCAATTTG TGCTGCTTTA CTGGTGAATC TTTTTTTCTT 
TCATCACCAA GTTAAAAGTT ACACACCAAG CGGAATCCAA 
GAGCTACTCT TATCTTGGTG CCATTGCTTG GCATTGAATT 
CTGAAGGAAA GATTG CAGAG GAGGTATATG ACTACATCAT 
AGGGTCTTTT GGTCTCTACC ATTTTCTGCT TCTTTAATGG 
GAAGAAACTG GAATCAATAC AAAATCCAAT TTGGAAACAG 
TTCGTAGTGC GTCTTACACA GTGTCAACAA TCAGTGATGG 
GTCCTAGTGA ACACTTAAAT GGAAAAAGCA TCCATGATAT 
CAGAAAATTT ATATAATTGA AAATAGAAGG ATGGTTGTCT 
AACTCAAGGA CTTGGACCCA TGACTCTGTA GCCAGAAGAC 
GGGAATGTCA TAAAGAAGAG CCTTCACATG AAATTAGTAG 
ATCCAG CTCT ATGTGGGAAA AAAGAAATCC TGGTTTGTAA 
CACTATGCCT GATGTGACGC TACTAACCTG ACATCACCAA 
ACAATCAACT TTTCTGAGCT GGTGTAAGCC AGTTCCAGCA 
AAATGGCTGT AAAACTAAAC ATACATGTTG GGCATGATTC 
GACCTAGCTA AGGTCTATAA ACATGAAGGG AAAATTAGCT 
TCCCATCTTG ATTGGGGCAG TTGACTTTTT TTTTTTCCCA 
TAACTACCCT CTCAAATGGA CAATACCAGA AGTGAATTAT 
CTATGAAAAG CAACTGAGTA CAATTGTTAT GATCTACTCA 
ATCTTGTGGC ATATCCATTG TGGAAACTGG ATGAACAGGA 
TTCTATATCA TTAGGAAAAC ATCTTAGTTG ATGCTACAAA 
TGTCTTACCA AACAGTGGGA GGGAATTCCT AGCTGTAAAT 
TCTACTGTAT AAACAAATTA GCAATCATTT TATATAAAGA 
ATT TTCTTGG AATTTTGTAA AAAGAAATTG TGAAAAATGA 
TTTATTTTAT AGTCTCAAAT CAAATACATA CAACCTATGT 
AATGCAACAA TGTGTGTATG TTAATATCTG ATACTGTATC 
AATAGAGTCT GGAATGCT 

Seq ID NO: 337 protein sequence 
Protein Accession #: NP_005786.1 



PCT/US02/12476 



TCTCCTCTAC 
GTTAAATATT 
TCTGTACATG 
TGTGCTGATT 
GCACATCCTT 
AGAGGTTCAA 
CTTTTCCAAC 
TCCAGGTTAT 
TGAAAATGTT 
CACTGTTTGG 
TTCAATATTA 
TGTGTTGATA 
TGTTTGTCAG 
GTGTGGAATT 
CACCATTGAT 
TACCCTTATT 
TTTAGTTTTA 
GAGTGCCX3TA 
CCCTGCTGGC 
TTTGCTGACA 
TGTATAATAT 
ACACCTTGTC 
ATAAATTTTG 
AAATCAATGA 
GCTTGTAAAT 
AATTTTTAAA 
TGGGCTGATT 



1 
I 

MEKKCTLYFL 
EGVYCNRTWD 
WTNYTQCNVN 
FFSFVCNSW 
IWAVPAEKQ 
ALLVNLPPIiL 
AEEVYDYIMH 
YTVSTISDGP 



11 

I 

VLLPFFMILV 
GWLCWNDVAA 
THEKVKTAXiN 
TIIHLTAVAN 
HLMWYYFLGW 
NIVRVLITKL 
ILMHFQGLLV 
GYSHDCPSEH 



21 

! 

TAELEESPED 
GTESMQLCPD 
LFYLTIIGHG 
NQALVATNPV 
GFPLIPACIH 
KVTHQAESNL 
STIFCFPNGE 
LNGKSIHDIE 



31 
I 

SIQLGVTRNK 
YPQDFDPSEK 
LSIASLLISL 
SCKVSQFIHL 
AIARSLYYND 
YMKAVRATLI 
VQAILRRNWN 
NVLLKPENLY 



41 

I 

IMTAQYECYQ 
VTKICDQDGN 
GIFPYFKSLS 
YLMGCNYFWM 
NCWXSSDTKL 
LVPLLGIEFV 
QYKIQFGNSF 
N 



ATTATCCATG 
GTACGCGTTC 
AAAGCTGTGA 
CCATGGCGAC 
ATGCACTTCC 
GCAATTCTGA 
TCAGAAGCTC 
AGT CATG ACT 
CTCTTAAAAC 
TGCTTCTCCT 
AATGACTTTG 
AGAGTGTAAC 
TAAATACTCC 
GGAGAAAAGC 
GAATTCAAAC 
CSCCCCAAGA 
AAACTCTTTA 
GTCCTTTTTG 
TT T CTTTT CT 
CATCAGTTAT 
GCAATCTTAC 
AACCTCTTCC 
CCCTTCCATT 
AGGATTTCTT 
ACTCCATTAT 
GCAAATATAT 
TTTTAAATAA 



51 
I 

KIMQDPIQQA 
WFRHPASNRT 
CQRITLHKNL 
LCEGIYLHTL 
LYIIHGPICA 
LIPWRPEGKI 
SNSEALRSAS 



Seq ID NO t 338 DNA sequence 
Nucleic Acid Accession #s NM_001795 
Coding sequence: 25-2379 



GCACGATCTG 



CGGGACACCC 
CAGATGCACA 
TCAAGCGTGA 
TTCCGGGTCG 
ATCTCAGAGT 
ACTCCTTCCA 
CATCGGTTGT 
GTGACAGCAG 
ATCCTGAAGG 
AAAAGCTTGG 
CAGGGCCTCC 
GACAACTTCC 
GTGGGCACCT 
ACCAAGTACA 
GCCCACAACG 
TACAGCTTCA 
GCGGG AAA CA 
CAGCAGCCTT 
GTGCTGGCCA 
AGTGACAAGG 
CTGGACAGAG 
ACTGGAACCC 
AATGACAATG 
CATGGCCAGC 
AAGTTCAAAT 
ACGGCCAACA 
CTACCCGTGG 
GTGGCCGTGT 
CAGGTGGGCG 
GTGATCACCC 
AAGAGCGTGC 
ATGGACACCA 
CCCCCGCGGC 
AGGCACGCGC 
AAGGACGAGG 
TACGAGGGCT 



11 
I 

TTCCTCCTGG 
GCCTGCTGGC 
ACAGCCTGCT 
TTGATGAAGA 
GTCGCAAGAA 
ATGCAGAGAC 
ACCACCTCAC 
GCTTCACCAT 
TCAATGCGTC 
TGGATGCAGA 
GGAAAGAGTA 
ACCGAGAGAA 
GGGGGGACTC 
CCTTCTTCAC 
CTGTGGGCTC 
GCATCTTGCG 
AGGGCATCAT 
TCGTCGAGGC 
GAGCCCAGGT 
TCTACCACTT 
TGGACCCTGA 
GCCAGTTCTT 
AAGTCTACCC 
CCACAGGAAA 
CCCCGGAGTT 
TGGTCCTGCA 
TCACCTTGAA 
TCACAGTCAA 
TCATCTCAGA 
GCAAGTGCAA 
TGAGCATCCA 
TGCTCATCTT 
CGQAGATCCA 
CCAGCTACGA 
CCGCGCTGGA 
CTGGGGCACA 
CGGACCACGA 
CCGAGTCCAT 



21 
I 

GAAGATGCAG 
AGTGGCAGCA 
GCCCACCCAC 
GAAAAACACC 
TGCCAAGTAC 
AGGAGACGTG 
TGCTGTCATT 
CAAAGTTCAT 
CGTGCCTGAG 
CGACCCCACT 
TTTTGCCATC 
GCAGGCCAGG 
GGGCACGGCC 
CCAGACCAAG 
TCTGTTTGT T 
GGGCGACTAC 
CAAGCCCATG 
CACAGACCCC 
CATTATCAAC 
CCAGCTGAAG 
TGCGGCTAGG 
CCGAGTCACA 
CTGGTATAAC 
AGAATCCATT 
TGCCAAGCCC 
OATCTCCGCA 
TACTGAGAAC 
GTATGGGCAG 
CAATGGGATG 
CGAGCAGGGC 
GGCAGTGGTA 
CCTGCGGCGG 
CGAGCAGCTG 
TGTGTCGGTG 
CGCCCGGCCT 
CGGAGGGCCC 
CGGCGACGGC 
AGCCGAGTCC 



31 
I 

AGGCTCATGA 
GTGGCAGCAG 
CGGCGCCAAA 
TCACTTCCCC 
CTGCTCAAAG 
TTCGCCATTG 
GTGGACAAGG 
GACGTGAACG 
TCGTCGGCTG 
GTGGGAGACC 
GATAATTCTG 
TATGAGATCG 
ACCGTGCTGG 
TACACATTTG 
GAGGACCCAG 
CAGGACGCTT 
AAGCCTCTGG 
ACCATCGACC 
AT CACAGATG 
GAAAACCAGA 
CATAGCATTG 
AAAAAGGGGG 
CTGACTGTGG 
GTGCAAGTCC 
TACCAGCCCA 
ATAGACAAGG 
AACTTTACCC 
TTTGACCGGG 
CCAAGTCGCA 
GAGTTCACCT 
GCCATCTTAC 
CGGCTCCGGA 
GTCACCTACG 
CTCAACTCGG 
TCCCTCTATG 
GGGGAGATGG 
CCCCCCTACG 
CTCAGCTCCC 



41 

I 

TGCTCCTCGC 
CAGGTGCTAA 
AGAGAGATTG 
ATCATGTAGG 
GAGAATATGT 
AGAGGCTGGA 
ACACTGGTGA 
ACAACTGGCC 
TGGGGACCTC 
ACG CCTCTGT 
GACGTATTAT 
TGGTG6AAGC 
TCACTCTGCA 
TCGTGCCTGA 
ATGAGCCCCA 
TCACCATTGA 
ATTATGAATA 
TCCGATACAT 
TGGACGAGCC 
AGAAGCCTCT 
GATACTCCAT 
ACATTTACAA 
AGGCCAAAGA 
ACATTGAAGT 
AAGTGTGTGA 
ACATAACACC 
TCACGGATAA 
AGCATACCAA 
CGGGCACCAG 
TCTGCGAGGA 
TCTGCATCCT 
AGCAGGCCCG 
ACGAGGAGGG 
TGCGCCGCGG 
CGCAGGTGCA 
CAGCCATGAT 
ACACGCTGCA 
TGGGCACCGA 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



51 
I 

CACATCGGGC 
CCCTGCCCAA 
GATTTGGAAC 
CAAGATCAAG 
GGGCAAGGTC 
CCGGGAGAAT 
AAACCTGGAG 
TGTGTTCACG 
AGT CAT CTCT 
CATGTACCAA 
CACAATAACG 
GCGAGATGCC 
AG A CAT CAAT 
AGACACCCGT 
GAACCGGATG 
GACAAACCCC 
CATCCAGCAA 
GAGCCCTCCC 
CCCCATTTTC 
GATTGGCACA 
CCGCAGGACC 
TGAGAAAGAA 
ACTGGATTCC 
TTTGGATGAG 
GAACGCTGTC 
ACGAAACGTG 
TCACGATAAC 
GGTCCACTTC 
CACGCTGACC 
TATGGCCGCC 
CACCATCACA 
CGCGCACGGC 
CGGCGGCGAG 
CGGGGCCAAG 
GAAGCCACCG 
CGAGGTGAAG 
CATCTACGGC 
CTCATCCGAC 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
3 60 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16B0 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160' 
2220 
2280 
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TCTGACGTGG ATTACGACTT CCTTAACGAC TG6GGACCCA GGTTTAAGAT GCTGGCTGAO 2340 

CTGTACGGCT CGGACCCCCG GGAGGAGCTQ CTGTATTAGG CGGCCGAGGT CACTCTGGGC 2400 

CTGGGGACCC AAACCCCCTG CAGCCCAGGC CAGTCAGACT CCAGGCACCA CAGCCTCCAA 2460 

AAATGGCAGT GACTCCCCAG CCCAGCACCC CTTCCTCGTG GGTCCCAGAG ACCTCATCAG 2520 

CCTTGGGATA GCAAACTCCA GGTTCCTGAA ATATCCAGGA ATATATGTCA GTGATGACTA 2580 

TTCTCAAATG CTGGCAAATC CAGGCTGGTG TTCTGTCTGG GCTCAGACAT CCACATAACC 2640 

CTGTCACCCA CAGACCGCCG TCTAACTCAA AGACTTCCTC TGGCTCCCCA AGGCTGCAAA 2700 

GCAAAACAGA CTGTGTTTAA CTGCTGCAGG GTCTTTTTCT AGGGTCCCTG AACGCCCTGG 2760 

TAAGGCTGGT GAGGTCCTGG TGCCTATCTG CCTGGAGGCA AAGGCCTGGA CAGCTTGACT 2820 

TGTGGGGCAG GATTCTCTGC AGCCCATTCC CAAGGGAGAC TGACCATCAT GCCCTCTCTC 2880 

GGGAGCCCTA GCCCTGCTCC AACTCCATAC TCCACTCCAA GTGCCCCACC ACTCCCCAAC 2940 

CCCTCTCCAG GCCTGTCAAG AGGGAGGAAG GGGCCCCATG GCAGCTCCTG ' ACCTTGGGTC 3000 

CTGAAGTGAC CTCACTGGCC TGCCATGCCA GTAACTGTGC TGTACTGAGC ACTGAACCAC 3060 

ATTCAGGGAA ATGCTTATTA AACCTTGAAG CAACTGTGAA TTCATTCTGG AGGGGCAGTG 3120 

GAGATCAGGA GTGACAGATC ACAGGGTGAG GGCCACCTCC ACACCCACCC CCTCTGGAGA 3180 

AGGCCTGGAA GAGCTGAGAC CTTGCTTTGA GACTCCTCAG CACCCCTCCA GTTTTGCCTG 3240 

AGAAGGGGCA GATGTTCCCG GAGATCAGAA GACGTCTCCC CTTCTCTGCC TCACCTGGTC 3300 

GCCAATCCAT GCTCTCTTTC TTTTCTCTGT CTACTCCTTA TCCCTTGGTT TAGAGGAACC 3360 

CAAGATGTGG CCTTTAGCAA AACTGACAAT GTCCAAACCC ACT CATGACT GCATGACGGA 3420 

GCCGAGCATG TGTCTTTACA CCTCGCTGTT GTCACATCTC AGGGAACTGA CCCTCAGGCA 3480 

CACCTTGCAG AAGGAAGGCC CTGCCCTGCC CAACCTCTGT GGTCACCCAT GCATCATTCC 3540 

ACTGGAACGT TTCACTGCAA ACACACCTTG GAGAAGTGGC ATCAGTCAAC AGAGAGGGGC 3600 

AGGGAAGGAG ACACCAAGCT CACCCTTCGT CATGGACCGA GGTTCCCACT CTGGCAAAGC 3660 

CCCTCACACT GCAAGGGATT GTAGATAACA CTGACTTGTT TGTTTTAACC AATAACTAGC 3720 

TTCTTATAAT GATTTTTTTA CTAATGATAC TTACAAGTTT CTAGCTCTCA CAGACATATA 3780 

GAATAAGGGT TTTTGCATAA TAAGCAGGTT GTTATTTAGG TTAACAATAT TAATTCAGGT 3840 

TTTTTAGTTG GAAAAACAAT TCCTGTAACC TTCTATTTTC TATAATTGTA GTAATTGCTC ,3900 

TACAGATAAT GTCTATATAT TGGCCAAACT GGTGCATGAC AAGTACTGTA TTTTTTTATA 3960 
CCTAAATAAA GAAAAATCTT TAGCCTGGGC AACAAAAAAA 

Seq ID NO: 33 9 Protein sequence 
Protein Accession #: NP_001786 

1 11 21 31 41 51 

I I I I I I 

MQRLMMLLAT SGACLGLLAV AAVAAAGANP AQRDTHSLLP THRRQKRDWI WNQMHIDBBK 60 

NTSLPHHVGK IKSSVSRKNA KYLLKGEYVG KVFRVDAETG DVFAIERLDR ENISEYHLTA 120 

VIVDKDTGEM LBTPSSFTIK VHDVNDNWPV PTHRLFNASV PESSAVGTSV I SVTAVD ADD 180 

PTVGDHASVM YQILXGKEYF AIDNSGRIIT ITKSLDREKQ ARYEIWEAR DAQGLRGDSG 240 

TATVLVTLQD INDNFPFFTQ TKYTFWPED TRVGTSVGSL PVEDPDEPQN RMTKYS I LRG 300 

DYQDAPTIET NPAHNEGIIK PMKPUQYEYI QQYSFIVEAT DPTIDLRYMS PPAGNRAQVI 360 

INITDVDEPP IFQQPFYHFQ LKENQKKPLI GTVIAMDPDA ARHSIGYSIR RTSDKGQFFR 420 

VTKKGDIYNE KELDREVYPW YNLTVEAKEL DSTGTPTGKE SIVQVHIBVL DENDNAPEFA 480 

KPYQPKVCEN AVHGQLVLQI SAIDKDITPR NVKFKPTLNT ENNFTLTDNH DNTANITVKY 540 

GQFDREHTKV HFLPWISDN GMPSRTGTST LTVAVCKCNE QGEFTFCEDM AAQVGVSIQA 600 

WAILLCILT ITVITLLIFL RRRLRKQARA HGKSVPEIHE QLVTYDEEGG GEMDTTSYDV 660 

SVLNSVRRGG AKPPRPALDA RPSLYAQVQK PPRHAPGAHG GPGEMAAMIE VKKDEADHDG 720 

DGPPYDTLHI YG YEGSESIA ESLSSLGTDS SDSDVDYDFL NDWGPRFKML AEIiYGSDPRE 780 
KTJiY 



Seq ID NO: 340 DNA sequence 
Nucleic Acid Accession 8: NM_003088 
Coding sequence i 112-1593 

1 11 21 31 41 51 

I I I I I I 

GCGGAGGGTG CGTGCGGGCC GCGGCAGCCG AACAAAGGAG CAGGGGCGCC GCCGCAGGGA 60 

CCCGCCACCC ACCTCCCGGG GCCGCGCAGC GGCCTCTCGT CTACTGCCAC CATGACCGCC 120 

AACGGCACAG CCGAGGCGGT GCAGATCCAG TTCGGCCTCA TCAACTGCGG CAACAAGTAC 180 

CTGACGGCCG AGGCGTTCGG GTTCAAGGTG AACGCGTCCG CCAGCAGCCT GAAGAAGAAG 240 

CAGATCTGGA CGCTGGAGCA GCCCCCTGAC GAGGCGGGCA GCGCGGCCGT GTGCCTGCGC 300 

AGCCACCTGG GCCGCTACCT GGCGGCGGAC AAGGACGGCA ACGTGACCTG CGAGCGCGAG 360 

GTGCCCGGTC CCGACTGCOG TTTCCTCATC GTGGCGCACG ACGACGGTCG CTGGTCGCTG 420 

CAGTCCGAGG CGCACCGGCG CTACTTCGGC GGCACCGAGG ACCGCCTGTC CTGCTTCGCG 480 

CAGACGGTGT CCCCCGCCGA GAAGTGGAGC GTGCACATCG CCATGCACCC TCAGGTCAAC 540 

ATCTACAGTG TCACCCGTAA GCGCTAGGCG CACCTGAGCG CGCGGCCGGC CGACGAGATC 600 

GCCGTGGACC GCGACGTGCC CTGGGGCGTC GACTCGCTCA TCACCCTCGC CTTCCAGGAC 660 

CAGCGCTACA GCGTGCAGAC CGCCGACCAC CGCTTCCTGC GCCACGACGG GCGCCTGGTG 720 

GCGCGCCCCG AGCCGGCCAC TGGCTACACG CTGGAGTTCC GCTCCGGCAA GGTGGCCTTC 780 

CGCGACTGCG AGGGCCGTTA CCTGGCGCCG TCGGGGCCCA GCGGCACGCT CAAGGCGGGC 840 

AAGGCCACCA AGGTGGGCAA GGACGAGCTC TTTGCTCTGG AGCAGAGCTG CGCCCAGGTC 900 

GTGCTGCAGG CGGCCAACGA GAGGAAOGTG TCCACGCGCC AGGGTATGGA CCTGTCTGCC 960 

AATCAGGACG AGGAGACCGA CCAGGAGACC TTCCAGCTGG AGATCGACCG CGACACCAAA 1020 

AAGTGTGCCT TCCGTACCCA CACGGGCAAG TACTGGACGC TGACGGCCAC CGGGGGCGTG 1080 

CAGTCCACCG CCTCCAGCAA GAATGCCAGC TGCTACTTTG ACATCGAGTG GCGTGACCGG 1140 

CGCATCACAC TGAGGGCGTC CAATGGCAAG TTTGTGACCT CCAAGAAGAA TGGGCAGCTG 1200 

GCCGCCTCGG TGGAGACAGC AGGGGACTCA GAGCTCTTCC TCATGAAGCT CATCAACCGC 1260 

CCCATCATCG TGTTCCGCGG GGAGCATGGC TTCATCGGCT GCCGCAAGGT CACGGGCACC 1320 

CTGGACGCCA ACOGCTCCAG CTATGACGTC TTCCAGCTGG AGTTCAACGA TGGCGCCTAC 1380 

AACATCAAAG ACTCCACAGG CAAATACTGG ACGGTGGGCA GTGACTCCGC GGTCACCAGC 1440 

AGCGGCGACA CTCCTGTGGA CTTCTTCTTC GAGTTCTGCG ACTATAACAA GGTGGCCATC 1500 

AAGGTGGGCG GGCGCTACCT GAAGGGCGAC CACGCAGGCG TCCTGAAGGC CTCGGCGGAA 1560 

ACCGTGGACC CCGCCTCGCT CTGGGAGTAC TAGGGCCGGC CCGTCCTTCC CCGCCCCTGC 1620 

CCACATGGCG GCTCCTGCCA ACCCTCCCTG CTAACCCCTT CTCCGCCAGG TGGGCTCCAG 1680 

GGCGGGAGGC AAGCCCCCTT GCCTTTCAAA CTGGAAACCC CAGAGAAAAC GGTGCCCCCA 1740 

CCTGTCGCCC CTATGGACTC CCCACTCTCC CCTCCGCCCG GGTTCCCTAC TCCCCTCGGG 1800 
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TCAGCGGCTG CGGCCTGGCC CTGGGAGGGA TTTCAOATGC CCCTGCCCTC TTGTCTGCCA 1860 

CGGGGCGAGT CTGGCACCTC TTTCTTCTGA CCTCAGACGG CTCTGAGCCT TATTTCTCTO 1920 

GAAGCGGCTA AGGGACGGTT GGGGGCTGGG AGCCCTGGGC GTGTAGTGTA ACTGGAATCT 1980 

TTTGCCTCTC CCAGCCACCT CCTCCCAGCC CCCCAGGAGA GCTGGGCACA TGTCCCAAGC 2040 

CTGTCAGTGG CCCTCCCTGG TGCACTGTCC CCGAAACCCC TGCTTGGGAA GGGAAGCTGT 2100 

CGGGAGGGCT AGGACTGACC CTTGTGGTGT TTTTTTGGGT GGTGGCTGGA AACAGCCCCT 2160 

CTCCCACGTG GGAGAGGCTC AGCCTGGCTC CCTTCCCTGG AGCX5GCAGGG CGTGACGGCC 2220 

ACAGGGTCTG CCCGCTGCAC GTTCTG CCAA GGTGGTGGTG GCGGGCGGGT AGGGGTGTGG 2280 

GGGCCGTCTT CCTCCTGTCT CTTTCCTTTC ACCCTAGCCT GACTGGAAGC AGAAAATGAC 2340 

CAAATCAGTA TTTTTTTTAA TGAAATATTA TTGCTGGAGG CGTCCCAGGC AAGCCTGGCT 2400 

GTAGTAGCGA GTGATCTGGC GGGGGGCGTC TCAGCACCCT CCCCAGGGGG TGCATCTCAG 2460 

CCCCCTCTTT CCGTCCTTCC OGTCCAGCCC CAGCCCTGGG CCTGGGCTGC CGACACCTGG 2520 

GCCAGAGCCC CTGCTGTGAT TGGTGCTCCC TGGGCCTCCC GGGTGGATGA AGCCAGGCGT 2580 

CGCCCCCTCC GGGAGCCCTG GGGTGAGCCG CCGGGGCCCC CCTGCTGCCA GCCTCCCCCG 2640 

TCCCCAACAT GCATCTCACT CTGGGTGTCT TGGTCTTTTA TTTTTTGTAA GTGTCATTTG 2700 

TATAACTCTA AACGCCCATG ATAGTAGCTT CAAACTGGAA ATAG CGAAAT AAAATAACTC 2760 
AGTCTGC 

Seq ID NO: 341 Protein sequence 
Protein Accession NP_003079 

1 11 21 31 41 51 

I I I I I I 

MTANGTAEAV QIQFGLINCG NKYLTAEAFG FKVNASASSL KKXQIWTLEQ PPDEAGSAAV 60 

CLRSHLGRYL AADKDGNVTC EREVPGPDCR FLIVAHDDGR WSLQSEAHRR YPGGTEDRLS 120 

CPAQTVSPAE KWSVHIAMHP QVNIYSVTRK RYAHLSARPA DEIAVDRDVP WGVDSLITLA 180 

FQDQRYSVQT ADHRPLRHDG RLVARPEPAT GYTLEFRSGK VAFRDCEGRY LAPSGPSGTIj 240 

KAGKATKVGK DELFAIiEQSC AQWLQAANE RNVSTRQGMD LSANQDEETD QETFQLEIDR 300 

DTKKCAFRTH TGKYWTLTAT GGVQSTASSK NASCYFDIEW RDRRITLRAS NGKFVTSKKN 360 

GQLAASVETA GDSELFLMKL INRPIIVFRG EHGFIGCRKV TGTLDANRSS YDVFQLEFND 420 

GAYNIKDSTG KYWTVGSDSA VTSSGDTPVD FFFEFCDYNK VAIKVGGRYL KGDHAGVLKA 480 
SAETVDPASI* WEY 



Seq ID NO: 342 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence t 660 .. 1705 

1 11 21 31 41 51 

I I I I I I 

CGCTCCGCAC ACATTTCCTG TCGCGGCCTA AGGGAAACTG TTGGCCGCTG GGCCCGCGGG 60 

GGGATTCTTG GCAGTTGGGG GGTCCGTCGG GAGCGAGGGC GGAGGGGAAG GGAGGGGGAA 120 

CCGGGTTGGG GAAGCCAGCT GTAGAGGGCG GTGACCGCGC TCCAGACACA GCTCTGCGTC 180 

CTCGAGCGGG ACAGATCCAA GTTGGGAGCA GCTCTGCGTG CGGGGCCTCA GAGAATGAGG 240 

CCGGCGTTCG CCCTGTGCCT CCTCTGGCAG GCGCTCTGGC CCGGGCCGGG CGGCGGCGAA 300 

CACCCCACTG CCGACCGTGC TGGCTGCTCG GCCTCGGGGG CCTGCTACAG CCTGCACCAC 360 

GCTACCATGA AGCGGCAGGC GGCCGAGGAG GCCTGCATCC TGCGAGGTGG GGCGCTCAGC 420 

ACCGTGCGTG CGGGCGCCGA GCTGCGCGCT GTGCTCGCGC TCCTGCGGGC AGGCCCAGGG 480 

CCCGGAGGGG GCTCCAAAGA CCTGCTGTTC TGGGTCGCAC TGGAGCGCAG GCGTTCCCAC 540 

TGCACCCTGG AGAACGAGCC TTTGCGGGGT TTCTCCTGGC TGTCCTCCGA CCCCGGCGGT 600 

CTCGAAAGCG ACACGCTGCA GTGGGTGGAG GAGCCCCAAC GCTCCTGCAC CGCGCGGAGA 660 

TGCGCGGTAC TCCAGGCCAC CGGTGGGGTC GAGCCCGCAG CTGGAAGGAG ATGCGATGCC 720 

ACCTGCGCGC CAACGGCTAC CTGTGCAAGT ACCAGTTTGA GGTCTTGTGT CCTGCGCCGC 780 

GCCCCGGGGC CGCCTCTAAC TTGAGCTATC GCGCGCCCTT CCAGCTGCAC AGCGCCGCTC 840 

TGGACTTCAG TCCACCTGOG ACCGAGGTGA GTGCGCTCTG CCGGGGACAG CTCCCGATCT 900 

CAGTTACTTG CATCGCGGAC GAAATCGGCG CTCGCTGGGA CAAACTCTCG GGCGATGTGT 960 

TGTGTCCCTG CCCCGGGAGG TACCTCCGTG CTGGCAAATG CGCAGAGCTC CCTAACTGCC 1020 

TAGACGACTT GGGAGGCTTT GCCTGCGAAT GTGCTACGGG CTTCGAGCTG GGGAAGGACG 1080 

GCCGCTCTTG TGTGACCAGT GGGGAAGGAC AGCCGACCCT TGGGGGGACC GGGGTGCCCA 1140 

CCAGGCGCCC GCCGGCCACT GCAACCAGCC CCGTGCCGCA GAGAACATGG CCAATCAGGG 1200 

TCGACGAGAA GCTGGGAGAG ACACCACTTG TCCCTGAACA AGACAATTCA GTAACATCTA 1260 

TTCCTGAGAT TCCTCGATGG GGATCACAGA GCACGATGTC TACCCTTCAA ATGTCCCTTC 1320 

AAGCCGAGTC AAAGGGCACT ATCACCCCAT CAGGGAGCGT GATTTCCAAG TTTAATTCTA 1380 

CGACTTCCTC TGCCACTCCT CAGGCTTTCG ACTCCTCCTC TGCCGTGGTC TTCATATTTG 1440 

TGAGCACAGC AGTAGTAGTG TTGGTGATCT TGACCATGAC AGTACTGGGG CTTGTCAAGC 1500 

TCTGCTTTCA CGAAAGCCCC TCTTCCCAGC CAAGGAAGGA GTCTATGGGC CCGCCGGGCC 1560 

TGGAGAGTGA TCCTGAGCCC GCTGCTTTGG GCTCCAGTTC TGCACATTGC ACAAACAATG 1620 

GGGTGAAAGT CGGGGACTGT GATCTGCGGG ACAGAGCAGA GGGTGCCTTG CTGGCGGAGT 1680 
CCCCTCTTGG CTCTAGTGAT GCATAG 

Seq ID NO: 343 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MGKDFMTKTP KAFATKAKID KWDLIKUCSF CTAKETIIRV NSQPTDWQKT FAIYPSDKGV 60 

IARIYKELEQ IYKFOOCPTKT LRTHFLSRPK GNCWPLGPRG DSWQLGGPSG ARAEGKGGGT 120 

GXiGKPAVEGG DRAPDTAIiRP RAGQIQVGSS SACGASENEA GVRPVPPLAG ALARAGRRRT 180 

PHCRPCWLLG LGGLLQPAPR YHEAAGGRGG LHPARWGAQH RACGRRAARC ARAPAGRPRA 240 

RRGLQRPAVL GRTGAQAFPL HPGERAFAGF LLAVIiRPRRS RKRHAAVGGG APTLLHRAEM 300 

RGTPGHRWGR ARSWKEMRCH LRANGYIiCKY QFEVLCPAPR PGAASNLSYR APFQLHSAAL 360 

DFSPPGTEVS ALCRGQLPIS VTCIADEIGA RWDKLSGDVL CPCPGRYLRA GKCAELPNCL 420 

DDLGGFACEC ATGFELGKDG RSCVTSGEGQ PTLGGTGVPT RRPPATATSP VPQRTWPIRV 480 

DEKIiGETPLV PEQDNSVTSI PEIPRWGSQS TMSTLQMSLQ AESKATITPS GSVISKFNST 540 

TSSATPQAFD SSSAWFIFV STAWVLVIL TMTVLGLVKL CFHESPSSQP RKESMGPPGL 600 
ESDPEPAALG SSSAHCTNNG VKVGDCDIiRD RAEGALLAES PLGSSDA 
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Seq ID NO: 344 DNA sequence 
Nucleic Acid Accession #: NM_012072 
Coding sequence j 149-2107 



1 ■ 
I 

AAAGCCCTCA 



TCCCGCAGAG 
GCTGCTGCTC 
CGTGGGGACC 
CCACTGCAAC 
CGTCCAGCGA 
CAAGTTCTGG 
GAAGGGCTTC 
GCTCCGGAAC 
GCTCCTTCCC 
CGGAAGTAAC 
GGCCCTGGGG 
CTTGGAGGCT 
CGAGACTCAG 
CAGCTCGGGC 
CCACCAGGAC 
" CCGGCTGCTG 
TCGTGGGGGG 
CCAAGGGTAC 
CTCCCCCTGT 
TGGCTATGAG 
GGGTCGCTCG 
TGAGGAGGGC 
TGTGGGCCCG 
CTGTGGCTGC 
TGTGTCTCTG 
GAGCACCGTG 
GGCTACACCC 
ACT CAAGATG 
CGCCACAGCT 
AAACAACGAT 
GGCCATCCTA 
GAAGAGGGAG 
TCCAGAGCGA 
CTGCTGAAAG 
TGAACTCCCC 
CAAACAATTG 
TGTTTGATGT 
TCTATAATQA 
GGGTGTGAGG 
ATCTAAGAGG 
CCTAGGATGA 
TCAAAGGGAA 
AGCACAAGTC 
TAACCTCTTA 
CTTGGGTTTA 
CAGGTGTTTG 
CACAGATACT 
TGTGATCAAC 
CCTCAGACAC 
CGAGCTCAGA 
TGAACGGGAG 
TCATAGTCCA 
CACACCAAGT 
TTCCTTAAAA 
TTTTTACAGC 
TTGCAAATAT 
TCTCTCTCAC 
CCTGGGGCAC 
TGAGTATCTC 
ACAGACAGAG 
GGTTTTGCCT 
CAAGGTGCAG 
TTTTAATAGA 
TCGATGGGGC 
AACTCTGGTG 
GGCCACGAGA 
CTGTCTGGCC 
CACTTAAATA 
TCATTTGGGG 
GTATGATTCC 
GCCCAGGCCC 
GAACCCCTCT 
ACCTTCCCTG 
CCCACCACGG 
AACCACTGGG 
ATGGAAATTC 
GCCAAGAGGC 
CACAGTGGGG 
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1 

GCCTTTGTGT 
CCCAGCTGGG 
GGCCACACAG 
CTGACCCAGC 
GCCTGCTACA 
CAGAACGGGG 
GTACTGGCCC 
ATTGGGCTCC 
AGCTGGGTGG 
TCGTGCATCT 
AACCGCCTGC 
ATTGAGGGCT 
GGCCCAGGTC 
GTGCCCTTTG 
AGTCATTATT 
CCCCTCTGTG 
TGCTTTGAAG 
GATGACCTGG 
GCCACGTGCG 
CAGCTGGACT 
GCCCAGGAGT 
CCGGGCGGTC 
CCTTGCGCCC 
TACGTCCTGG 
GGGGGCCCCC 
CTGCCAGGCT 
GGACCACCAT 
CCCCGCGCTG 
ACCACAAGTA 
CTGGCCCCCA 
GCCTCTGGCC 
GGCACTGACG 
CTCCTGCTGG 
GAGAAGAAGG 
GCTGAGAGCA 
TGAGGTGGCC 
ATTCCAAAGG 
TAAGTCTCCT 
TCCTGAAGTG 
TTGTTACTCC 
AGGCTGGGGC 
AAAAGGTGAG 
AAACTAAATC 
CATGTTCGGA 
TTGCTAAATG 
GGTGGCAAGG 
TTTGCAAAGG 
TGAAGTCACA 
TGAATTAATT 
ACTAACAAGG 
CCTGCCTGTG 
CAGAGGAAGC 
ATGATGCACT 
CAGTTGATGC 
AGGGAGCTAG 
TTGGGGGTAA 
AAAAACTGCT 
TTCTCCCTAT 
ACACACACAC 
TGGAACACAT 
TGGGAGGCCT 
GAAATGTGTC 
TAGCAATGCA 
GGTTAATACT 
AAACTAAAGG 
ATTTGGAACT 
TTTAACACTT 
CTCTAGGTGA 
ATTCAGAGGA 
AATGCAAATG 
TGAAGGAGAC 
TGGGATCCAA 
ATCGTCTGTT 
GTGGAACCCA 
GCAGGCTGGG 
GGGGAGAGCC 
CTCAAACACG 
TTGTTTGGGG 
CATTAACAAA 
AATCCAAGGG 



21 
I 

CCTTCTCTGC 
AGCCGAGATA 
AGACCGGGAT 
CCGGGGOGGG 
CGGCCCACTC 
GCAACCTGGC 
AGCTCCTGAG 
AGCGAGAGAA 
GCGGGGGGGA 
CCAAGCGCTG 
CCAAGTGGTC 
TCGTGTGCAA 
AGGTGACCTA 
CCTCTGCGGC 
TCCTGTGCAA 
TCAGCCCCAA 
GGGGGGATGG 
TGACCTGTGC 
TCCTGGGACC 
CGAGTCAGCT 
GTGTCAACAC 
CTGGAGA6GG 
AGGGCTGCAC 
CCGGGGAGGA 
TCTGCGACAG 
GGGTGCTGGC 
CTGGGCCCCC 
CAACAGCCAG 
GACCTTCGCT 
GTGGGTCCTC 
CCCAGGAGCC 
GGCAAAAGCT 
CCCTGGCTCT 
AGAAGAAGCC 
GGGCCATGGA 
CTAGAGACAC 
GGCACCCACA 
CCTTAAAGGC 
GAAGCTGTGT 
CCCTCCCTTT 
TAAGGGGCTC 
TTGCTCATGC 
AATTAATTAT 
CTGGAAACAT 
TGATACTGTT 
AGGCAGGAAG 
AAGCTTGAAA 
TAATCTACGG 
CATCCAAATG 
AAACAAATTC 
GCCCCGCCTC 
CCTGCAGAAA 
GTGTTTTGAA 
AGCATCCTGA 
TCAGGCAGTT 
GGAGGGAAGG 
CAAAGCCATT 
GATAATGCAG 
ACACACACAC 
TCCTGGGGGT 
CATGTCTCCT 
TCCCTCCAAG 
TCGGTCTCTG 
CTTGCCAGTT 
GGCAGGGGAA 
TCTTTTTAAA 
AAGGGAGACA 
TGTGTGAAGC 
TTCTAAAGAC 
CAACATTTCT 
ATTTCTGTCC 
CGAGCCCTCC 
CTCTGAATGC 
CAAGGGGAGA 
TCCCTCTCCT 
AGCAACCCAA 
TGCTTTATTC 
GATCTTGGGG 
TCGTCCTTGT 
TCACAGTATG 



31 
I 

GCCGGAGTGG 
GAAGCTCCTG 
GGCCACCTCC 
GACGGGAGCT 
GGGCAAGCTG 
CACTGTGAAG 
GCGGGAGGCA 
GGGCAAGTGC 
GGACACGCCT 
TGTGTCTCTG 
TGAGGGCCCC 
GTTCAGCTTC 
CACCACCCCC 
CAATGTAGCC 
GGAGAAGGCC 
GTATGGCTGC 
CTCCTTCCTC 
CTCTCGAAAC 
CCATGGGAAA 
GGACTGTGTG 
CCCTGGGGGC 
GGCCTGTCAG 
CAACACAGAT 
CGGGACTCAG 
CTTGTGCTTC 
CCCAAATGGG 
CGATGAGGAG 
TCCCACAAGG 
GTCATCTGAC 
AGGCGTCTGG 
TGCAGGTGGG 
GCTTTTATTC 
GGGGCTACTG 
CCAGAATGCG 
GAACCAGTAC 
TAGAGTCACC 
TTTTTTTGAA 
CCCTTGGAAC 
GTTGGCGTGC 
TCAAATTCCA 
CCCTGAATAT 
TGATTAGGAT 
TCAATTAGGT 
TTCTTTACAT 
GACATCCTCC 
TGCCTCTTTA 
AATATGAGAA 
GGCTAGGGCG 
TACTGAGGTT 
AAGGACAACC 
CACTTCATCC 
GTTCCATCAG 
AGTTGTCATT 
GATTTTAAAT 
TGCTTAAGGA 
AAGAGGGAAA 
TAAATTATAT 
TCGATAGTGT 
ACACACACAC 
CACCGATGGT 
GTGGGCTTTT. 
GCCCCAAAGC 
AGGTGACACT 
TTGAAATATA 
GTGAAAGGAA 
GTCATCTCAT 
AAGGCTGTGT 
TGGGCAGTCT 
ATGGCTGGAT 
CCCTCTGGGC 
TTGGCTTCCC 
TATTTTCACA 
AGCCCTGTTC 
AATGGGTGAT 
GCTGGGTGGT 
CCAGACAGCT 
TCCTGTTTAT 
CTACAGTAGT 
CCTGAGGGGC 
GGGAGAGGTG 
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CTGCAGCTCA 
TCGCCGCTGG 
ATGGGCCTGC 
GACACGGAGG 
AGCGCTGCCG 
AGCAAGGAGG 
GCCCTGACGG 
CTGGACCCTA 
TACTCTAACT 
CTGCTGGACC 
TGTGGGAGCC 
AAAGGCATGT 
TTCCAGACCA 
TGTGGGGAAG 
CCCGATGTGT 
AACTTCAACA 
TGCGGCTGCC 
CCTTGCAGCT 
AACTACACGT 
GACGTGGATG 
TtCCGCTGCG 
GATGTGGATG 
GGCTCATTTC 
TGCCAGGACG 
AACACACAAG 
GTCTCTTGCA 
GACAAAGGAG 
GGCCCCGAGG 
GCCCCCATCA 
AGGGAGCCCA 
GACTCCTCCG 
TACATCCTAG 
GTCTATCGCA 
GCAGACAGTT 
AGTCCGACAC 
AGCCACCATC 
AGACTGGACT 
ATGCAGGTAT 
CACGGTGGGG 
ATGTGACCAA 
CTTCTCTGCT 
TGAAATGATT 
AAGAAGATCT 
TTGCATTCCT 
AGAATGGCCA 
GTTCTTACAT 
AAGTTGCTTG 
AGAGAGGCCA 
ACCACACACT 
TGTCTTTGAG 
TGCCCGGAAT 
GCTGTTTCCT 
TTAAAGCATT 
CCTGAAGTGT 
ACTTTTGTTC 
GAGATGACTA 
CCTCATTTTA 
GCACTCTTTC 
AGAGACACGG 
CAGAGTCACT 
TACCACCACT 
CTCAGAGAAA 
CTGGAGTGGT 
GATGCTATGG 
AGATGGAGGT 
GGTCTCCAGT 
CCATTTGGCA 
GTGGTGTGGA 
GCGCTGCTGA 
CTTGAAAATC 
ACAGCCCCAA 
GTGTTCTGAT 
TCAACAACAG 
AAAGAATCCA 
GCTTTCT CTT 
CAGGTTGTGC 
TTTTGCTGTT 
GGGTAAACAA 
CCCAGCTTGC 
CACCCTGCCA 
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CCCCTCAGCT 
GCTTCTCGCC 
TGCTGCTGCT 
CGGTGGTCTG 
AGGCCCAGAA 
AGGCCCAGCA 
CGAGGATGAG 
GTCTGCCGCT 
GGCACAAGGA 
TGTCCCAGCC 
CAGGCTCCCC 
GCCGGCCTCT 
CCAGTTCCTC 
GTGACAAGGA 
TCGACTGGGG 
ATGGGGGCTG 
GACCAGGATT 
CCAGCCCATG 
GCCGCTGCCC 
AATGCCAGGA 
AATGCTGGGT 
AGTGTGCTCT 
ACTGCTCCTG 
TGGATGAGTG 
GGTCCTTCCA 
CCATGGGGCC 
AGAAAGAAGG 
GCACCCCCAA 
CATCTGCCCC 
GCATCCATCA 
TGGCCACACA 
GCACCGTGGT 
AGCGGAGAGC 
ACTCCTGGGT 
CTGGGACAGA 
CTCAGAGCTT 
GGAATCTTAG 
TTTCTACGGG 
ATTTCGTGAC 
TTCCGGATCA 
CACTTCCACC 
TGTTTCTCTT 
GGTTTTTTGG 
CCATTTCGCC 
GAAGTGCAAT 
TTCTAATAGC 
AAGTGCATTA 
GGGATTTGTT 
TGACTACGGA 
CCAGGGCAGG 
GCCAGTGCTC 
AAAGGATGTG 
TTAGCACAGT 
GGGTGGCGCA 
TCTGTCTCTT 
ACTAAAATCA 
AAAGTTACAT 
TCTCTCTCTC 
CACCATTCTG 
AGAAGTTACC 
GTGCAGGAGA 
GGGTGTTTCT 
TGAAGGGCCA 
TTCAGATTGT 
TTTGTGCGGC 
TTTCAGTTGG 
AAACTTCCTT 
GAGCAGCCAT 
CCAACATCAG 
CTTGCCCTTA 
CGCAGTCTGT 
TGCTCTCACA 
GGAGGTCATG 
GTTCCTCAAA 
GCACACCACT 
ATCTGATGGA 
ACTTTGAAGC 
ATGCCCACCG 
TCGGGCGTGG 
CCTGCTAACT 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13B0 
1440 
1500 
1560 
1620 
1660 
1740 
1S0O 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
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TCTCGCTAGA CACAGTGTTT CTGCCCAGGT QACCTGTTCA GCAGCAGAAC AAGCCAGGGC 4B60 

CATGGGGACG GGGGAAGTTT TCACTTGGAG ATGGACACCA AGACAATGAA GATTTGTTGT 4920 

CCAAATAGGT CAATAATTCT GGGAGACTCT TGGAAAAAAC TOAATATATT CAGGACCAAC 4980 

TCTCTCCCTC CCCTCATCCC ACATCTCAAA GCAGACAATG TAAAGAGAGA ACATCTCACA 5040 

CACCCAGCTC GCCATGCCTA CTCATTCCTG AATTTCAGGT GCCATCACTG CTCTTTCTTT 5100 

CTTCTTTGTC ATTTGAGAAA GGATGCAGGA GGACAATTCC CACAGATAAT CTGAGGAATG 5160 

CAGAAAAACC AGGGCAGGAC AGTTATCGAC AATGCATTAG AACTTGGTGA GCATCCTCTG 5220 

TAGAGGOACT CCACCCCTGC TCAACAGCTT GGCTTCCAGG CAAGACCAAC CACATCTGGT 5280 

CTCTGCCTTC GGTGGCCCAC ACACCTAAGC GTCATCGTCA TTGCCATAGC ATCATGATGC 5340 

AACACATCTA CGTGTAGCAC TACGACGTTA TGTTTGGGTA ATGTGGGGAT GAACTGCATG 5400 

AGGCTCTGAT TAAGGATGTG GGGAAGTGGG CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA GGAGCAAGGC TTCAGGAAGG GCCAGCCACA 5520 

TGCCATCTTC CCTGCGATCA GGCAAAAAAG TGGAATTAAA AAGTCAAACC TTTATATGCA 5580 

TGTGTTATGT CCATTTTGCA GGATGAACTG AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

TTGCTTTGTC TTTTCCATCC TCATCACAAG CCCTTGTTTG AGTGTCTTAT CCCTGAGCAA 5700 

TCTTTCGATG GATGGAGATG ATCATTAGGT ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

ATTT CTGTG A AAACTAGGAG AACAGAGATG AGATTTGACA AAAAAAAATT GAATTAAAAA 5820 

TAACACAGTC TTTTTAAAAC TAACATAGGA AAGCCTTTCC TATTATTTCT CTTCTTAGCT 5880 

TCTCCATTGT CT AAA T CAGG AAAACAGGAA AACACAGCTT TCTAGCAGCT GCAAAATGGT 5940 

TTAATGCCCC CTACATATTT CCATCACCTT GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

TATGATCCCA GAAAACATCT GTCTCTACTT CGGCTGCAAA ACCCATGGTT TAAATCTATA 6060 

TGGTTTGTGC ATTTTCTCAA CTAAAAATAG AGATGATAAT CCGAATTCTC CATATATTCA 6120 

CTAATCAAAG ACACTArTTT CATACTAGAT TCCTGAGACA AATACTCACT GAAGGGCTTG 6180 

TTTAAAAATA AATTGTGTTT TGGTCTGTTC TTGTAGATAA TGCCCTTCTA TTTTAGGTAG 6240 

AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG CTCTTATCTG CAAGGTGGCA AGCAGTTCTT 6300 

TTCAGCAGAT TTTGCCCACT ATTCCTCTGA GCTGAAGTTC TTTG CAT AGA TTTGGCTTAA 6360 

GCTTGAATTA GATCCCTGCA AAGGCTTGCT CTGTG ATGTC AGATGTAATT GTAAATGTCA 6420 

GTAATCACTT CATGAATGCT AAATGAGAAT GTAAGTATTT TTAAATGTGT GTATTTCAAA 6480 

TTTGTTTGAC TAATTCTGGA ATTACAAGAT TTCTATGCAG GATTTACCTT CATCCTGTGC 6540 

ATGTTTCCCA AACTGTGAGG AGGGAAGGCT CAGAGATCGA GCTTCTCCTC TGAGTTCTAA 6600 

CAAAATGGTG CTTTGAGGGT CAGCCTTTAG GAAGGTGCAG CTTTGTTGTC CTTTGAGCTT 6660. 
TCTGTTATGT GCCTATCCTA ATAAACTCTT AAACACATT 

Seq ID NOt 345 Protein sequence 
Protein Accession #: NP_036204 

1 11 21 31 41 51 

| I I I I I 

MATSMGLLLL LLLLLTQPGA GTGADTEAW CVGTACYTAH SGKLSAAEAQ NHCNQNGGNL 60 

ATVKSKEEAQ HVQRVLAQLL RREAALTARM SKFWIGLQRE KGKCLDPSLP LKGFSWVGGG 120 

EDTPYSNWHK ELRNSCISKR CVSLLIiDLSQ PLLPNRLPKW SEGPCGSPGS PGSNIEGFVC 180 

KFSFKGMCRP LALGGPGQVT YTTPFQTTSS SLEAVPFASA ANVACGEGDK DETQSHYFLC 240 

KEKAPDVFDW GSSGPLCVSP KYGCNFNNGG CHQDCFEGGD GSFLCGCRPG FRLLDDLVTC 300 

ASRNPCSSSP CRGGATCVLG PHGKNYTCRC PQGYQLDSSQ LDCVDVDECQ DSPCAQECVN 360 

TPGGFRCECW VGYEPGGPGE GACQDVDECA LGRSPCAQGC TNTDGSFHCS CEEGYVLAGE 420 

DGTQCQDVDE CVGPGGPIiCD SLCFNTQGSF HCGCLPGWVL APNGVSCTMG PVSLGPPSGP 480 

PDEEDKGEKE GSTVPRAATA SPTRGPEGTP KATPTTSRPS LSSDAPITSA PLKMLAPSGS 540 

SGVWREPSIH HATAASGPQE PAGGDSSVAT QNNDGTDGQK LLLFYILGTV VAILLLLALA 600 
LGLLVYRKRR AKREEKKEKK PQNAADSYSW VPERAESRAM ENQYSPTPGT DC 

Seq ID NO* 346 DNA sequence 
Nucleic Acid Accession #i Z31560 
Coding sequence: <l-966 

1 11 21 31 41 51 

1 I I I 1 I 

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300 

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480 

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600 

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 780 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 
AAAAA 

Seq ID NO: 347 Protein sequence 
Protein Acceosion CAA8343S 

1 11 21 31 41 51 

I 1 I I I I 

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60 

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPF IDEAKR LRALHMKEHP DYKYRPRRKT 120 

KTLMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180 

QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 240 

LGSMGSWKS EASSSPPWT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300 
HYQSGPVPGT AINGTXPLSH M 
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Seq ZD NO: 34 B DNA sequence 
Nucleotide Accession ft: NM_002638 
Coding sequence: 120-473 

1 11 21 31 41 51 

1 I I I 1 I 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60 
GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 
TGAGGGCCAG CAGCTTCTTG ATCGTGQTGG TGTTCCTCAT OGCTGGGACG CTGGTTCTAG 180 
AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 
TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 
CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 
TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 
TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TG AAGGG AG C 480 
CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 
TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 

Seq ID NO: 34 9 Protein sequence: 
Protein Accession #: NP_ 002629 

1 11 21 31 41 51 

I I I I I I 

MRASSFLIW VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60 
AQEPVKGPVS TKPGSCPIIL IRCAMIiNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 



Seq ID NO: 350 DNA sequence 
Nucleic Acid Accession #: NM_007183 
Coding sequence: 75-2468 

1 11 21 31 41 51 

I I I I I I 

GAATTCCGGA CAGGACGTGA AGATAGTTGG GTTTGGAGGC GGCCGCCAGG CCCAGGCCCG 60 

GTGGACCTGC CGCCATGCAG GACGGTAACT TCCTGCTGTC GGCCCTGCAG CCTGAGGCCG 120 

GCGTGTGCTC CCTGGCGCTG CCCTCTGACC TGCAGCTGGA CCGCCGGGGC GCCGAGGGGC 180 

CGGAGGCCGA GCGGCTGCGG GCAGCCCGCG TCCAGGAGCA GGTCCGCGCC CGCCTCTTGC 240 

AGCTGGGACA GCAGCCGCGG CACAACGGGG CCGCTGAGCC CGAGCCTGAG GCCGAGACTG 300 

CCAGAGGCAC ATCCAGGGGG CAGTACCACA CCCTGCAGGC TGGCTTCAGC TCTCGCTCTC 360 

AGGGCCTGAG TGGGGACAAG ACCTCGGGCT TCCGGCCCAT CGCCAAGCCG GCCTACAGCC 420 

CAGCCTCCTG GTCCTCCCGC TCCGCCGTGG ATCTGAGCTG CAGTCGGAGG CTGAGTTCAG 480 

CCCACAATGG GGGCAGCGCC TTTGGGGCCG CTGGGTACGG GGGTGCCCAG CCCACCCCTC 540 

CCATGCCCAC CAGGCCCGTG TCCTTCCATG AGCGCGGTGG GGTTGGGAGC CGGGCCGACT 600 

ATGACACACT CTCCCTGCGC TCGCTGCGGC TGGGGCCCGG GGGCCTGGAC GACCGCTACA 660 

GCCTGGTGTC TGAGCAGCTG GAGCCCGCGG CCACCTCCAC CTACAGGGCC TTTGCGTACG 720 

AGCGCCAGGC CAGCTCCAGC TCCAGCCGGG CAGGGGGGCT GGACTGGCCC GAGGCCACTG 780 

AGGTTTCCCC GAGCCGGACC ATCCGTGCCC CTGCCGTGCG GACCCTGCAG CGATTCCAGA 840 

GCAGCCACCG GAGCCGCGGG GTAGGCGGGG CAGTGCCGGG GGCCGTCCTG GAGCCAGTGG 900 

CTCGAGCGCC ATCTGTG CGC AGCCTCAGCC TCAGCCTGGC TGACTCGGGC CACCTGCCGG 960 

ACGTGCATGG GTTCAACAGC TACGGTAGCC ACCGAACCCT GCAGAGACTC AGCAGCGGTT 1020 

TTGATGACAT TGACCTGCCC TCAGCAGTCA AGTACCTCAT GGCTTCAGAC CCCAACCTGC 1080 

AGGTGCTGGG AGCGGCCTAC ATCCAGCACA AGTGCTACAG CGATGCAGCC GCCAAGAAGC 1140 

AGGCCCGCAG CCTTCAGGCC GTGCCTAGGC TGGTGAAGCT CTTCAACCAC GCCAACCAGG 1200 

AAGTGCAGCG CCATGCCACA GGTGCCATGC GCAACCTCAT CTACGACAAC GCTGACAACA 1260 

AGCTGGCCCT GGTGGAGGAG AACGGGATCT TCGAGCTGCT GCGGACACTG CGGGAGCAGQ 1320 

ATGATGAGCT TCGCAAAAAT GTCACAGGGA TCCTGTGGAA CCTTTCATCC AGCGACCACC 1380 

TGAAGGACCG CCTGGCCAGA GACAOGCTGG AGCAGCTCAC GGACCTGGTG TTGAGCCCCC 1440 

TGTCGGGGGC TGGGGGTCCC CCCCTCATCC AGCAGAACGC CTCGGAGGCG GAGATCTTCT 1500 

ACAACGCCAC CGGCTTCCTC AGGAACCTCA GCTCAGCCTC TCAGGCCACT CGCCAGAAGA 1560 

TGCGGGAGTG CCACGGGCTG GTGGACGCCC TGGTCACCTC TATCAACCAC GCCCTGGACG 1620 

CGGGCAAATG CGAGGACAAG AGCGTGGAGA ACGCGGTGTG CGTCCTGCGG AACCTGTCCT 1680 

ACCGCCTCTA CGACGAGATG CCGCCGTCCG CGCTGCAGCG GCTGGAGGGT CGCGGCCGCA 1740 

GGGACCTGGC GGGGGCGCCG CCGGGAGAGG TCGTGGGCTG CTTCACGCCG CAGAGCCGGC 1800 

GGCTGCGCGA GCTGCCCCTC GCCGCCGATG CGCTCACCTT CGCGGAGGTG TCCAAGGACC 1860 

CCAAGGGCCT CGAGTGGCTG TGGAGCCCCC AGATCGTGGG GCTGTACAAC CGGCTGCTGC 1920 

AGCGCTGCGA GCTCAACCGG CACACGACGG AGGCGGCCGC CGGGGCGCTG CAGAACATCA 1980 

OGGCAGGCGA CCGCAGGTGG GCGGGGGTGC TGAGCCGCCT GGCCCTGGAG CAGG AG CGT A 2040 

TTCTGAACCC CCTGCTAGAC CGTGTCAGGA CCGCCGACCA CCACCAGCTG CGCTCACTGA 2100 

CTGGCCTCAT CCGAAACCTG TCTCGGAACG CTAGGAACAA GGACGAGATG TCCACGAAGG 2160 

TGGTGAGCCA CCTGATCGAG AAGCTGCCAG GCAGCGTGGG TGAGAAGTCG CCCCCAGCCG 2220 

AGGTGCTGGT CAACATCATA GCTGTGCTCA ACAACCTGGT GGTGGCCAGC CCCATCGCTG 2280 

CCCGAGACCT GCTGTATTTT GACGGACTCC GAAAGCTCAT CTTCATCAAG AAGAAGCGGG 2340 

ACAGCCCCGA CAGTGAGAAG TCCTCCCGGG CAGCATCCAG CCTCCTGGCC AACCTGTGGC 2400 

AGTACAACAA GCTCCACCGT GACTTTCGGG CGAAGGGCTA TCGGAAGGAG GACTTCCTGG 2460 

GCCCATAGGT GAAGCCTTCT GGAGGAGAAG GTGACGTGGC CCAGCGTCCA AGGGACAGAC 2520 

TCAGCTCCAG GCTGCTTGGC AGCCCAGCCT GGAGGAGAAG GCTAATGACG GAGGGGCCCC 2580 

TCGCTGGGGC CCCTGTGTGC ATCTTTGAGG GTCCTGGGCC ACCAGGAGGG GCAGGGTCTT 2640 

ATAGCTGGGG ACTTGGCTTC CGCAGGGCAG GGGGTGGGGC AGGGCTCAAG GCTGCTCTGG 2700 

TGTATGGGGT GGTGACCCAG TCACATTGGC AGAGGTGGGG GTTGGCTGTG GCCTGGCAGT 2760 

ATCTTGGGAT AGCCAGCACT GGGAATAAAG ATGGCCATGA ACAGTCACAA AAAAAAAAAA 2820 
AAAAGGAATT C 

Seq ID NO: 351 Protein sequence 
Protein Accession #i NP_009114.1 

1 11 21 31 41 51 
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WO 02/086443 
I I I 

MQDGNFLLSA LQPEAGVCSL ALPSDLQLDR 
PRHNGAAEPE PEAETARGTS RGQYHTLQAG 
SRSAVDLSCS RRLSSAHNGG SAFGAAGYGG 
LRSI*RLGPGG LDDRYSLVSE QLEPAATSTY 
RTIRAPAVRT LQRFQSSHRS RGVGGAVPGA 
NSYGSHRTLQ RLSSGFDDID LPSAVKYLMA 
QAVPRLVFCLF NHANQEVQRH ATGAMRNL I Y 
KNVTGILWNL SSSDHLKDRL ARDTLEQLTD 
FLRNLSSASQ ATRQKMRECH GLVDALVTSI 
EMPPSALQRL EGRGRRDLAG APPGEWGCP 
WLWSPQIVGL YNRLLQRCEL NRHTTEAAAG 
LDRVRTADHH QLRSLTGLIR NLSRNARNKD 
IIAVLNNLW ASPIAARDLL YFDGLRKLIF 
HRDPRAKGYR KEDPLGP 



RGAEGPEAER 
FSSRSQGLSG 
AQPTPPMPTR 
RAFAYERQAS 
VLEPVARAPS 
SDPNLQVLGA 
DNADNKLALV 
LVLSPLSGAG 
NHALDAGKCE 
TPQSRRLREL 
ALQNITAGDR 
EMSTKWSHL 
IKKXRDSPOS 



I 

LRAARVQEQV 
DKTSGFRPIA 
PVSFHBRGGV 
SSSSRAGGLD 
VRSLSLSLAD 
AYIQHKCYSD 
EENGIFELLR 
GPPLIQQNAS 
DKSVENAVCV 
PLAADALTFA 
RWAGVLSRLA 
IEKLPGSVGE 
EKSSRAASSL 



1 

RARLLQLGQQ 
KPAYSPASWS 
GSRADYDTLS 
WPEATEVSPS 
SGHLPDVHGP 
AAAKKQARSL 
TLREQDDELR 
EAEIFYNATG 
LRNLSYRLYD 
EVSKDPKGLE 
LEQERILNPL 
KSPPAEVLVN 
LANLWQYNKL 



PCT/US02/12476 



Seq ID NO: 352 DNA sequence 
Nucleic Acid Accession ft: M31469 
Coding sequence; 1-651 



1 

I 

ATGGCTGCGC 
ACTGGAAAAA 
GCCACCTTGG 
TTCAATGTAT 
ATCCAAGCCC 
GTGCCTAACT 
GGCAACAAAG 
AAGAAGAATC 
TTCCTCTGGC 

TTAGAGGTTG 



11 
I 

AGGGAGAGCC 
CGACCTTCGT 
GTGTTGAGGT 
GGGACACAGC 
AGTGTGCCAT 
GGCATAGAGA 
TGGATATTAA 
TTCAGTACTA 
TTGCTAGGAA 
CACCAGAAGT 
CTCAGACAAC 



21 
I 

CCAGGTCCAG 
GAAACGTCAT 
TCATCCCCTA 
CGGCCAGGAG 
CATAATGTTT 
TCTGGTACGA 
GGACAGGAAA 
CGACATTTCT 
GCTCATTGGA 
TGTCATGGAC 
TGCTCTCCCG 



31 
I 

TTCAAACTTG 
TTGACTGGTG 
GTGTT CCACA 
AAATTCGGTG 
GATGTAACAT 
GTGTGTGAAA 
GTGAAGGCGA 
GCCAAAAGTA 
GACCCTAACT 
CCAGCTTTGG 
GATGAGGATG 



41 
I 

TATTGGTTGG 
AATTTGAGAA 
CCAACAGAGG 
GACTGAGAGA 
CGAGAGTTAC 
ACATCCCCAT 
AATCCATTGT 
ACTACAACTT 
TGGAATTTGT 
CAGCACAGTA 
ATGACCTGTG 



51 
I 

TGATGGTGGT 
GAAGTATGTA 
ACCTATTAAG 
TGGCTATTAT 
TTACAAGAAT 
TGTGTTGTGT 
CTTCCACCGA 
TGAAAAGCCC 
TGCCATGCCT 
TGAGCACGAC 
A 



Seq ID NO: 353 Protein sequence 
Protein Accession ft: AAA36546 



11 



21 31 41 51 

I ! ! I I I 

MAAQGEPQVQ FKLVLVGDGG TGKTTFVKRH LTGEFEKKYV ATLGVEVHPL VFHTNRGP I K 

FNVWDTAGQE KFGGLRDGYY IQAQCAIIMF DVTSRVTYKN VPNWHRDLVR VCENIPlVliC 

GNKVDIKDRK VKAKSIVFHR KKNLQYYDIS AKSNYNFEKP FLWLARKLIG DPNLEFVAMP 

ALAPPEWMD PALAAQYEHD LEVAQTTALP DEDDDI* 



Seq ID NOx 354 DNA sequence 
Nucleic Acid Accession ft: NM_002820 
Coding sequence: 304-831 



CCGGTTCGCA 
CCCTGTTCCA 
CGTGTAAACA 
TTCAGAGGAA 
GTTTGGAGAA 
ACGATGCAGC 
GTGCCCTCCT 
GAACATCAGC 
CTTCACCATC 
CCTAACTCCA 
GAGGGCAGAT 
AAGACACCTG 
AAACGGCGAA 
GACCACCTGT 
CTGGCCCGTA 
GCTTGGACAA 
CAGAGAATAA 
TGTCCTCCAG 
CATCAATCCT 
ATCTTCATAA 
TTCTTCAGTG 
GAT ATT AT CT 
ACTTTTTATT 
TAAATTATGT 
CCAGCTCATA 
GGTTTTTCTC 
CCGTAGGAAA 



11 
I 

AAGAAGCTGA 
CGAACCCAGG 
CACTACTTAT 
GCGCCTCTGA 
AGCACAGTTG 
GGAGACTGGT 
GCGGGCGCTC 
TCCTCCATGA 
TGATCGCAGA 
AGCCCTCTCC 
ACCTAACTCA 
GGAAGAAAAA 
CTCGCTCTGC 
CTGACACCTC 
GCCTCAGCGG 
ACCTAGAATT 
CTCAGAATAT 
CACCATAGAG 
TTACCACTCT 
TTTGCTGGAG 
TTTTTCATTT 
ACAAACACTG 
TAATTAAATG 
TTTAAACACA 
CAAAATAAAT 
ATGTATCTTT 
AATAAAACTT 



21 
I 

CTTCAGAGGG 
AGAACTGCTG 
CATTGATGCA 
TTTGTTTCTT 
GAGTAG CCGG 
TCAGCAGTGG 
GGTGGAGGGT 
CAAGGGGAAG 
AATCCACACA 
CAACACAAAG 
GGAAACTAAC 
GAAAGGCAAG 
CTGGTTAGAC 
CACAACGTCG 
GGTGCTCTCA 
TTCTCCCTTT 
TGTCTGCCTT 
AGGCGCTAGA 
ACCAAATAAT 
AAGTGTATTT 
CTTACGTTCT 
CAGAACAGCA 
TATTTAATTA 
TGCCTTAAAT 
GGTTTCTGAA 
TTGTTCATTG 
CACATTTAAA 



31 

t 

GGAAACTTTC 
GCCAGATTAA 
TATATAAAAC 
TTTTCCCTTT 
TTGCTAAATA 
AGCGTCGCGG 
CTCAGCCGCC 
TCCATCCAAG 
GCTGAAATCA 
AACCACCCCG 
AAGGTGGAGA 
CCCGGGAAAC 
TCTGGAGTGA 
CTGGAGCTCG 
GCTGGGTTTT 
ATGTATCTCT 
AAAGCAGTAC 
GCCCATTCCT 
TTCATATTCA 
CTTCCCCTTA 
TTCACTTCAA 
TCATGTCATA 
AATCTCAAAT 
TTGTTTAATT 
AATGTTTAAG 
GCAAGATGAA 
AAAAA 




AGTCCCGAGC 
TGTTCCTGCT 
GCCTCAAAAG 
ATTTACGGCG 
GAGCTACCTC 
TCCGATTTGG 
CGTACAAAGA 
GCAAGGAGCA 
CTGGGAGTGG 
ATTCACGGTA 
GGAGCCTCCC 
ATCGATTGTG 
CCCCCTACCA 
CTTTCTCCAC 
AGCTTCAGAA 
CTCTCACACC 
GGGAGAATAT 
AACGATTCTG 
TTATTTTAAT 
AAATTTAACT 
TATTAACTTA 
ATAATTTTTC 



51 
1 

AGGCGGTTAG 
CTATGGGAGA 
TCGCTATTAT 
TGGCTGTGTG 
GCGAGCGGAG 
GAGCTACGCG 
AGCTGTGTCT 
ACGATTCTTC 
GGAGGTGTCC 
GTCTGATGAT 
GCAGCCGCTC 
GGAAAAGAAA 
GCTAGAAGGG 
ACAGGCTTCT 
TTCTG CCTTG 
TAGCAATTGA 
CACACACCCC 
CGTCACCCAA 
GCTAGTGACC 
TGGGCAAACT 
AGAAGCATTT 
AGCCATTCAC 
GTAAAGAACT 
CTGGTTTCTA 
CAAGGATATA 
T AG GGTAATG 



Seq ID NO: 355 Protein sequence 
Protein Accession fts NM_002820 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



1 11 21 31 41 51 

I I 1 I 1 I 

MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 
HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 



60 
120 



319 



WO 02/086443 

TPGKKKKGKF GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 



PCT/US02/12476 



Seq ID NO: 3S6 DNA sequence 
Nucleic Acid Accession #: NMJ317522 
5 Coding sequence: 1-2100 

1 ll 21 31 41 51 

I I I I I 

ATGGGCCTCC CCGAGCCGGG CCCTCTCCGG CTTCTGGCGC TGCTGCTGCT GCTGCTGCTG 60 

10 CTGCTGCTGC TGCGGCTCCA GCATCTTGCG GCGGCAGCGG CTGATCCGCT GCTCGGCGGC 120 

CAAGGGCCGG CCAAGGAGTG CGAAAAGGAC CAATTCCAGT GCCGGAACGA GCGCTGCATC 180 

CCCTCTGTGT GGAGATGCGA CGAGGACGAT GACTGCTTAG ACCACAGCGA CGAGGACGAC 240 

TGCCCCAAGA AGACCTGTGC AGACAGTGAC TTCACCTGTG ACAACGGCCA CTGCATCCAC 300 

GAACGGTGGA AGTGTGACGG CGAGGAGGAG TGTCCTGATG GCTCCGATGA GTCCGAGGCC 360 

15 ACTTGCACCA AGCAGGTGTG TCCTGCAGAG AAGCTGAGCT GTGGACCCAC CAGCCACAAG 420 

TGTGTACCTG CCTCGTGGCG CTGCGACGGG GAGAAGGACT GCGAGGGTGG AGCGGATGAG 480 

GCCGGCTGTG CTACCTCACT GGGCACCTGC CGTGGGGACG AGTTCCAGTG TGGGGATGGG 540 

ACATGTGTCC TTGCAATCAA GCACTGCAAC CAGGAGCAGG ACTGTCCAGA TGGGAGTGAT 600 

GAAGCTGGCT GCCTACAGGG GCTGAACGAG TGTCTGCACA ACAATGGCGG CTGCTCACAC 660 

20 ATCTGCACTG ACCTCAAGAT TGGCTTTGAA TGCACGTGCC CAGCAGGCTT CCAGCTCCTG 720 

GACCAGAAGA CTTGTGGCGA CATTGATGAG TGCAAGGACC CAGATGCCTG CAGCCAGATC 780 

TGTGTCAATT ACAAGGGCTA TTTTAAGTGT GAGTGCTACC CTGGCTGCGA GATGGACCTA 840 

CTGACCAAGA ACTGCAAGGC TGCTGCTGGC AAGAGCCCAT CCCTAATCTT CACCAACCGC 900 

ACGAGTGCGG AGGATCGACC TGTGAAGCGG AACTATTCAC GCCTCATCCC CATGCTCAAG 960 

25 AATGTCGTGG CACTAGATGT GGAAGTTGCC ACCAATCGCA TCTACTGGTG TGACCTCTCC 1020 

TACCGTAAGA TCTATAGCGC CTACATGGAC AAGGCCAGTG ACCCGAAAGA GCGGGAGGTC 1080 

CTCATTGACG AGCAGTTGCA CTCTCCAGAG GGCCTGGCAG TGGACTGGGT CCACAAGCAC 1140 

ATCTACTGGA CTGACTCGGG CAATAAGACC ATCTCAGTGG CCACAGTTGA TGGTGGCCGC 1200 

CGACGCACTC TCTTCAGCCG TAACCTCAGT GAACCCCGGG CCATCGCTGT TGACCCCCTG 1260 

30 CGAGGGTTCA TGTATTGGTC TGACTGGGGG GACCAGGCCA AGATTGAGAA ATCTGGGCTC 1320 

AACGGTGTGG ACCGGCAAAC ACTGGTGTCA GACAATATTG AATGGCCCAA CGGAATCACC 1380 

CTGGATCTGC TGAGCCAGCG CTTGTACTGG GTAGACTCCA AGCTACACCA ACTGTCCAGC 1440 

ATTGACTTCA GTGGAGGCAA CAGAAAGACG CTGATCTCCT CCACTGACTT CCTGAGCCAC 1500 

CCTTTTGGGA TAGCTGTGTT TGAGGACAAG GTGTTCTGGA CAGACCTGGA GAACGAGGCC 1560 

35 ATTTTCAGTG CAAATCGGCT CAATGGCCTG GAAATCTCCA TCCTGGCTGA GAACCTCAAC 1620 

AACCCACATG ACATTGTCAT CTTCCATGAG CTGAAGCAGC CAAGAGCTCC AGATGCCTGT 1680 

GAGCTGAGTG TCCAGCCTAA TGGAGGCTGT GAATACCTGT GCCTTCCTGC TCCTCAGATC 1740 

TCCAGCCACT CTCCCAAGTA CACATGTGCC TGTCCTGACA CAATGTGGCT GGGTCCAGAC 1800 

ATGAAGAGGT GCTACCGAGA TGCAAATGAA GACAGTAAGA TGGGCTCAAC AGTCACTGCC I860 

40 GCTGTTATCG GGATCATCGT GCCCATAGTG GTGATAGCCC TCCTGTGCAT GAGTGGATAC 1920 

CTGATCTGGA GAAACTGGAA GCGGAAGAAC ACCAAAAGCA TGAATTTTGA CAACCCAGTC 1980 

TACAGGAAAA CAACAGAAGA AGAAGATGAA GATGAGCTCC ATATAGGGAG AACTGCTCAG 2040 

ATTGGCCATG TCTATCCTGC ACGAGTGGCA TTAAGCCTTG AAGATGATGG ACTACCCTGA 2100 

GGATGGGATC ACCCCCTTCG TGCCTCATGG AATTCAGTCC CATGCACTAC ACTCCGGATG 2160 

45 GTGTATGACT GGATGAATGG GTTTCTATAT ATGGGTCTGT GTGAGTGTAT GTGTGTGTGT 2220 

GATTTTTTTT TTTAAATTTA TGTTGCGGAA AGGTAACCAC AAAGTTATGA TGAACTGCAA 2280 

ACATCCAAAG GATGTGAGAG TTTTTCTATG TATAATGTTT TATACACTTT TTAACTGGTT 2340 

GCACTACCCA TGAGGAATTC GTGGAATGGC TACTGCTGAC TAACATGATG CACATAACCA 2400 

AATGGGGGCC AATGGCACAG TACCTTACTC ATCATTTAAA AACTATATTT ACAGAAGATG 2460 

50 TTTGGTTGCT GGGGGGCTTT TTTAGGTTTT GGGCATTTGT TTTTTGTAAA TAAGATGATT 2520 
ATGCTTTGTG GCTATCCATC AACATAAGT 

Seq ID NO: 357 Protein sequence 
Protein Accession #* NP_059992 
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1 11 21 31 41 51 

I I I I I 1 

MGLPEPGPIiR r»T.&T.T.T.T,T,r.T, LLIiLHIiOHlA AAAADPLLGG OGPAKECEKD QFQCRNERCI 
PSVWRCDEDD DCLDHSDEDD CPKKTCADSD FTCDNGHCIH ERWKCDGEEE CPDG9DESEA 

60 TCTKQVCPAE KLSCGPTSHK CVPASWRCDG EKDCEGGADE AGCATSLGTC RGDEFQCGDG 

TCVLAIKHCN QEQDCPDGSD EAGCLQGLNE CLHNNGGCSH ICTDLKIGFE CTCPAGFQLL 
DQKTCGDIDE CKDPDACSQI CVNYKGYFKC ECYPGCEMDL LTKNCKAAAG KSPSLIFTNR 
TSABDRPVKR NYSRLIPMLK NWALDVEVA TNRIYWCDLS YRKIYSAYMD KASDPKEREV 
LIDEQLHSPE GLAVDWVHKH IYWTDSGNKT ISVATVDGGR RRTLFSRNLS EPRAIAVDPL* 

65 RGFMYWSDWG DQAKIEKSGL NGVDRQTLVS DNIEWPNGIT LDLLSQRLYW VDSKLHQLSS 

IDFSGGNRKT LISSTDFLSH PFGIAVFEDK VFWTDLENEA I FSANRLNGIt EISILAENIiN 
NPHDIVIFHE LKQPRAPDAC ELSVQPNGGC EYLCLPAPQI SSHSPKYTCA CPDTMWLGPD 
MKRCYRDANE DSKMGSTVTA AVIGIIVPIV VIALLCMSGY LIWRNWKRKN TKSMNFDNPV 
YRKTTEEEDE DELHI GRTAQ IGHVYPARVA LSLEDDGLP 

70 

Seq ID NO: 358 DNA sequence 
Nucleic Acid Accession #: M27826 
Coding sequence: <l-503 

75 1 11 21 31 41 51 

I I I I I I 

AGCCCAAGAA ACATCTCACC AATTTCAAAT CTGATCTATT CGGCTTAGCG ACTGAAGATT 
GACGCTGCCC GATCGCCTCG GAAGTCCCCT GGACCATCAC AGAAGCCGAG CTTCGGGTAA 
CTCTCACAGT GGAGGGTAAG TCCATCCCCT GTTTAATCGA TACGGGGGCT ACCCACTCCA 

80 CGTTGCCTTC TTTTCAAGGG CCTGTTTCCC TTGCCCCCAT AACTGTTGTG GGTATTGACG 

GCCAAGCTTC AAAACCCCTG AAAACTCCCC CACTCTGGTG CCAACTTGGA CAACACTCTT 
TTATGCACTC TTTTTTAGTT ATCCCCACCT GCCCACTTCC CTTATTAGGC CGAAATATTT 
TAACCAAATT ATCTGCTTCC CTGACTATTC CTGGAGTACA GCTACATCTC ATTGCTGCCC 
TTCTTCCCAA TCCAAAGCCT CCTTTGTGTC CTCTAACATC CCCACAATAT CAGCCCTTAC 

85 CACAAGACCT CCCTTCAGCT TAATCTCTCC CACTCTAGGT TCCCACGCCG CCCCTAATCC 

CACTTGAAGC AGCCCTGAGA AACATCGCCC ATTCTCTCTC CATACCACCC CCCAAAAATT 
TTCGCCGCTC CAACACTTCA ACACTATTTT GTTTTATTTG TCTTATTAAT ATCAGAAGGC 



320 



WO 02/086443 

AQGAATGTCA GGCCTCTGAG CCCAGGCCAG GCCATCGCAT CCCCTGTGAC TTGCACGTAT 720 

ACATCCAGAT GGCCTGAAGT AACTGAAGAT CCACAAAAGA AGTAAAAACA GCCTTAACTG 780 

ATGACATTCC ACCATTGTGA TTTGTTCCTG CCCCACCCTA ACTGATCAAT GTACTTTGTA 840 

ATCTCCCCCA CCCTTAAGAA GGTTCTTTGT AATTCTCCCC ACCCTTGAGA ATGTACTTTG 900 

TGAGATCCAC CCCTGCCCAC CAGAGAACAA CCCCCTTTGA TTGTAATTTT TTATTACCTT 960 

CCCAAATCCT ATAAAACAGC CCCACCCCTA TCTTCCTTCA CTGACTCTCT TTTCGGACTC 1020 
AGCCACOGGC ACCCAGGTGA AATAAACAGC TTTATTGCTC AC 

Seq ID NO: 3 59 Protein sequence 
Protein Accession 8: AAA65999 

1 11 21 31 41 51 

I I I I I I 

PKKHLTNFKS DLFGIATEDW RCPIASEVPW TITEAEfcRVT LTVEGKSIPC LIDTGATHST 60 

LPSFQGPVSL APITWGIDG QASKPLKTPP LWCQLGQHSF MHSFLVIPTC PLPIiLGRNIL 120 
TKLSASLTIP GVQLHLIAAL LPNPKPPLCP LTSPQYQPLP QDLPSA 

Seq ID NO: 360 DNA sequence 
Nucleic Acid Accession #i NM_0018S4 
Coding sequence: 162-5582 

1 11 21 31 41 51 

I I I I I I 

AACCATCAAA TTTAGAAGAA AAAGCCCTTT GACTTTTTCC CCCTCTCCCT\ CCCCAATGGC 60 

TGTGT AG CAA A CAT CCCTGG CGATACCTTG GAAAGGACGA AGTTGGT CTG CAGTCGCAAT 120 

TTCGTGGGTT GAGTTCACAG TTGTGAGTGC GGGGCTCGGA GATGGAGCCG TGGTCCTCTA 180 

GGTGGAAAAC GAAACGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 

TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 

TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 

CTAAAGGCTC AGATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 

AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 

CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TGAGCATGGT ATTCAGCAAA 540 

TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCTGTTTGA AGACCACACT GGAAAACCTG 600 

CCCCAGAAGA CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TGGCATCGGG 660 

TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720 

CGAAACCACT TGATAGAAGT GAGAGAGCAA TTGTTGATAC CAATGGAATC ACGGTTTTTG 780 

GAACAAGGAT TTTGGATGAA GAAGTTTTTG AGGGGGACAT TCAGCAGTTT TTGATCACAG 840 

GTGATCCCAA GGCAGCATAT GACTACTGTG AG CATTAT AG TCCAGACTGT GACTCTTCAG 900 

CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA GAGGATATAA 960 

TCGAATATGA CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAACAGAGG 1020 

GACCCACTGT AACTGAGGAG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC 1080 

AAGAATACAA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1140 

GGACAAATGA GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200 

ATTATGATTC CCAGAGGAAA AATTCTGAGG ATACACTATA TGAAAACAAA GAAATAGACG 1260 

GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 

AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380 

CAGAAACTGA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 

AGAAAGGAGA ACCAG CAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 

CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 

ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 

GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1680 

CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1740 

TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 1800 

GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 

AGGGTCCCCC TGGT CCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 

GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 

GTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 

CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 

AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 

GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 

CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 

CAATTGGTCC TCCTGGTGAA AAAGGACCAC AAGGAAAACC AGGACTTGCT GGACTTCCTG 2340 

GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400 

TGGGTCCCCC TGGT CCA CAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 

CAGATGGTGT CAGAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTCCAG 2520 

GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGT CAA ATTGGCCCAA 2580 

GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA GACCCAGGTC 2640 

CTTCAGGTCA AG CAG G AGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700 

GAAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAGA 2760 

AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGGG T CAGCGTGGT CCAACGGGTC 2820 

CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880 

GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 

TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 

ACCCTGGGCA ACGTGGGGAG ACTGGATTTC AAGGCAAGAC CGGCCCTCCT GGGCCAGGGG 3060 

GAGTGGTTGG ACCACAGGGA CCAACOGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 

CTGGTCCTCC TGGCCCTCCT GGTGAGCAAG GTCTTCCTGG TGCTGCAGGA AAAGAAGGTG 3180 

CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 

GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 

AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 

CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 

GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGGAGTTC 3480 

AAGGTCCTGT TGGT CTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 

ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 

GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3660 

ATGGTGAACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 

CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 3780 

GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 
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GAGGCCCTCA 
CAGTTGGTGG 
GGGAAGCAGG 
CTGGAGCTGC 
ACCCGGGTCC 
GTCAAGATGG 
CTGGCCCATC 
CTGCAGGTGC 
GTCCTCCTGG 
AAGGTCTTCG 
AAGATGGACC 
GCTCCAAGGG 
AAGGGGAAAA 
ATGGGGGAAT 
GTCCTCAAGG 
GTGGTCTTCC 
CAATCTTGTC 
ATAATATTCT 
AACAAGACAT 
GTAAAGACCT 
ACCAAGGTTG 
CTTGCATTTA 
AACCAGGAAG 
AAGGAAATTC 
GGCAAAATTT 
GTTATGACAA 
ATCCTTTTAT 
TCATTGAAAT 
ACTTTGGTGA 
AAGATTAAGA 
TTTTGTGCCA 
TACCATTTAG 
ATCATAAAGA 
TTCTCAACTC 
AATATATATT 
TGTGTTTAAT 
CCAAAACTTG 
GATGGCAATA 
TTTCTTTGGT 



AGGTCCCAAT 
TGTTGGAGAA 
TGTAGGCGGT 
TGGACCTCCA 
TGTTGGTTTT 
TGTTGGTGGT 
TGGTGAGGCT 
AGAGGGAAGA 
AAAAACCGGC 
GGGCATCCCT 
ACCTGGTCCT 
TGAAAAGGGA 
AGGTGACCGA 
TCCTGGTCCT 
CCCAAAGGGT 
AGGGCCT CCT 
CTCCAAAAAA 
TGATTACTCG 
CGAGCATATG 
GCAACTCAGC 
CTCAGGAGAT 
TCCAGACAAA 
TTGGTTTAGT 
CATCAATATG 
CACCTACCAC 
AGCACTTCGC 
CAAAACACTG 
CAATACACCA 
TCAGAATCAG 
CAAAGAACAT 
CATGCAAGTT 
GAAATACCGA 
TATAAGTTGG 
TCCTTTTCCT 
CATAAAAAAT 
AAATTGTAAT 
CACGTGTCCC 
ATATATGTAT 
TAATGATGAA 



GGAGCTGATG 
AAGGGTGAAC 
CCCAAAGGAG 
GGTGCCAAGG 
CCTGGAGATC 
GACAAGGGTG 
GGCCCACCAG 
CAAGGTGAAA 
CCAGTCGGTC 
GGTCCTGTGG 
ATGGGACCTC 
CATCCTGGTT 
GGGCTCCCTG 
GCTGGTCCCT 
AACAAAGGCT 
GGGCCTCCAG 
ACGAGAAGAC 
GATGGAATGG 
AAATTTCCAA 
CATCCTGACT 
TCCTTCAAAG 
AAATCTGAGG 
GAATTTAAGA 
GTGCAAATGA 
TGTCATCAGT 
TTCCTGGGAT 
TATGATGGTT 
AAAATTGAtC 
AAGTTCGGAT 
ATCAAATCAA 
TTGAATAAGG 
TGCCTTTGTG 
TGTGGCTAAG 
ATTTGAATTT 
ATGGTGCTCA 
TATTTTGTGT 
TGAATTCCGC 
TATGAAAATG 
ATTCCTTTGT 



GACCACAAGG 
CTGGAGAAGC 
AAAGAGGAGA 
GGCCGCCAGG 
CTGGTCCTCC 
AAGATGGAGA 
GTCCTCCTGG 
AAGGTGCTAA 
CTCAGGGACC 
GAGAACAAGG 
CTGGCTTACC 
TAATTGGCCT 
GAACTCAAGG 
TAGGTCCACC 
CTACTGGACC 
GTCCACCTGG 
ATACTGAAGG 
AAGAAATATT 
TGGGTACTCA 
TCCCAGATGG 
TTTACTGTAA 
GAGTAAGAAT 
GGGGAAAACT 
CATTCCTGAA 
CAGCAGCCTG 
CAAATGATGA 
GTACGTCCAG 
AAGTACCTAT 
TTGAAGTTGG 
CAGAAAATGT 
ATGTATGGAA 
GGGGCAGAAT 
ATGGAAACAG 
CTTTGGTGCT 
TTCTCATCCA 
ACAGTTCTAT 
TGACTCTAAT 
AAGTTATGAT 
GTGTGTTT 



Seq ID NO: 361 Protein sequence 
Protein Accession #: NP_001845 



MEPWSSRWKT 
CTNRKNSKGS 
EHGIQQIGVE 
DCKKKTTKPL 
PDCDSSAPKA 
NIVDDFQEYN 
ENKEIDGRDS 
AYGEKGQKGE 
GADGLPGPPG 
GPVGGPGSSG 
GFDGLPGLPG 
GAPGQPGMAG 
GLAGLPGADG 
GEDGPPGFKG 
GLPGYFGRQG 
GPKGTSGGDG 
GPPGPGGWG 
GPAGLRGFPG 
GPPGPAGEKG 



GLPGPPGEKG 
GNPGPPGEAG 
GELGPAGQDG 
GBAGAEGPPG 
GLKGDPGSKG 
GPPGLPGPQG 
MQ AD AD DN I L 
EYWIDPNQGC 
LSYLDVEGNS 
EMSYDNNPFI 
PVCFLG 



11 
I 

KRWLWDFTVT 
DTAYRVSKQA 
VGRSPVFLPE 
DRSERAIVDT 
AQAQEPQIDE 
YGTMESYQTE 
DLLVDGDLGE 
PAWEPGMLV 
TMLMLPFRYG 
AKGESGDPGP 
DKGHRGERGP 
VDGPPGPKGN 
PPGHPGKEGQ 
DMGLKGDRGE 
PKGSTGFPGF 
PPGPPGERGP 
PQGPTGBTGP 
ERGLPGAQGA 
APGEKGPQGP 
PPGLQGPVGA 
ENGDVGPWGP 
VGGPKGERGE 
VGGDKGEDGD 
KTGPVGPQGP 
EKGHPGLIGIi 
PKGNKGSTGP 
DYSDGMEEIF 
SGDSFKVYCN 
INMVQMTFLK 
KTLYDGCTSR 



21 
I 

TLALTFLFQA 
QLSAPTKQLF 
DHTGKPAPED 
NGITVFGTRI 
YAPEDIIEYD 
APRHVSGTNE 
YDFYEYKEYE 
EGPPGPAGPA 
GDGSKGPTXS 
QGPRGVQGPP 
QGPPGPPGDD 
MGPOGEPGPP 
SGEKGALGP P 
VGQIGPRGXD 
PGANGEKGAR 
QGPQGPVGFP 
IGERGYPGPP 
PGLKGGEGPQ 
AGRDGVQGPV 
PGIAGGDGEP 
PGPPGPRGPQ 
KGEAGPPGAA 
PGQPGPPGPS 
AGKPGPEGLR 
ZGPPGEQGEK 
AGQKGDSGLP 
GSLNSLKQDI 
FTSGGETCIY 
LLTASARQNF 
KGYEKTVIEI 



31 
I 

REVRGAAPVD 
PGGTFPEDF3 
YPLFRTVNIA 
LDEEVFEGDI 
YEYGEAEYKE 
PNPVEEIFTE 
DKPTSPPNEE 
GIMGPPGLQG 
AQEAQAQAIL 



GMRGEDGEIG 
GQQGNPGPQG 
GPQGPIGXPG 
GPEGPKGRAG 
GVAGKPGPRG 
GPKGPPGPPG 
GPPGEQGLPG 
GPPGPVGSPG 
GL PGP AGP AG 
GPRGQQGMFG 
GPNGADGPQG 
GPPGAKGPPG 
GEAGPPGPPG 
GIPGPVGEQG 
GDRGLPGTQG 

GPPGPPGPPG 
EHMKFPMGTQ 
PDKKSEGVRI 
TYHCHQSAAW 
NTPKIDQVPI 



ACCCCCAGGT 
AGGAAACCCA 
GAAAGGGGAA 
TGATGATGGC 
TGGGGAACTT 
TCCTGGTCAA 
AAAACGAGGT 
GGGGGAAGCA 
TGCAGGAAAG 
TCTCCCTGGA 
TGGTCTCAAA 
GATTGGTCCT 
ATCTCCAGGA 
TGGTCCTCCA 
CGCTGGCCAG 
TGAAGTCATT 
CATGCAAGCA 

GACCAATCCA 
TGAATATTGG 
TTTCACATCT 
TTCATCATGG 
GCTTTCATAC 
ACTTCTGACT 
GTATGATGTG 
GGAGATGTCC 
AAAAGGCTAT 
TGTTGATGTC 
TCCTGTTTGT 
ACCTTGGTGC 
AACAACGCTG 
CACAGACAAA 
GGCTGATTCT 
GTAGAAAACA 
TCCAGGATGT 
ACTGTTATCT 
TTATGAGGAT 
TTCCGATGAC 



41 

I 

VLKALDFHNS 
ILFTVKPKKG 
DGKWHRVA I S 
QQFLITGDPK 
AESVTEGPTV 
EYLTGEDYDS 
FGPGVPAETD 
PTGPPGDPGD 
QQARIALRGP 
RPGADGGRGM 
PRGLPGEAGP 
IjPGPOGPXGP 
PRGVKGADGV 
PTGDPGPSGQ 
QRGPTGPRGS 
RMGCPGHPGQ 
AAGKEGAKGD 
ERGSAGTAGP 
SPGEDGDKGE 
QKGDEGARGF 
PPGSVGSVGG 
DDGPKGNPGP 
KRGP PGAAGA 
LPGAAGQDGP 
SPGAKGDGGI 
EVIQPLPILS 
TNPARTCKDb 
SSWPKEKPGS 
YDVSSGSYDK 
VDVMISDFGD 



TCTGTTGGTT 
GGGCCTCCTG 
GCTGGTCCAC 
CCTAAGGGTA 
GGCCCTGCAG 
CCGGGTCCTC 
CCTCCTGGAG 
GGTGCAGAAG 
CCTGGTCCAG 
GCTGCAGGCC 
GGTGACCCTG 
CCAGGAGAAC 
GCAAAAGGGG 
GGCTTACCAG 
AAAGGTGACA 
CAGCCTTTAC 
GATGCAGATG 
AATTCCCTGA 
GCCCGAACTT 
ATTGATCCTA 
GGTGGTGAGA 
CCAAAGGAGA 
TTAGATGTTG 
GCCTCTGCTC 
TCATCAGGAA 
TATGACAATA 
GAAAAAACTG 
ATGATCAGTG 
TTTCTTGGCT 
CACCAACCCA 
CATATACAGG 
AGCTTTGAAA 
TGATTCCCAA 
AAAAAAGAAA 
ACTAAAACAG 
GTGTCCATTT 
GCCGAACTCT 
CCTAAGTCCC 



51 
I 

PEGISKTTGF 
IQSFLLSIYN 
VEKKTVTMIV 
AAYDYCEHYS 
TEETIAQTEA 
QRKNSEDTLY 
ITETSINGHG 
RGPPGRPGLP 
PGPMGLTGRP 
PGEPGAKGDR 
RGIiLGPRGTP 
PGEKGPQGKP 
RGLKGSKGEK 
AGEKGKLGVP 
RGARGPTGKP 
RGETGFQGKT 
PGPQGISGKD 
IGLRGRPGPQ 
IGEPGQKGSK 
PGPPGPIGLQ 
VGEKGEPGEA 
VGFPGDPGPP 
EGRQGEKGAK 
PGPMGPPGLP 
PGPAGPLGPP 
SKKTRRHTEG 
QLSHPDFPDG 
WFSEFKRGKL 
ALRFLGSNDE 
QNQKFGFEVG 



3900 
3960 
4020 
40B0 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4660 
4740 
4800 
4860 
4920 
4960 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



80 



85 



Seq ID NO: 362 DNA sequence 
Nucleic Acid Accession NM_003107 
Coding sequence: 351-1775 



1 11 21 31 41 51 

] | i I I 1 

TTCCCCAGCA TTCGAGAAAC TCCTCTCTAC TTTAGCACGG TCTCCAGACT CAGCCGAGAG 
ACAGCAAACT GCAGCGCGGT GAGAGAGCGA GAGAGAGGGA GAGAGAGACT CTCCAGCCTG 
GGAACTATAA CTCCTCTGCG AGAGGCGGAG AACTCCTTCC CCAAATCTTT TGGGGACTTT 



60 
120 
180 



322 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

TCTCTCTTTA CCCACCTCCG CCCCTGCGAG GAGTTGAGGG 
CGTCTTCCCG TTCGGCGTGT GCTTGGCCCG GGGAACCGGG 
CGGCCGCCGC GAGGGTGTGA GOGCGCGTGG GCGCCCGCCG 
AAACCAACAA TGCCGAGAAC AOGGAAGCGC TGCTGGCCGG 
CCGGCCTCGA GCTGGGAATC GCCTCCTCCC CCACGCCCGG 
GCAAGGCCGA CGACCCGAGC TGGTGCAAGA CCCCGAGTGG 
ACGCCTTCAT GGTGTGGTCG CAGATCGAGC GGCGCAAGAT 
TGCACAACGC CGAGATCTCC AAGCGGCTGG GCAAACGCTG 
ACAAGATCCC TTTCATTCX3A GAGGCGGAGC GGCTGCGCCT 
CCGACTACAA GTACCGGCCC AGGAAGAAGG TGAAGTCCGG 
CGGCCGCCGC CTCCTCCAAG CCGGGGGAGA AGGGAGACAA 
GCGGCCATGG GGGCGGCGGC GGCGGCGGGA GCAGCAACGC 
CGAGTGGCGG CGGCGCCAAC TCCAAACCGG CGCAGAAAAA 
CGGGCGGCGC GGGCGGTGGG GTTAGCAAAC CGCACGCCAA 
GCGGCGGCGG GAAAGCAGCG GCTGCCGCCG CCGCCTCCTT 
CCGCCGCCCT GCTGCCCCTG GGCGCCGCCG CCGACCACCA 
CTCCCAGCGC CTCGGCCTCC GCCTCCTCGG CAGCCTCGGC 
CGGGCAAGCA CCTGGCGGAG AAGAAGGTGA AGCGCGTCTA 
CGTCGTCGTC GCCCGTGGGC GGCGTGGGCG CGGGAGCCGA 
TGTACGAGGA GGAGGGCGCG GGCTGCTCGC CCGACGCGCC 
GCGCCGCCTC GTCCCCCGCC GCCGGCCGCT CGCCCGCCGA 
TGCGCGCCGC CTCGCCCGCC CCGTCCAGCG CGCCCTCGCA 
CCCACTCCTC CTCTTCCTCC TCCTCGGGCT CCTCGTCCTC 
ACCTGCTCGA CCTGAACCCC AGCTCAAACT TTGAGAGCAT 
CGTCGTCGGC GCTCGACCGG GACCTGGATT TTAACTTCGA 
TCGAGTTCCC GGACTACTGC ACGCCCGAGG TGAGCGAGAT 
AGTCCAGCAT CTCCAACCTG GTTTTCACCT ACTGAAGGGC 
CGGGGGGGGT AGGAGAGGAG" AAAAAAAAAG TGAAAAAAAG 
GAGTTTAAAG AGAAAAGGGA AAAAAGAAAG AAAAAGTAAG 
TCTCGTCGTC GGATCAAGGA GCGCGGCGGC GTTTTGGACC 
TCCCGGGCCG GGGACCCACT CTGCCCAGCC GGAGGGACGC 
AGGGGCGACC TGTGATTGTT GTTATTGATG TTGTTGTTGA 
rrCGAGTTTG CTCCCCTTTG CTTGAAGAGA CCCCCTCCCC 
TTGTCTGCAC CCCCAGCAAG AAGGCGAGTT AGTTTTCTAG 
CTTCCTGCAT CACCACCTTG GTTTTGTTTT ATTTTGCTTC 
GAACCCAGCG CACCCCTCCC CCCCTTTTTT TAAACGCGTG 
GGTGACGAAT TTGGCCGATG GCAGATGTTT TGGGGGAACG 
GCAGGCGAAT TCCCGTTTGG GGCCTTTTTT TCCTCCCTCT 
AGCCGGAGGA GGAGATGTTG AGGGGAGGAG GCCAGCCAGT 
ACCCGAGAAC CCCGTTGGAA GCGCAGCAGC GGGAGCTAGG 
CGAACTGGAA GGGGGTTCAC GGTCAAACTG AAATGGATTT 
CGGCGGCTGC TGGGCCTCCG CCTTCTTTTC TACGTGAAAT 
GACCCCGGAG GCGTGGAGGA GAGGAGACTG TTTGATGTGG 
GGCGAGTGGT TTCGGAAAAA AAAAAAGAAA AAAAGGG 

3eq ID NO: 363 Protein sequence 
Protein Accession ft: NP 003098 



PCT7US02/12476 



i 
I 

MVQQTNNAEN 
RPMNAFMVWS 
ADYPDYKYRP 
GGGASGGGAN 
QAGAAALLPL 
GLGTSSSPVG 
YASLRAASPA 
SFSSSSALDR 



11 
I 

TEALLAGESS 
QIERRKIMEQ 
RKKVKSGNAN 
SKPAQKKSCG 
GAAADHHSLY 
GVGAGADPSD 
PSSAPSHASS 
DLDFNPEPGS 



21 
I • 

DSGAGLELGI 
SPDMHNAEIS 
SSSSAAASSK 
SKVAGGAGGG 
KARTPSASAS 
PLGLYEEEGA 



31 
I 

ASSPTPGSTA 
KRLGKRWKLL 
PGEKGDKVGG 
VSKPHAKLIti 
ASSAASASAA 
GCSPDAPSLS 



GSHPEFPDYC TPEVSEMISG 



Seq ID NO: 364 DNA sequence 
Nucleic Acid Accession #: U10860 
Coding sequence: 123-2204 



1 
I 

TGCCGGCTGC 
GGCACCCTCC 
CGATGGCTCT 
GCCACCACCA 
TCATAGACCG 
CAGCATTTGC 
CTGTGTATGC 
TTCTTGGAAT 
AAAAAAGTGT 
TCAGGGGCCT 
TAGCTGATGG 
AATCTAAAAA 
AAGTAATACT 
AGAACAGAGA 
TGGTTTTACT 
TGAACCAAGA 
GCCAGTCTGT 
CTCATTCTTT 
GGAAAAGAAT 
TTGGGGATAC 
AGGAGGTTTT 



11 
I 

TCCTCGACCA 
CGCCCCGTCT 
GTGCAACGGA 
CTATGAAGGA 
AAGAGTGAGG 
TATAAAGGAA 
TGAAGATGCT 
TTGCTATGGT 
CAGAGAAGAT 
TCAGAAGGAA 
ATTCAAGGTT 
GTTATATGGA 
GAAGAATTTC 
ACTTGAGTGT 
CAGTGGTGGA 
ACAAGTCATT 
TGAAGAGGCC 
CTACAATGGA 
TAGCAAAACG 
TTTTGTTAAG 
CCTTGCCCAA 



21 

I 

GGCCTCCTTC 
CGTACTGTCG 
GACTCCAAGC 
GCTGTTGTCA 
GAACTGTTCG 
CAAGGATTCC 
CCCTGGTTTG 
ATGCAGATGA 
GGAGTTTTCA 
GAAGTTGTTT 
GTGGCACGTT 
GCACAGTTCC 
CTTTATGATA 
ATTCGAGAGA 
GTAGACTCAA 
GCTGTGCACA 
CTCAAAAAGC 
ACAACAACCC 
TTAAATATGA 
ATTGCCAATG 
GGTACTTTAC 



31 

I 

TCAACCTCAG 
CCGTCACCGC 
TGGAGAATGC 
TTCTGGATGC 
TGCAGTCTGA 
GTGCTATTAT 
ATCCAGCAAT 
TGAATAAGGT 
ACATTAGTGT 
TGCTTACACA 
CTGGAAACAT 
ACCCTGAAGT 
TAGCTGGATG 
TCAAAGAGAG 
CAGTTTGTAC 
TTGATAATGG 
TTGGAATTCA 
TACCAATATC 
CCACAAGTCC 
AAGTAATTGG 
GGCCTGATCT 



GCCAGTTCGG 
AGGGCCCGGC 
AG CCGAGGCC 
CGAGAGCTCG 
CTCCACCGCC 
GCACATCAAG 
CATGGAGCAG 
GAAGCTGCTC 
CAAGCACATG 
CAACGCCAAC 
GGTCGGTGGC 
GGGGGGAGGA 
GAGCTGCGGC 
GCTCATCCTG 
CGCCGCCGAA 
CTCGCTGTAC 
CTCCGCAGCG 
CCTGTTCGGC 
CCCCAGCGAC 
CAGCCTGAGC 
CCACCGCGGC 
CGCGTCCTCC 
CGACGACGAG 
GTCCCTGGGC 

GATCTCGGGA 
GCGCAGGCAG 
AAACGAAAAG 
CAGGGCTCGT 
CGCGCTCCCA 
GGAGGAGGAA 
TGGCAAAAAA 
CTTCCAACGA 
AGACTTGAAG 
TTGGTCAAGA 
ATGAAGACAG 
CCGGGACTGA 
TTTCCCCTTG 
GTGACCGGCG 
GGCGGGGGCG 
GCACGTTGGG 
CAGTGAGGTG 
TACAGGGGCA 



41 

I 

STGGKADDPS 
KDSDKIPFIR 
SGGGGHGGGG 
AGGGGGGKAA 
LAAPGKHLAE 
GRSSAASSPA 
FEDDLLDLNP 
DWLESSISNL 



41 

I 

CCCGCGGCGC 
CGCGGCTCCG 
TGGAGGAGAC 
TGGTGCTCAG 
AATTTTCCCC 
CATCTCTGGA 
ATTCACTATT 
ATTTGGAGGT 
GGATAATACA 
TGGAGATAGT 
AGTAGCAGGC 
TGGCCTTACA 
CAGTGGAACC 
AGTAGGCACG 
AGCTTTGCTA 
CTTTATGAGA 
GGTCAAAGTG 
AGATGAAGAT 
TGAAGAGAAA 
AGAAATGAAC 
AATTGAAAGT 



CCGCCGCGCG 
GATCGCGCGG 
ATGGTGCAGC 
GACTCGGGCG 
TCCACGGGCG 
CGACCCATGA 
TCGCCCGACA 
AAAGACAGCG 
GCTGACTACC 
TCCAGCTCCT 
AGTGGCGGGG 
GGCGGCGGTG 
TCCAAAGTGG 
GCAGGCGGCG 
CAGGCGGGGG 
AAGGCGCGGA 
CTCGCGGCCC 
GGCCTGGGCA 
CCCCTGGGCC 
GGCCGCAGCA 
TACGCCAGCC 
TCGGCCTCGT 
TTCGAAGACG 
AGCTTCAGTT 
GGCTCGCACT 
GACTGGCTCG 
GGAGAAGGGC 
GACAGACGAA 
TCGCCCGCGT 
TCCCCCACCT 
GAGGGTAGAC 
AAAAAGCGAC 
GCTTCCGGAC 
GAGTCTCCCC 

AAGGCTCCGG 
GAGACTCCAC 
CCCCCTCTGC 
CTAGGAAATG 
GAGGAGGACA 
GAGCTGGCGG 
AGACTTCCCA 
GTCAGTGGAG 



51 
I 

WCKTPSGHIK 
EAERLRLKHM 
GGGSSNAGGG 
AAAAASFAAE 
KKVKRVYLFG 
AGRSPADHRG 



VFTY 



51 
I 

CGACCCTTCC 
GCCCTGGCCC 
CTTAAGGATG 
TACGGGAAAG 
TTGGAAACAC 
GGACCTAATT 
GGCAAGCCTG 
ACTGTGCACA 
TGTTCATTAT 
GTAGACAAAG 
ATAGCAAATG 
GAAAATGGAA 
TTCACCGTGC 
TCAAAAGTTT 
AATCGTGCTT 
AAACGAGAAA 
ATAAATGCTG 
AGAACCCCAC 
AGAAAAATCA 
TTGAAACCAG 
GCATCCCTTG 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



323 



WO 02/086443 



PCT/US02/12476 



5 

10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



TTGCAAGTGQ 
AGTTGAGAGA 
GAATTTTGGG 
GTCCTGGCCT 
CTGAAACCAA 
ATACCCTATT 
AAATTACCAG 
AGGGTGACTG 
GGGAATCACT 
TTGTTTATAT 
TGACAACAGG 
GGGAGTCTGG 
TTGATCGGGA 
TTATTACTAG 
AGGTGGTATT 
ATGACTTAAC 



CAAAGCTGAA 
GGAGGGAAAA 
CAGAGAACTT 
GG CAATCAG A 
CAATATTTTG 
ACAGAGAGTC 
TCTGCATTCA 
TCGTTCCTAC 
TATTTTTCTG 
ATTTGGCCCA 
GGTGCTCAGT 
GTATGCTGGG 
CCCACTTCAA 
TGACTTCATG 
AAAGATGGTC 
ATCAAAGCCC 



CTCATCAAAA 
GTAATAGAAC 
GGACTTCCAG 
GTAATATGTG 
AAAATAGTAG 
AAAGCCTGCA 
CTGAATGCCT 
AGTTACGTGT 
GCTAGGCTTA 
CCAGTTAAAG 
■ACTTTACGCC 
AAAATCAGCC 
AAGCAGCCTT 
ACTGGTATAC 
ACTGAGATTA 
CCAGGAACTA 



CCCATCACAA 
CTCTGAAAGA 
AAGAGTTAGT 
CTGAAGAACC 
CTGATTTTTC 
CAACAGAAGA 
TCTTGCTGCC 
GTGGAATCTC 
TACCTCGCAT 
AACCTCCTAC 
AAGCTGATTT 
AGATGCCGGT 
CATGCCAGAG 
CTGCAACACC 
AGAAGATTCC 
CTGAGTGGGA 



Seq ID NO: 365 Protein sequence 
Protein Accession ft: AAA60331 



1 
I 

MALCNGDSKI* 
AFAIKEQGFR 
KSVREDGVFN 
SKKLYGAQFH 
VLLSGGVDST 
HSFYNGTTTL 
EVFLAQGTLR 
ILGRELGLPE 
TLLQRVKACT 
BSLI FIiARLI 
ESGYAGKISQ 
WLKMVTEIK 



11 
I 

ENAGGDLKDG 
AIIISGGPNS 
ISVDNTCSLF 
PEVGLTENGK 
VCTALLNRAL 
PI 9DEDRTPR 
PDLIESASL.V 
ELVSRHPFPG 
TEEDQEKLMQ 
PRMCHMVNRV 
MPVILTPLHF 
KIPGISRIMY 



21 
I 

HHHYEGAWI 
VYAEDAPWFD 
RGLQKEEWL 
VILKNFLYDI 
NQEQVIAVHI 
KRISKTLNMT 
ASGKAELIKT 
PGLAIRVICA 
ITSLHSLNAF 
VYIFGPPVKE 
DRDPLQKQPS 
DLTSKPPGTT 



31 

I 

liDAGAQYGKV 
PAIFTIGKPV 
1/IHGDSVDKV 
AGCSGTFTVQ 
DNGFMRKRES 
TSPEEKRKII 
HHNDTELIRK 
EEPYICKDFP 
LLPIKTVGVQ 
PPTDVTPTFL 
CQRSWIRTP 
EWE 



TGACACAGAG 
TTTTCATAAA 
TTCCAGGCAT 
TTATATTTGT 
TGCAAGTGTT 
GGATCAGGAG 
AATTAAAACT 
CAGTAAAGAT 
GTGTCACAAC 
AGATGTTACT 
TGAGGCCCAT 
GATTTTGACA 
ATCTGTGGTT 
TGGCAATGAG 
TGGTATTTCT 
GTAATAAACT 



41 

I 

IDRRVRBLFV 
LGICYGMQMM 
ADGFKWARS 
NRELECIREI 
QSVEEALKKL 
GDTFVKIANE 
LREEGKVIEP 
ETNNILKIVA 
GDCRSYSYVC 
TTGVLSTLRQ 
ITSDFMTGIP 



CTCATCAGAA 
GATGAAGTGA 
CCATTTCCAG 
AAGGACTTTC 
AAAAAGCCAC 
AAGCTGATGC 
GTAGGTGTGC 
GAACCTGACT 
GTTAACAGAG 
CCCACTTTCT 
AACATTCTCA 
CCATTACATT 
ATTCGAACCT 
ATCCCTGTAG 
CGAATTATGT 
TC 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 



51 
I 

QSEIFPLETP 
NKVFGGTVHK 
GNIVAGIANE 
KERVGTSKVL 
GIQVKVINAA 
VIGEMNLKPE 
LKDFHKDEVR 
DFSASVKKPH 
GISSKDEPDW 
ADFEAHNILR 
ATPGNEIPVE 



Seq ZD NO: 366 DNA sequence 
Nucleic Acid Accession #: NM_004219 
Coding sequence i 46-654 



1 

I 

GCGGCCTCAG 
TATGTTGATA 
CTGGGGTCTG 
TTTGGCAAAA 
ACTGTCAACA 
CCAAGCTTTT 
GCCTCAGATG 
GAGAGTTTTG 
CTCATGATCC 
CCTGTGAAGA 
CTGTCGACCC 
TAGTGCTTCA 



11 
I 

ATGAATGCGG 
AGGAAAATGG 
GACCTTCAAT 
CGTTCGATGC 
GAGCTACAGA 
CTGCCAAAAA 
ATGCCTATCC 
ACCTGCCTGA 
TTGACGAGGA 
TGCCCTCTCC 
TGGATGTTGA 
GAGTTTGTGT 



21 
I 

CTGTTAAGAC 
AGAACCAGGC 
CAAAGCCTTA 
CCCACCAGCC 
AAAGTCTGTA 
GATGACTGAG 
AGAAATAGAA 
AGAGCACCAG 
GAGAGAGCTT 
ACCATGGGAA 
ATTGCCACCT 
GTATTTGTAT 



31 

I 

CTGCAATAAT 
ACCCGTGTGG 
GATGGGAGAT 
TTACCTAAAG 
AAGACCAAGG 
AAGACTGTTA 
AAATTCTTTC 
ATTGOGCACC 
GAAAAGCTGT 
TCCAATCTGT 
GTTTGCTGTG 
TAATAAAGCA 



41 
I 

CCAGAATGGC 
TTGCTAAGGA 
CTCAAGTTTC 
CTACTAGAAA 
GACCCCTCAA 
AAGCAAAAAG 
CCTTCAATCC 
TCCCCTTGAG 
TTCAGCTGGG 
TGCAGTCTCC 
ACATAGATAT 
TTCTTCAACA 



SI 
I 

TACTCTGATC 
TGGGCTGAAG 
AACACCACGT 
GGCTTTGGGA 
ACAAAAACAG 
CTCTGTTCCT 
TCTAGACTTT 
TGGAGTGCCT 
CCCCCCTTCA 
TTCAAGCATT 
TTAAATTTCT 
GAAAAAAAAA 



Seq ID NOt 367 Protein sequence 
Protein Accession #j np_004210 

1 11 21 31 41 51 

I I I ! 1 I 

MATLIYVDKE NGEPGTRWA KDGLKLGSGP SIKALDGRSQ VSTPRFGKTF DAPPALPKAT 

RKALGTVNRA TEKSVKTKGP LKQKQPSFSA KKMTEKTVKA KSSVPASDDA YPEIEKFFPF 

NPLDFESFDL PEBHQIAHLP LSGVPLMILD EERELEKLFQ LGPPSPVKMP SPPWESNLLQ 
SPSSILSTLD VELPPVCCDI DI 



Seq ID NO j 368 DNA sequence 
Nucleic Acid Accession #: NM_000597 
Coding sequence i 118-1104 



ATTCGGGGCG 
CCTGCCCGCC 
CTGCCGAGAG 
CCGCTGCTGC 
CTGTTCCGCT 
GCGCCGCCCG 
GTCCGGGAGC 
GGCGTCTACA 
CTGCCCCTGC 
TATGGCGCCA 
GTGGAGAACC 
AAGCCCCTCA 
CACCGGCAGA 
CGACCACCCC 



11 

I 

AGGGAGGAGG 

TGGGCTGCCC 
TGCTGCTACT 
GCCCGCCCTG 
CCGCGGTGGC 
CGGGCTGCGG 
CCCCGCGCTG 
AGGCGCTGGT 
GCCCGGAGCA 
ACGTGGACAG 
AGTCGGGTAT 
TGGGCAAGGG 
CTGCCAGGAC 



21 
1 

AAGAAGCGGA 
CTCGCTCGCC 
CGCGCTGCCG 
GGGCGCGAGT 
CACACCCGAG 
CGCAGTGGCC 
CTGCTGCTCG 
CGGCCAGGGG 
CATGGGCGAG 
GGTTGCAGAC 
CACCATGAAC 
GAAGGAGCTG 
TGGCAAGCAT 
TCCCTGCCAA 



31 
I 

GGAGGCGGCT 
CGCCGCGCCG 
CTGCCGCCGC 
GGCGGCGGCG 
CGCCTGGCCG 
GGAGGCGCCC 
GTGTGCGCCC 
CTGCGCTGCT 
GGCACTTGTG 
AATGGCGATG 
ATGTTGGGCG 
GCCGTGTTCC 
CACCTTGGCC 
CAGGAACTGG 



41 

I 

CCCGCTCGCA 
CGCTGCCGAC 
CGCCGCTGCT 
GCGGGGCGCG 
CCTGCGGGCC 
GCATGCCATG 
GG CTGGAGGG 
ATCCCCACCC 
AGAAGCGCCG 
ACCACTCAGA 
GGGGAGGCAG 
GGGAGAAGGT 
TGGAGGAGCC 
ACCAGGTCCT 



51 

I 

GGGCCGTGCA 
CGCCAGCATG 
GCCGCTGCTG 
CGCGGAGGTG 
CCCGCCGGTT 
CGCGGAGCTC 
CGAGGCGTGC 
GGGCTCCGAG 
GGACGCCGAG 
AGGAGGCCTG 
TGCTGGCCGG 
CACTGAGCAG 
CAAGAAGCTG 
GGAGCGGATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
160 
240 
300 
360 
420 
480 
540 
•600 
660 
720 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



324 



10 



WO 02/086443 

TCCACCATGC GCCTTCCGGA TGAGCGGGGC CCTCTOGAGC ACCTCTACTC CCTGCACATC 900 

CCCAACTGTG ACAAGCATGG CCTGTACAAC CTCAAACAGT GCAAGATGTC TCTGAACGGG 960 

CAGCGTGGGG AGTGCTGGTG TGTGAACCCC AACACCGGGA AGCTGATCCA GGGAGCCCCC 1020 

ACCATCCGGG GGGACCCCGA GTGTCATCTC TTCTACAATG AGCAGCAGGA GGCTTGCGGG 10 BO 

GTGCACACCC AGCGGATGCA GTAGACCGCA GCCAGCCGGT GCCTGGCGCC CCTGCCCCCC 1140 

GCCCCTCTCC AAACACCGGC AGAAAACGGA GAGTGCTTGG GTGGTGGGTG CTGGAGGATT 1200 

TTCCAGTTCT GACACACGTA TTTATATTTG GAAAGAGACC AGCACCGAGC TCGGCACCTC 1260 

CCCGGCCTCT CTCTTCCCAG CTGCAGATGC CACACCTGCT CCTTCTTGCT TTCCCCGGGG 1320 

GAGGAAGGGG GTTGTGGTCG GGGAGCTGGG GTACAGGTTT GGGGAGGGGG AAGAGAAATT 1380 
TTTATTTTTG AACCCCTGTG TCCCTTTTGC ATAAGATTAA AGGAAGGAAA AGT 



PCT/US02/12476 



Seg ID NO: 369 Protein sequence 
Protein Accession tii NP_000588 



15 i ii 
1 I 

MLPRVGCPAL PLPPPPLLPL 
VAPPAAVAAV AGGARMPCAE 
ELPIiQALVMG EGTCEKRRDA 
20 RKPLKSGMKE UWFREKVTE 
I STMRL PDER GPLEHLYSLH 
PTIRGDPECH LFYNEQQEAC 



21 31 41 51 

1111 

LPLLLLiLLGA SGGGGGARAE VI.FRCPPCTP ERLAACGPPP 60 

LVREPGCGCC SVCARLEGEA CGVYTPRCGQ GLRCYPHPGS 120 

EYGASPEQVA DNGDDHSEGG LVENHVDSTM NMLGGGGSAG 180 

QHRQMGKGGK HHLGIiEEPKK LRPPPARTPC QQELDQVLER 240 

IPNCDKHGLY NLKQCKMSLN GQRGECWCVN PNTGKLIQGA 3 00 
GVHTQRMQ 



Seq ID NO: 370 DNA sequence 
25 Nucleic Acid Accession ft: NM_004264 
Coding sequence: 6-440 

. 1 11 21 31 41 31 

30 GGAACATGGC GGATCGGCTC ACGCAGCTTC AGGACGCTGT GAATTCGCTT GCAGATCAGT 60 

TTTGTAATGC CATTGGAGTA TTGCAGCAAT GTGGTCCTCC TGCCTCTTTC AATAATATTC 120 

AGACAGCAAT TAACAAAGAC CAGCCAGCTA ACCCTACAGA AGAGTATGCC CAGCTTTTTG 180 

CAGCACTGAT TGCACGAACA GCAAAAGACA TTGATGTTTT GATAGATTCC TTACCCAGTG 240 

AAGAATCTAC AGCTGCTTTA CAGGCTGCTA GCTTGTATAA GCTAGAAGAA GAAAACCATG 300 

35 AAGCTGCTAC ATGTGTGGAG GATGTTGTTT ATCGAGGAGA CATGCTTCTG GAGAAGATAC 360 

AAAGCGCACT TGCTGATATT GCACAGTCAC AGCTGAAGAC AAGAAGTGGT ACCCATAGCC 420 

AGTCTCTTCC AGACTCATAG CATCAGTGGA TACCATGTGG CTGAGAAAAG AACTGTTTGA 480 

GTGCCATTAA GAATTCTGCA TCAGACTTAG ATACAAGCCT TACCAACAAT TACAGAAACA 540 

TTAAACACTA TGACACATTA CCTTTTTAGC TATTTTTAAT AGTCTTCTAT TTTCACTCTT 600 

40 GATAAGCTTA TAAATCATGA TTGAATCAGC TTTAAAGCAT CATACCATCA TTTTTTAACT 660 

GAGTGAAATT ATTAAGG CAT GTAATACATT AATGAACATA ATATAAGGAA ACATATGTAA 720 

AATTCTGTTA TGACATAATT TATGTCTCCA TTTTGTTGTA TTGGCCAGTA CTTTTACAAT 780 
C 

45 Seq ID NO : 371 Protein sequence 

Protein Accession tt; NP_004255 

1 11 21 31 41 51 

50 MADRLTQLQD AVNSLADQPC NAIGVLQQCG PPASFNNIQT AINKDQPANP TEEYAQLFAA 60 
It I ARTAKD ID VLIDSLPSEE STAALQAASL YKLEEBNHEA ATCVEDWYR GDMLLEKIQS 120 
ALADIAQSQL KTRSGTHSQS LPDS 

Seq ID NOi 372 DNA sequence 
55 Nucleic Acid Accession #: AJ271091 
Coding sequence: 1-1113 



1 11 21 31 41 51 

*A I I 1 1 1 1 

OU ATGGAGAATC AGGTGTTGAC GCCGCATGTC TACTGGGCTC AGCGACACCG CGAGCTATAT 60 

CTGCGCGTGG AGCTGAGTGA CGTACAGAAC CCTGCCATCA GCATCACTGA AAACGTGCTG 120 

CATTTCAAAG CTCAAGGACA TGGTGCCAAA GGAGACAATG TCTATGAATT TCACCTGGAG 180 

TTCTTAGACC TTGTGAAACC AGAGCCTGTT TACAAACTGA CCCAGAGGCA GGTAAACATT 240 

ACAGTACAGA AGAAAGTGAG TCAGTGGTGG GAGAGACTCA CAAAGCAGGA AAAGCGACCA 300 

65 CTGTTTTTGG CTCCTGACTT TGATCGTTGG CTGGATGAAT CTGATGCGGA AATGGAGCTC 360 

AGAGCTAAGG AAGAAGAGCG CCTAAATAAA CTCCGACTGG AAAGCGAAGG CTCTCCTGAA 420 

ACTCTTACAA ACTTAAGGAA AGGATACCTG TTTATGTATA ATCTTGTGCA ATTCTTGGGA 480 

TTCTCCTGGA TCTTTGTCAA CCTGACTGTG CGATTCTGTA TCTTGGGAAA AGAGTCCTTT 540 

TATGACACAT TCCATACTGT GGCTGACATG ATGTATTTCT GCCAGATGCT GGCAGTTGTG 600 

70 GAAACTATCA ATGCAGCAAT TGGAGTCACT ACGTCACCGG TGCTGCCTTC TCTGATCCAG 660 

CTTCTTGGAA GAAATTTTAT TTTGTTTATC ATCTTTGGCA CCATGGAAGA AATGCAGAAC 720 

AAAGCTGTGG TTTTCTTTGT GTTTTATTTG TGGAGTGCAA TTGAAATTTT CAGGTACTCT 780 

TTCTACATGC TGACGTGCAT TGACATGGAT TGGAAGGTGC TCACATGGCT TCGTTACACT 840 

CTGTGGATTC CCTTATATCC ACTGGGATGT TTGGCGGAAG CTGTCTCAGT GATTCAGTCC 900 

75 ATTCCAATAT TCAATGAGAC CGGACGATTC AGTTTCACAT TGCCATATCC AGTGAAAATC 960 

AAAGTTAGAT TTTCCTTTTT TCTTCAGATT TATCTTATAA TGATATTTTT AGGTTTATAC 1020 

ATAAATTTTC GTCACCTTTA TAAACAGCGC AGACTGAAAA TGAGGGCAGG CGCAGTGGCT 1080 
CATGCCTGTG ATCCCAGCGC TTTGGGAGGC TGA 

80 Seq ID NO: 3 73 Protein sequence 
Protein Accession #: CAB69070 

1 11 21 31 41 51 

I I 1 I I 1 

OJ MENQVLTPHV YWAQRHRELY LRVELSDVQN PAISITENVL HPKAQGHGAK GDNVYEFHLE 60 

PLDLVKPEPV YKLTQRQVNI TVQKKVSQWW ERLTKQEKRP LPItAPDFDRW LDESDAEMEL 120 

RAKEEERLNK LRLESEGSPE TLTNLRKGYL FMYNLVQFLG FSWIFVNLTV RFCILGKESF 180 
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YDTFHTVADM MYFCQMLAW ETINAAIGVT TSFVLPSL1Q LLGRNFILFI IFGTMEEMQN 
KAWPFVFYL WSAIEIFRYS PYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LAEAVSVIQS 
IPIFNETGRP SFTLPYPVKI KVRFSFFLQI YLIMIFLGLY INFRHLYKQR RLKMRAGAVA 
KACDPSALGG 



PCT/US02/12476 



240 
300 
360 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO j 374 DNA sequence 
Nucleic Acid Accession NM_016395 
Coding sequence: 1-1113 



1 
I 

ATGGAGAATC 
CTGCGCGTGG 
CATTTCAAAG 
TTCTTAGACC 
ACAGTACAGA 
CTGTTTTTGG 
AGAGCTAAGG 
ACTCTTACAA 
TTCTCCTGGA 
TATGACACAT 
GAAACTATCA 
CTTCTTGGAA 
AAAGCTGTGG 
TTCTACATGC 
CTGTGGATTC 
ATTCCAATAT 
AAAGTTAGAT 
ATAAATTTTC 
CATGCCTGTG 



11 
I 

AGGTGTTGAC 
AG CTGAGTGA 
CTCAAGGACA 
TTGTGAAACC 
AGAAAGTGAG 
CTCCTGACTT 
AAGAAGAGCG 
ACTTAAGGAA 
TCTTTGTCAA 
TCCATACTGT 
ATGCAGCAAT 
GAAATTTTAT 
TTTTCTTTGT 
TGACGTGCAT 
CCTTATATCC 
TCAATGAGAC 
TTTCCTTTTT 
GTCACCTTTA 
ATCCCAGCGC 



21 

I 

GCCGCATGTC 
CGTACAGAAC 
TGGTGCCAAA 
AGAGCCTGTT 
TCAGTGGTGG 
TGATCGTTGG 
CCTAAATAAA 
AGGATACCTG 
CCTGACTGTG 
GGCTGACATG 
TGGAGTCACT 
TTTGTTTATC 
GTTTTATTTG 
TGACATGGAT 
ACTGGGATGT 
CGGACGATTC 
TCTTCAGATT 
TAAACAGCGC 
TTTGGGAGGC 



31 

I 

TACTGGGCTC 
CCTGCCATCA 
GGAGACAATG 
TACAAACTGA 
GAGAGACTCA 
CTGGATGAAT 
CTCCGACTGG 
TTTATGTATA 
CGATTCTGTA 
ATGTATTTCT 
ACGTCACCGG 
ATCTTTGGCA 
TGGAGTGCAA 
TGGAAGGTGC 
TTGGCGGAAG 
AGTTTCACAT 
TATCTTATAA 
AGACTGAAAA 
TGA 



41 

I 

AGCGACACCG 
GCATCACTGA 
TCTATGAATT 
CCCAGAGGCA 
CAAAGCAGGA 
CTGATGCGGA 
AAAGCGAAGG 
ATCTTGTGCA 
TCTTGGGAAA 
GCCAGATGCT 
TGCTGCCTTC 
CCATGGAAGA 
TTGAAATTTT 
TCACATGGCT 
CTGTCTCAGT 
TGCCATATCC 
TGATATTTTT 
TGAGGGCAGG 



51 

I 

CGAGCTATAT 
AAACGTGCTG 
TCACCTGGAG 
GGTAAACATT 
AAAGCGACCA 
AATGGAGCTC 
CTCTCCTGAA 
ATTCTTGGGA 
AGAGTCCTTT 
GGCAGTTGTG 
TCTGATCCAG 
AATGCAGAAC 
CAGGTACTCT 
TCGTTACACT 
GATTCAGTCC 
AGTGAAAATC 
AGGTTTATAC 
CGCAGTGGCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NO t 375 Protein sequence 
Protein Accession ft: NP_057479 



1 
1 

MENQVLTPHV 
FLDLVKPEPV 
RAKEEERLNK 
YDTFHTVADM 
KAWPFVFYL 
IPIFNETGRF 
STKKKDLDGF 



11 

I 

YWAQRHRELY 
YKLTQRQVNI 
LRLESEGSPE 
MYFCQMLAW 
WSAIEIFRYS 
SFTLPYPVKI 
LPV 



21 

I 

LRVELSDVQN 
TVQKKVSQWW 
TLTNLRKGYL 
ETINAAIGVT 
PYMLTCIDMD 
KVRFSFFLQI 



31 
I 

PAISITENVL 
ERLTKQEKRP 
PMYNLVQFLG 
TSPVLPSLIQ 
WKVLTWLRYT 
YLIMIFLGLY 



41 
I 

HPKAQGHGAK 
LFLAPDFDRW 
FSWIFVNLTV 
LLGRNFILPI 
LWIPLYPLGC 
INFRHLYKQR 



51 
I 

GDNVYEFHLE 
LDESDAEMEL 
RFCILGKESF 
IFGTMEEMQN 
LVEAVSVIQS 
RRRYGKKRKR 



Seq ID NO i 376 DNA sequence 
Nucleic Acid Accession ft: NM_005987 
Coding sequence: 1-270 



11 



41 



51 



21 31 

I I I I 

ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA GCAGCAGCAG 
GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC CAAGGAGCCC 
TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG CCAGCCCAAG 
ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT CACTCCAGCA 
CCAGCCCAGC AGAAGACCAA GCAGAAGTAA 

Seq ID NO: 377 Protein sequence 
Protein Accession #« NP_005978 



1 11 21 31 41 51 

| 1 I I I I 

MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 



Seq ID NO: 378 DNA sequence 
Nucleic Acid Accession ft: NM_002105 
Coding sequence: 74-505 



ACAGCAGTTA 
CTACCTCGCT 
GTCGCGCTCG 
GAAGGGCCAC 
GGAGTACCTC 
GACGCGAATC 
GCTGCTGGGC 
GCTGCCCAAG 
CACCCAGGCC 
CATGCCACCA 
CTTCAGACTG 
TCGCCGCCCG 
CGGCCTCGGG 
CGGCTTGGGC 



11 
I 

CACTGCGGCG 
AGCATGTCGG 
TCGCGCGCCG 
TACGCCGAGC 
ACCGCTGAGA 
ATCCCCCGCC 
GGCGTGACGA 
AAGACCAGCG 
TCCCAGGAGT 
CAAAGGCCCT 
CGGGGCAAGC 
GCCTCGAGTC 
CCTGCCCTGT 
GCTCTTCGGG 



21 
1 

GGCGTCTGTT 
GCCGCGGCAA 
GCCTCCAGTT 
GCGTTGGCGC 
TCCTGGAGCT 
ACCTGCAGCT 
TCGCCCAGGG 
CCACCGTGGG 
ACTAAGAGGG 
TTTAAGGGCC 
GGG CCGCGGC 
CCCGCCCGCC 
CCGCCGTCCG 
GACCTCCGTG 



31 

I 

CTAGTGTTTG 
GACTGGCGGC 
CCCAGTGGGC 
CGGCGCGCCA 
GGCGGGCAAT 
GGCCATCCGC 
AGGCGTCCTG 
GCCGAAGGCG 
CCCGCGCCGC 
ACCACCGCCC 
TCCCTTCCCC 
CCCGCTCCCG 
CCCTCCGGTA 
GCGCGGAAGA 



41 
I 

AGCCGTCGTG 
AAGGCCCGCG 
CGTGTACACC 
GTGTACCTGG 
GCGGCCCGCG 
AACGACGAGG 
CCCAACATCC 
CCCTCGGGCG 
GGCCGGCCGC 
TCATGGAAAG 
TCCCCTCCCC 
TCCCGCACCG 
GGGTTCGGGC 
CCCGAGCCTG 



51 
I 

CTTCACCGGT 
CCAAGGCCAA 
GGCTGCTGCG 
CGGCAGTGCT 
ACAACAAGAA 
AGCTCAACAA 
AGGCCGTGCT 
GCAAGAAGGC 
CCCAGCTCCC 
AGCTGAGCCG 

CCTGCCGCGT 
CTTCCGGATG 
CCGGGGGGAG 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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GCCGGCGGCG CCGCACCTGC CCGCCTCGGC GTTCGTGACT CAGCCGCCCC ATCCCGAGTC 900 

GCTAAGGGGC TGCGGGGAGG CCGCAGCACC TTCTGGAAGA CTTGGCCTTC CGCTCTGACG 960 

CAGGGCCGAG GTGGGCAGTC CAGGCCGAGA GCCGGCGGCC CTGAAGGTGA GTGAGGCCCT 1020 

CGGCAGCTGC AGCCGGGGTG TCTGGTACCC CCCCGGCGTG GTGCTTAGCC CAGGACTTTC 1080 

AGACGGCCGC TGGCCGGGAG GCTTTGGTGG GAGAGACGOG ATCGCOGATT TOGGTCTGGC 1140 

GCCCCTTCTG CGGCOGGGAC CCAGGCCTTT CACATCAGCT CTCCCTCCAT CTTCATTCAT 1200 

AGGTCTGCGC TGGGGCCGGG ACGAAGCACT TGGTAACAGG CACATCTTCC TCCCGAGTGA 1260 

CTGCCTCCTA GGAGGACATT TAGGGGAGGG CAGAGGCCTG CAGTTTGGCT TCACGGCTGG 1320 

CTATGTGGAC AGCAAGAGTC GTTTTGCGGA ACGCGACTGG CAGCCAGGCC TGTCGGGCCC 1380 

CCGACGCCGC CCCATTTCCC TTCCAGCAAA CTCAACTCGG CAATCCAAGC ACCTAGATAC 1440 

CAGCACAAGT CGGTTAATCC CTGTCTGGAC TGAGCCTCCG TTGGCTTCTG AACTGGAATT 1500 

CTGCAGCTAA CCCTTCCACG ACTAGAACCT TAGGCATTGG GGAGTTTTAG ATGGACTAAT 1560 
TTTATTAAAG GATTGTTTTT TTTTT 

Seq ID NO: 379 Protein sequence 
Protein Acceseion #: NP_002096 

1 li 21 31 41 51 

9 I t I 1 I 

MSGRGKTGGK ARAKAKSRSS RAGLQFPVGR VHRLLRKGHY AERVGAGAPV YLAAVLEYLT 60 

AEILELAGNA ARDNKKTRI I PRHLQLAIRN DEELNKLLGG VTIAQGGVLP NIQAVLLPKK 120 
TSATVGP KAP SGGKKATQAS QEY 



Seq ID NO: 380 DMA sequence 
Nucleic Acid Accession #: AL136942 
Coding sequence: 184-864 

1 11 21 31 41 51 

l I l I I I 

ACGCGTCCGG CAGAAGCTCG GAGCTCTCGG GGTATCGAGG AGGCAGGCCC GCGGGCGCAC 60 

GGGCGAGCGG GCCGGGAGCC GGAGCGGCGG AGGAGCCGGC AGCAGCGGCG CGGCGGGCTC 120 

CAGGCGAGGC GGTCGACGCT CCTGAAAACT TGCGCGCGCG CTOGCGCCAC TGCGCCCGGA 180 

GCGATGAAGA TGGTCGCGCC CTGGACGCGG TTCTACTCCA ACAGCTGCTG CTTGTGCTGC 240 

CATGTCCGCA CCGGCACCAT CCTGCTCGGC GTCTGGTATC TGATCATCAA TGCTGTGGTA 300 

CTGTTGATTT TATTGAGTGC CCTGGCTGAT CCGGATCAGT ATAACTTTTC AAGTTCTGAA 360 

CTGGGAGGTG ACTTTGAGTT CATGGATGAT GCCAACATGT GCATTGCCAT TGCGATTTCT 420 

CTTCTCATGA TCCTGATATG TGCTATGGCT ACTTACGGAG CGTACAAGCA ACGCGCAGCC 480 

TGGATCATCC CATTCTTCTG TTACCAGATC TTTGACTTTG CCCTGAACAT GTTGGTTGCA 540 

ATCACTGTGC TTATTTATCC AAACTCCATT CAGGAATACA TACGGCAACT GCCT CCTAAT 600 

TTTCCCTACA GAGATGATGT CATGTCAGTG AATCCTACCT GTTTGGTCCT TATTATTCTT 660 

CTGTTTATTA GCATTATCTT GACTTTTAAG GGTTACTTGA TTAGCTGTGT TTGGAACTGC 720 

TACCGATACA TCAATGGTAG GAACTCCTCT GATGTCCTGG TTTATGTTAC CAGCAATGAC 780 

ACTACGGTGC TGCTACCCCC GTATGATGAT GCCACTGTGA ATGGTGCTGC CAAGGAGCCA 840 

CCGCCACCTT ACGTGTCTGC CTAAGCCTTC AAGTGGGCGG AGCTGAGGGC AGCAGCTTGA 900 

CTTTGCAGAC ATCTGAGCAA TAGTTCTGTT ATTTCACTTT TGCCATGAGC CTCTCTGAGC 960 

TTGTTTGTTG CTGAAATGCT ACTTTTTAAA ATTTAGATGT TAGATTGAAA ACTGTAGTTT 1020 

TCAACATATG CTTTGCTAGA ACACTGTGAT AGATTAACTG TAGAATTCTT CCTGTACGAT 1080 

TGGGGATATA ACGGGCTTCA CTAACCTTCC CTAGGCATTG AAACTTCCCC CAAATCTGAT 1140 

GGACCTAGAA GTCTGCTTTT GTACCTGCTG GGCCCCAAAG TTGGGCATTT TTCTCTCTGT 1200 

TCCCTCTCTT TTGAAAATGT AAAATAAAAC CAAAAATAGA CAACTTTTTC TTCAGCCATT 1260 

CCAGCATAGA GAACAAAACC TTATGGAAAC AGGAATGTCA ATTGTGTAAT CATTG TTCTA 1320 

ATTAGGTAAA TAGAAGTCCT TATGTATGTG TTACAAGAAT TTCCCCCACA ACATCCTTTA 1380 

TGACTGAAGT TCAATGACAG TTTGTGTTTG GTGGTAAAGG ATTTTCTCCA TGGCCTGAAT 1440 

TAAGACCATT AGAAAGCACC AGGCCGTGGG AGCAGTGACC ATCTACTGAC TGTTCTTGTG 1500 

GATCTTGTGT CCAGGGACAT GGGGTGACAT GCCTCGTATG TGTTAGAGGG TGGAATGGAT 1560 

GTGTTTGGCG CTGCATGGGA TCTGGTGCCC CTCTTCTCCT GGATTCACAT CCCCACCCAG 1620 

GGCCCGCTTT TACTAAGTGT TCTGCCCTAG ATTGGTTCAA GGAGGTCATC CAACTGACTT 1680 

TATCAAGTGG AATTGGGATA TATTTGATAT ACTTCTGCCT AACAACATGG -AAAAGGGTTT 174 0 

TCTTTTCCCT GCAAGCTACA TCCTACTGCT TTGAACTTCC AAGTATGTCT AGTCACCTTT 1800 

TAAAATGTAA ACATTTTCAG AAAAATGAGG ATTGCCTTCC TTGTATGCGC TTTTTACCTT 1860 

GACTACCTGA ATTGCAAGGG ATTTTTATAT ATTCATATGT TACAAAGTCA GCAACTCTCC 1920 

TGTTGGTTCA TTATTGAATG TGCTGTAAAT TAAGTCGTTT GCAATTAAAA CAAGGTTTGC 1980 
CCACATCCAA AAAAAAAAAA AAAAA 

Seq ID NO i 381 Protein sequence 
Protein Accession #: CAB66B76 

1 11 21 31 41 51 

I I I I I I 

MKMVAPWTRF YSNSCCLCCH VRTGTILLGV WYLIINAWL fclLLSALADP DQYNFSSSEL 60 

GGDFEFMDDA NMCIAIAISL LMILICAMAT YGAYKQRAAW IIPPFCYQIP DFALNMLVAI 120 

TVLIYPNSIQ EYIRQLPPNF PYRDDVMSVN PTCLVLIILL FISIILTFKG YLISCVWNCY 180 
RYINGRNSSD VLVYVTSNDT TVLLPPYDDA TVNGAAKEPP PPYVSA 

Seq ID NO: 382 DNA sequence 
Nucleic Acid Accession ft: NM_002510 
Coding sequence: 92-1774 

21 31 41 51 

1111 

TTGCTCTTGG TGGACGGGCC CAGAGGAATT CAGAGTTAAA 60 

GAGAATTCAG CATGGAATGT CTCTACTATT TCCTGGGATT 120 

TGCCACTTGA TGCCGCCAAA CGATTTCATG ATGTGCTGGG 180 

ACATGAGGGA GCACAATCAA TTAAATGGCT GGTCTTCTGA 240 

AACTCTACCC AGTGTGGAAG CGGGGAGACA . TGAGGTGGAA 300 

GTGTGCAGGC GGTCCTGACC AGTGACTCAC CAGCCCTCGT 360 



1 11 

I 1 
CAGATGCCAG AAGAACACTG 
CCTTGAGTGC CTGCGTCCGT 
TCTGCTCCTG GCTGCAAGAT 
CAATGAAAGA CCTTCTGCTT 
TGAAAATGAC TGGAATGAAA 
AAACTCCTGG AAGGGAGGCC 
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GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 420 

CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG G CTGGTTT AT CTGCTGATCC 480 

ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG GCACCGGCCA 540 

AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG 600 

ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 660 

TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 720 

GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA AAGATGTGTA 780 

CGTGGTAACA GATCAGATTC CTGTGTTTGT GACTATGTTC CAGAAGAACG ATCGAAATTC 840 

ATCCGACGAA ACCTTCCTCA AAGATCTCCC CATTATGTTT GATGTCCTGA TTCATGATCC 900 

TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG ATAATACTGG 960 

CCTGTTTGTT TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG GAACCTTCAG 1020 

CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC 10 80 

CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140 

TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG CCACCATCAC 1200 

AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT 1260 

GCCATGGCCT GAAAGCTCCC TAATAGACTT TGTCGTGACC TGCCAAGGGA GCATTCCCAC 1320 

GGAGGTCTGT ACCATCATTT CTGACCCCAC CTGCGAGATC ACCCAGAACA CAGTCTGCAG 1380 

CCCTGTGGAT GTGGATGAGA TGTGTCTGCT GACTGTGAGA CGAACCTTCA ATGGGTCTGG 1440 

GACGTACTGT GTGAACCTCA CCCTGGGGGA TGACACAAGC CTGGCTCTCA CGAGCACCCT 1500 

GATTTCTGTT CCTGACAGAG ACCCAGCCTC GCCTTTAAGG ATGGCAAACA GTGCCCTGAT 1560 

CTCCGTTGGC TGCTTGGCCA TATTTGTCAC TGTGATCTCC CTCTTGGTGT ACAAAAAACA 1620 

CAAGGAATAC AACCCAATAG AAAATAGTCC TGGGAATGTG GTCAGAAGCA AAGGCCTGAG 1680 

TGTCTTTCTC AACCGTGCAA AAGCCGTGTT CTTCCCGGGA AACCAGGAAA AGGATCCGCT 1740 

ACTCAAAAAC CAAGAATTTA AAGGAGTTTC TTAAATTTCG ACCTTGTTTC TGAAGCTCAC 1800 

TTTTCAGTGC CATTGATGTG AGATGTGCTG GAGTGGCTAT TAACCTTTTT TTCCTAAAGA 1860 

TTATTGTTAA ATAGATATTG TGGTTTGGGG AAGTTGAATT TTTTATAGGT TAAATGTCAT 1920 

TTTAGAGATG GGGAGAGGGA TTATACTGCA GGCAGCTTCA GCCATGTTGT GAAACTGATA 1980 

AAAGCAACTT AGCAAGGCTT CTTTTCATTA TTTTTTATGT TTCACTTATA AAGTCTTAGG '2040 

TAACTAGTAG GATAGAAACA CTGTGTCCCG AGAGTAAGGA GAGAAGCTAC TATTGATTAG 2100 

AGCCTAACCC AGGTTAACTG CAAGAAGAGG CGGGATACTT TCAGCTTTCC ATGTAACTGT 2160 

ATGCATAAAG CCAATGTAGT CCAGTTTCTA AGATCATGTT CCAAGCTAAC TGAATCCCAC 2220 

TTCAATACAC ACTCATGAAC TCCTGATGGA ACAATAACAG GCCCAAGCCT GTGGTATGAT 2280 

GTGCACACTT GCTAGACTCA GAAAAAATAC TACTCTCATA AATGGGTGGG AGTATTTTGG 234 0 

TGACAACCTA CTTTGCTTGG CTGAGTGAAG GAATGATATT CATATATTCA TTTATTCCAT 2400 

GGACATTTAG TTAGTGCTTT TTATATACCA GGCATGATGC TGAGTGACAC TCTTGTGTAT 2460 

ATTTCCAAAT TTTTGTATAG TCGCTGCACA TATTTGAAAT CATATATTAA GACTTTCCAA 2520 

AGATGAGGTC CCTGGTTTTT CATGGCAACT TGATCAGTAA GGATTTCACC TCTGTTTGTA 2580 

ACTAAAACCA TCTACTATAT GTTAGACATG ACATTCTTTT TCTCTCCTTC CTGAAAAATA 2640 
AAGTGTGGGA AGAGACAAAA AAAAAAAAA 

Seq ID NO: 383 Protein sequence 
Protein Accession ft: NP_002501 

1 11 21 31 41 51 

I 1 I I I I 

MECItYYFLGF LLLAARLPLD AAKRFHDVLG NERPSAYMRE HNQLNGWSSD ENDWNEKLYP 60 

VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVNL IFPRCQKEDA NGNIVYEKKC 120 

RNEAGLSADP YVYNWTAWSE DSDGENGTGQ SHHNVFPDGK PFPHHPGWRR WNFIYVFHTL 180 

GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRHGRAY VPIAQVKDVY WTDQIPVFV 240 

TMFQKNDRNS SDETFLKDLP IMFDVLIHDP SHFIiNYSTIN YKWSFGDNTG LFVSTNHTVN 300 

HTYVIiNGTFS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLELS RIPDENCQIN 360 

RYGHPQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF WTCQGSIPT EVCTIISDPT 42 0 

CEITQNTVCS PVDVDEMCLL TVRRTFNG SG TYCVNLTLGD DTSLALTSTL ISVPDRDPAS 480 

PLRMANSALI SVGCLAIFVT VISLL.VYKKH KEYNPIENSP GNWRSKGLS VFLNRAKAVP 540 
FPGNQEKDPL LKNQEFKGVS 

Seq ID NO: 384 DNA sequence 
Nucleic Acid Accession ft: NM_001134 
Coding sequence: 4 8-1877 

1 11 21 31 41 51 

I I I I I I 

TCCATATTGT GCTTCCACCA CTGCCAATAA CAAAATAACT AGCAACCATG AAGTGGGTGG 60 

AATCAATTTT TTTAA TTTT C CTACTAAATT TTACTGAATC CAGAACACTG CATAGAAATG 120 

AATATGGAAT AGCTTCCATA TTGGATTCTT ACCAATGTAC TGCAGAGATA AGTTTAGCTG 180 

ACCTGGCTAC CATATTTTTT GCCCAGTTTG TTCAAGAAGC CACTTACAAG GAAGTAAGCA 240 

AAATGGTGAA AGATG CATTG ACTG CAATTG AGAAACCCAC TGGAGATGAA CAGTCTTCAG 300 

GGTGTTTAGA AAACCAG CT A CCTGCCTTTC TGGAAGAACT TTGCCATGAG AAAGAAATTT 360 

TGGAGAAGTA CGGACATTCA GACTGCTGCA GCCAAAGTGA AGAGGGAAGA CATAACTGTT 420 

TTCTTGCACA CAAAAAGCCC ACTCCAGCAT CGATCCCACT TTTCCAAGTT CGAGAACCTG 480 

TCACAAGCTG TGAAGCATAT GAAGAAGACA GGGAGACATT CATGAACAAA TTCATTTATG 540 

AGATAGCAAG AAGGCATCCC TTCCTGTATG CACCTACAAT TCTTCTTTGG GCTGCTCGCT 600 

ATGACAAAAT AATTCCATCT TGCTGCAAAG CTGAAAATGC AGTTGAATGC TTCCAAACAA 660 

AGGCAGCAAC AGTTACAAAA GAATTAAGAG AAAGCAGCTT GTTAAATCAA CATGCATGTG 720 

CAGTAATGAA AAATTTTGGG ACCCGAACTT TCCAAGCCAT AACTGTTACT AAACTGAGTC 780 

AGAAGTTTAC CAAAGTTAAT TTTACTGAAA TCCAGAAACT AGTCCTGGAT GTGGCCCATG 840 

TACATGAGCA CTGTTGCAGA GGAGATGTGC TGGATTGTCT GCAGGATGGG GAAAAAATCA 900 

TGTCCTACAT ATGTTCTCAA CAAGACACTC TGTCAAACAA AATAACAGAA TGCTGCAAAC 960 

TGACCACGCT GGAACGTGGT CAATGTATAA TTCATGCAGA AAATGATGAA AAACCTGAAG 1020 

GTCTATCTCC AAATCTAAAC AGGTTTTTAG GAGATAGAGA TTTTAACCAA TTTTCTTCAG 1080 

GGGAAAAAAA TATCTTCTTG GCAAGTTTTG TT CATGAAT A TTCAAGAAGA CATCCTCAGC 1140 

TTGCTGTCTC AGTAATTCTA AGAGTTGCTA AAGGATACCA GGAGTTATTG GAGAAGTGTT 1200 

TCCAGACTGA AAACCCTCTT GAATGCCAAG ATAAAGGAGA AGAAGAATTA CAGAAATACA 1260 

TCCAGGAGAG CCAAGCATTG GCAAAGCGAA GCTGCGGCCT CTTCCAGAAA CTAGGAGAAT 1320 

ATTACTTACA AAATGCGTTT CTCGTTGCTT ACACAAAGAA AGCCCCCCAG CTGACCTCGT 1380 

CGGAGCTGAT GGCCATCACC AGAAAAATGG CAGCCACAGC AGCCACTTGT TGCCAACTCA 1440 

GTGAGGACAA ACTATTGGCC TGTGGCGAGG GAGCGGCTGA CATTATTATC GGACACTTAT 1500 



328 
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GTATCAGACA TGAAATGACT CCAGTAAACC CTGGTGTTGG CCAOTGCTQC ACTTCTTCAT 1560 

ATGCCAACAG GAGGCCATGC TTCAGCAGCT TGGTGGTGGA TGAAACATAT GTCCCTCCTG 1620 

CATTCTCTGA TGACAAGTTC ATTTTCCATA AGGATCTGTG CCAAGCTCAG GGTGTAGCGC 1680 

TGCAAACGAT GAAGCAAGAG TTTCTCATTA ACCTTGTGAA GCAAAAGCCA CAAATAACAG 1740 

AGGAACAACT TOAGGCTGTC ATTGCAGATT TCTCAGGCCT GTTGGAGAAA TGCTGCCAAG 1800 

GCCAGGAACA GGAAGTCTGC TTTGCTGAAG AGGGACAAAA ACTGATTTCA AAAACTCGTG 1860 

CTGCTTTGGG AGTTTAAATT ACTTCAGGGG AAGAGAAGAC AAAACGAGTC TTTCATTCGG 1920 

TGTGAACTTT TCTCTTTAAT TTTAACTGAT TTAACACTTT TTGTGAATTA ATGAAATGAT 1980 
AAAGACTTTT ATGTGAGATT TCCTTATCAC AGAAATAAAA TATCTCCAAA TG 

Seq ID NO: 385 Protein sequence 
Protein Accession #: NP_001125 

1 11 21 31 41 51 

I I I I I I 

MKWVESIFLI FLLNFTESRT LHRNEYGIAS ILDSYQCTAE ISLADLATIF FAQPVQEATY 60 

KEVSKMVKDA LTAIEKPTGD EQSSGCLENQ LPAFLEELCH EKEILEKYGH SDCCSQSEEG 120 

RHNCFLAHKK PTPASIPLFQ VPEPVTSCEA YEEDRETPMN KFIYEIARRH PFLYAPTILL 180 

WAARYDKIIP SCCKAENAVE CFQTKAATVT KELRESSLLN QHACAVMKNF GTRTFQAITV 240 

TKLSQKFTKV NFTEIQKLVL DVAHVHEHCC RGDVLDCLQD GEKIMSYICS QQDTLSNKIT 3 00 

ECCKLTTLER GQCIIHAEND EKPEGLSPNL NRFLGDRDFN QFSSGEKNIF LASFVHEYSR 360 

RHPQLAVSVI LRVAKGYQEL LEKCFQTENP LECQDKGEEE LQKYIQESQA LAKRSCGLFQ 420 

KLGEYYLQNA FLVAYTKKAP QLTSSELMAI TRKMAATAAT CCQLSEDKLL ACGEGAADII 480 

IGHLCIRHEM TPVNPGVGQC CTSSYANRRP CFSSLWDET YVPPAFSDDK FIFHKDLCQA 540 

QGVALQTMKQ EFLINLVKQK PQITEEQLEA VIADFSGLL.E KCCQGQEQEV CFAEEGQKLI 600 
SKTRAALGV 

Seq ID NO: 386 DNA sequence 

Nucleic Acid Accession #t NM_002205.1 

Coding sequence : 1 . . 3149 

1 11 21 31 41 51 

I I . I I I I 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 60 

CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120 

GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180 

GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240 

CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 300 

TGGGGTGCCA GCCCCACACA QTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 3 60 

CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 420 

TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480 

AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540 

GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCGGCT CAGATTTCAG CTGGGCAGCA 600 

GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660 

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780 

CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 640 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900 

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960 

CAGATGGCCT CCTACTTTGG CTATGCAGTO GCCGCCACAG ACGTCAATGG GGAOGGGCTG 1020 

GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATOGGA CCCCTGACGG GCGGCCTCAG 1080 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200 

GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260 

CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320 

CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCACC 150 0 

ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560 

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620 

GGTTT CACAO TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680 

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 1800 

CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 

CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920 

ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980 

GGGGAGCAGA AC CATGTGT A CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 2040 

CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGOAACTTCT CCAGCCTGAG CTGTGACTAC 2160 

TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCOGGGACAC TAAGAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 2340 

TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 2400 

GAGG CAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460 

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGT CAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700 

TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760 

TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820 

TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940 

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

AT CAT CAT CC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120 

CTCAAGCCTC CAGCCACCTC TGATGCCTGA 

\ 



329 



WO 02/086443 



PCT/US02/12476 



Seq ID NO i 387 Protein sequence 
Protein Accession #: NP_002196.1 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



i 
I 

MGSRTPESPL 
GFSVEFYRPG 
LESSLSSSEG 
DNFTRI LEY A 
IAESYYPEYL 
GYVTILNGSD 
EVGRVYVYLQ 
QQGWPVFPG 
VDKAWYRGR 
GPTVELQLDW 
LSPIHIALNF 
GEQNHVYLGD 
FAVNQSRLLV 
SFRLSVEAQA 
SQGVLELSCP 
SASSGPQILK 
YKALKMPYRX 
YKLGFFKRSL 



11 
I 

HAVQLRWGPR 
TDGVSVLVGA 
EEPVEYKSLQ 
PCRSDFSWAA 
INLVQGQLQT 
IRSLYNFSGE 
HPAGIEPTPT 
GPGGLGSKPS 
PIVSASASLT 
QKQKGGVRRA 
SLDPQAPVDS 
KNALNLTFHA 
CDLGNPMKAG 
QVTLNGVSKP 
QALEGQQLLY 
CPEAECFRLR 
LPRQLPQKER 
PYGTAMEKAQ 



21 
I 

RRPPLLPLLL 
PKANTSQPGV 
WFGATVRAHG 
GQGYCQGGFS 
RQASSIYDDS 
QMASYFGYAV 
LTLTGHDEFG 
QVLQPLWAAS 
IFPAMFNPEE 
LFLASRQATL 
HGLRPALHYQ 
QNVGEGGAYE 
ASLWGGLRFT 
EAVIiFPVSDW 
VTRVTGLNCT 
CELGPLHQQE 
QVATAVQWTK 
LKPPATSDA 



31 
I 

IiLLPPPPRVG 
LQGGAVYLCP 
SSILACAPLY 
AEFTKTGRW 
YLGYSVAVGE 
AATDVNGDGIi 
RFGSSLTPLG 
HTPDFFGSAL 
RSCSLEGNPV 
TQTLLIQNGA 
SKSRIEDKAQ 
AELRVTAPPE 
VPHLRDTKKT 
HPRDQPQKEE 
TNHPINPKGL 
SQSLQLHFRV 
AEGSYGVPLW 



41 
I 

GFNLDAEAPA 
WGASPTQCTP 
SWRTEKEPLS 
LGGPGSYFWQ 
FSGDDTEDFV 
DDLLVGAPLL 
DLDQDGYNDV 
RGGRDLDGKG 
ACINLSFCLN 
REDCREMKIY 
ILLDCGEDNI 
AEYSGLVRHP 
IQFDFQILSK 
DLGPAVHHVY 
ELDPEGSLHH 
WAKTFLQREH 
IIILAILFGL 



51 

1 

VLSGPPGSFF 

IEFDSKGSRL 

DPVGTCYLST 

GQILSATQEQ 

AGVPKGNLTY 

MDRTPDGRPQ 

AIGAPFGGET 

YPDLIVGSFG 

ASGKHVADS I 

LRNESEFRDK 

CVPDLQLEVF. 

GNFSSLSCDY 

NLNNSQSDW 

ELINQGPSSI 

QQKREAPSRS 

QPFSLQCEAV 

LLLGLLIYIL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq' ID NO: 388 DNA sequence 

Nucleic Acid Accession #: NMJ)0242S 

Coding sequences 26.. 1453 



1 
I 

AAAGAAGGTA 
AGTCTGCTCT 
TGCCCAGCAA 
AAAGGACAGT 
GGTGACAGGG 
TCCTGACGTT 
TACATACAGG 
TGAGAAAGCT 
AGGAGAGGCT 
TGATGGCCCA 
TATTCACTTT 
CGTTGCTGCT 
TTTGATGTAC 
TGATGTGAAT 
GGTGCCCACA 
GTCCTTCGAT 
TTGGCGAAGA 
CTCTCTTCCA 
TTTTAAAGGA 
AGGCATCCAT 
CAAGGAAAAG 
TAGCCAGTCC 
GCCTAAGGTT 
ACAGTTTGAG 
GTTACATTGC 
ATTATTCATC 
GAAGAAGATG 
ACTTGCTTTT 
ATGTATTTTC 
CTT 



11 
I 

AGGGCAGTGA 
GCCTATCCTC 
TACCTAGAAA 
AATCTCATTG 
AAGCTAGACA 
GGTCACTTCA 
ATTGTGAATT 
CTGAAAGTCT 
GATATAATGA 
GGACACAGTT 
GATGATGATG 
CATGAACTTG 
CCACTCTACA 
GGCATTCAGT 
AAATCTGTTC 
GCCATCAGCA 
TCCCACTGGA 
TCATATTTGG 
AATGAGTTCT 
ACCCTGGGTT 
AAGAAAACAT 
ATGGAGCAAG 
GATGCTGTAT 
TTTGACCCCA 
TAGGCGAGAT 
TAATGTATTA 
AGCCTTGCAG 
GAATTGCACT 
ATAGATGTGT 



21 
I 

GAATGATGCA 
TGAGTGGGGC 
AGTACTACAA 
TTAAAAAAAT 
CTGACACTCT 
GCTCCTTTCC 
ATACACCAGA 
GGGAAGAGGT 
TCTCTTTCGC 
TGGCTCATGC 
AAAAATGGAC 
GCCACTCCCT 
ACTCATTCAC 
CTCTCTACGG 
CTTCGGGATC 
CTCTGAGGGG 
ACCCTGAACC 
ATGCTGCATA 
GGGCCATCAG 
TTCCTCCAAC 
ACTTCTTTGC 
GCTTCCCTAG 
TACAGGCATT 
ATGCCAGGAT 
AGGGGGAAGA 
TGAGCCAAAA 
ATATCTGCAT 
GAACAGAATT 
TATTACTTCC 



31 
I 

TCTTGCATTC 
AGCAAAAGAG 
CCTCGAAAAG 
CCAAGGAATG 
GGAGGTGATG 
TGGCATGCCG 
TTTGCCAAGA 
GACTCCACTC 
AGTTAAAGAA 
CTACCCACCT 
AGAAGATGCA 
GGGGCTCTTT 
AGAGCTCGCC 
ACCTCCCCCT 
TGAGATGCCA 
AGAATATCTG 
TGAATTTCAT 
TGAAGTTAAC 
AGGAAATGAG 
CATAAGGAAA 
AGCGGACAAA 
ACTAATAGCT 
TGGATTTTTC 
GGTGACACAC 
CAGATATGGG 
TGGTTAATTT 
GTGTCATGAA 
AAGAAATACT 
TCAATAAAAA 



41 

I 

CTTGTGCTGT 
GAGGACTCCA 
GATGTGAAAC 
CAGAAGTTCC 
CGCAAGCCCA 
AAGTGGAGGA 
GATGCTGTTG 
ACATTCTCCA 
CATGGAGACT 
GGACCTGGGC 
TCAGGCACCA 
CACTCAGCCA 
CAGTTCCGCC 
GCCTCTACTG 
GCCAAGTGTG 
TTCTTTAAAG 
TTGATTTCTG 
AGCAGGGACA 
GTACAAGCAG 
ATTGATGCAG 
TACTGGAGAT 
GATGACTTTC 
TACTTCTTCA 
ATATTAAAGA 
TGTTTTTAAT 
TTCCTGCATG 
GAATGTTTCT 
CATGTGCAAT 
GTTTTATTTT 



51 

I 

TGTGTCTGCC 
ACAAGGATCT 
AGTTTAGAAG 
TTGGGTTGGA 
GGTGTGGAGT 
AAACCCACCT 
ATTCTG C CAT 
GGCTGTATGA 
TTTACTCTTT 
TTTATGGAGA 
ATTTATTCCT 
ACACTGAAGC 
TTTCGCAAGA 
AGGAACCCCT 
ATCCTGCTTT 
ACAGATATTT 
CATTTTGGCC 
CCGTTTTTAT 
GTTATCCAAG 
CTGTTTCTGA 
TTGATGAAAA 
CAGGAGTTGA 
GTGGATCATC 
GTAACAGCTG 
AAATCTAATA 
TTCTGTGACT 
GGAATTCTTC 
AGGTGAGAGA 
GGGCCTGTTC 



Seq ZD NO: 389 Protein sequence 
Protein Accession #: NP_002416 



MHLAFLVLLC 
KIQGMQKFLG 
PDLPRDAVDS 
HAYPPGPGLY 
FTELAQFRLS 
RGEYLFFKDR 
IRGNEVQAGY 
PRLIADDFPG 



11 
I 

LPVCSAYPLS 
XiEVTGKLDTD 
AIEKALKVWE 
GDIHFDDDEK 
QDDVNGIQSL 
YFWRRSHWNP 
PRGIHTLGFP 
VEPKVDAVLQ 



21 
I 

GAAKEEDSNK 
TLEVMRKPRC 
EVTPLTFSRL 
WTEDASGTNL 
YGPPPASTEE 
EPEFHLISAF 
PT I RKIDAAV 
AFGFFYFFSG 



31 

I 

DLAQQYLEKY 
GVPDVGHFSS 
YEGEADIMIS 
FLVAAHELGH 
PLVPTKSVPS 
WPSLPSYLDA 
SDKEKKKTYF 
SSQFEFDPNA 



41 
I 

YNLEKDVKQF 
FPGMPKWRKT 
F AVKEHGD FY 
6LGLFHSANT 
GSEMPAKCDP 
AYEVNSRDTV 
FAADKYWRFD 
RMVTHILKSN 



51 
I 

RRKDSNLIVK 
HLTYRIVNYT 
SFDGPGHSLA 
EALMYPIiYNS 
AL 9 FDA I STL 
FIFKGNEFWA 
ENSQSMEQGF 
SWLHC 



Seq ID NO: 390 DNA sequence 

Nucleic Acid Accession #i NM_002421.2 

Coding sequence: 1..1409 



1 
I 

ATGCACAGCT 
CCAGCGACTC 
TACTACAACC 
GTTGAAAAAT 
GCTGAAACCC 



11 
I 

TTCCTCCACT 
TAGAAACACA 
TGAAGAATGA 
TGAAGCAAAT 
TGAAGGTGAT 



21 

I 

GCTGCTGCTG 
AGAGCAAGAT 
TGGGAGGCAA 
GCAGGAATTC 
GAAGCAGCCC 



31 
I 

CTGTTCTGGG 
GTGGACTTAG 
GTTGAAAAGC 
TTTGGGCTGA 
AGATGTGGAG 



41 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



60 
120 
180 
240 
300 
360 
420 



GTGTGGTGTC ACACAGCTTC 
TCCAGAAATA CCTGGAAAAA 
GGAGAAATAG TGGCCCAGTG 
AAGTGACTGG GAAACCAGAT 
TGCCTGATGT GGCTCAGTTT 



60 
120 
1B0 
240 
300 



330 



5 

10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC 
TACACGCCAG ATTTGCCAAG AGCAOATOTO GACCATGCCA 
TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG 
ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT 
CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG 
GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC 
GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG 
ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG 
GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA 
AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG 
TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC 
TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT 
CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC 
CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG 
CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG 
GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA 
GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG 
GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 

Seq ID NO: 391 Protein sequence 
Protein Accession #: NP_002412.1 



PCT/US02/12476 



MHSFPPLLLL 
VEKLKQMQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 

I 

LFWGWSHSF 
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVSLNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQliPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 
I 

VDLVQKYLEK 
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHEL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



Seq ID NO: 392 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 



l 

I 

ATGCACAGCT 
CCAGCGACTC 
TACTACAACC 
GTTGAAAAAT 
GCTGAAACCC 
GTCCTCACTG 
TACACGCCAG 
TGGAGTAATG 
ATATCTTTTG 
CTTGCTCATG 
GAAAGGTGGA 
GGCCATTCTC 
ACCTTCAGTG 
GGACGTTCCC 
AAGCTAACCT 
TTCTACATGC 
TGGCCACAAC 
CGGTTTTTCA 
CCCAAGGACA 
CTTTCTGAGG 
GATGAATATA 
GGAATTGGCC 
GGAACAAGAC 
AATAGCTGGT 



11 

I 

TTCCTCCACT 
TAGAAACACA 
TGAAGAATGA 
TGAAGCAAAT 
TGAAGGTGAT 
AGGGGAACCC 
ATTTGCCAAO 
TCACACCTCT 
TCAGGGGAGA 
CTTTTCAACC 
CCAACAATTT 
TTGGACTCTC 
GTGATGTTCA 
AAAATCCTGT 
TTGATGCTAT 
GCACAAATCC 
TGCCAAATGG 
AAGGGAATAA 
TCTACAGCTC 
AAAACACTGG 
AACGATCTAT 
ACAAAGTTGA 
AATACAAATT 
TCAACTGCAG 



21 
I 

GCTGCTGCTG 
AGAGCAAGAT 
TGGGAGGCAA 
GCAGGAATTC 
GAAGCAGCCC 
TCGCTGGGAG 
AGCAGATGTG 
GACATTCACC 
TCATCGGGAC 
AGGCCCAGGT 
CAGAGAGTAC 
CCATTCTACT 
GCTAGCTCAG 
CCAGCCCATC 
AACTACGATT 
CTTCTACCCG 
GCTTGAAGCT 
GTACTGGGCT 
CTTTGGCTTC 
AAAAACCTAC 
GGATCCAGGT 
TGCAGTTTTC 
TGATCCTAAA 
GAAAAATTAG 



31 



GTGGACTTAG 
GTTGAAAAGC 
TTTGGGCTGA 
AGATGTGGAG 
CAAACACATC 
GACCATGCCA 
AAGGTCTCTG 
AACTCTCCTT 
ATTGGAGGGG 
AACTTACATC 
GATATCGGGG 
GATGACATTG 
GGCCCACAAA 
CGGGGAGAAG 
GAAGTTGAGC 
GCTTACGAAT 
GTTCAGGGAC 
CCTAGAACTG 
TTCTTTGTTG 
TATCCCAAAA 
ATGAAAGATG 
ACGAAGAGAA 



TGACCTACAG 
TTGAGAAAGC 
AGGGTCAAGC 
TTGATGGACC 
ATGCTCATTT 
GTGTTGCGGC 
CTTTGATGTA 
ATGGCATCCA 
CCCCAAAAGC 
TGATGTTCTT 
TCAATTTCAT 
TTGCCGACAG 
AGAATGTGCT 
TGAAGCATAT 
CTAACAAATA 
TGATAGCACA 
GATTTTTCTA 
TTTTGACTCT 



41 

I 

YYNLKNDGRQ 
VLTEGNPRWE 
ISFVRGDHRD 
GHSLGLSHST 
KLTFDAITTI 
RFFKGNKYWA 
DEYKRSMDPG 
NSWFNCRKN 



41 

I 

GTGTGGTGTC 
TCCAGAAATA 
GGAGAAATAG 
AAGTGACTGG 
TGCCTGATGT 
TGACCTACAG 
TTGAGAAAGC 
AGGGTCAAGC 
TTGATGGACC 
ATGCTCATTT 
GTGTTGCGGC 
CTTTGATGTA 
ATGGCATCCA 
CCCCAAAAGC 
TGATGTTCTT 
TCAATTTCAT 
TTGCCGACAG 
AGAATGTGCT 
TGAAGCATAT 
CTAACAAATA 
TGATAGCACA 
GATTTTTCTA 
TTTTGACTCT 



GATTGAAAAT 


360 


CTTCCAACTC 


420 


AGACATCATG 


480 


TGGAGGAAAT 


540 


TGATGAAGAT 


600 


TCATGAACTC 


660 


CCCTAGCTAC 


720 


AGCCATATAT 


780 


ATGTGACAGT 


840 


TAAAGACAGA 


900 


TTCTGTTTTC 


960 


AGATGAAGTC 


1020 


ACACGGATAC 


1080 


CGATGCTGCT 


1140 


CTGGAGGTAT 


1200 


TGACTTTCCT 


1260 




1320 


CCAGAAAGCT 


1380 


51 




1 

VEKRRNSGPV 


60 


QTHLTYRIEN 


120 


NSPFDGPGGN 


180 


DIGALMYPSY 


240 


RGEVMFFKDR 


300 


VQGQNVLHGY 


360 


YPKMIAHDFP 


420 


SI 




| 

ACACAGCTTC 


60 


CCTGGAAAAA 


120 


TGGCCCAGTG 


180. 


GAAACCAGAT 


240 


GG CTCAGTTT 


300 


GATTGAAAAT 


360 


CTTCCAACTC 


420 


AGACATCATG 


480 


TGGAGGAAAT 


540 


TGATGAAGAT 


600 


TCATGCCCTC 


660 


CCCTAGCTAC 


720 


AGCCATATAT 


780 


ATGTGACAGT 


840 


TAAAGACAGA 


900 


TTCTGTTTTC 


960 


AGATGAAGTC 


1020 


ACACGGATAC 


1080 


CGATGCTGCT 


1140 


CTGGAGGTAT 


1200 


TGACTTTCCT 


1260 




1320 


CCAGAAAGCT 


1380 



Seq ID NO: 393 Protein sequence 
Protein Accession #* NP_002412.l 



l 
I 

MHSFPPLLLL 
VEKLKQMQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 

I 

LFWGWSHSF 
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVELNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 

I 

VDLVQKYLEK 
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHAL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



41 

I 

YYNLKNDGRQ 
VLTEGNPRWE 
ISFVRGDHRD 
GHSLGLSHST 
KLTFDAITTI 
RFFKGNKYWA 
DEYKRSMDPG 
NSWFNCRKN 



51 
I 

VEKRRNSGPV 
QTHLTYRIEN 
NSPFDGPGGN 
DIGALMYPSY 
RGEVMFFKDR 
VQGQNVLHGY 
YPKMIAHDFP 



Seq ID NO: 394 DNA Bequence 

Nucleic Acid Accession #> NM_014331«2 

Coding sequence: 1..1506 



60 
120 
180 
240 
300 
360 
420 



11 
I 



21 



31 
I 



41 
I 



51 
I 
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ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GG AACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATQGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GT CTCTGAT A CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1660 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

AG CAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 

AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 

GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTQ CACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 
CCATCTCCAA AAAAAAAAAA AAAA 

Seq ID NO: 3 95 Protein sequence 
Protein Accession fti NP_055146.1 

1 11 21 31 41 51 

I I I I I I 

MVRKPWSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60 

GIFISPKGVL QNTGSVGMSI* TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120 

LPAFVRVWVE LLIIRPAATA VXSIiAFGRYX LEPFPIQCEI PELAIKLITA VGITWMVLN 180 

SMSVSWSAR1 QIFLTFCKLT AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 240 

YGMYAYAGWF YIiNFVTEEVE NPEKTIPLAI CISMAITIGV YVLTNVAYFT TINAEELLLS 300 

NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH liPEILSMIHV 360 

RKHTPIiPAVl VLHPLTMIML FSGDLDSLLN FLSFARWLFt GLAVAGL I YL RYKCPDMHRP 420 

FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480 
SEKITRTLQI ILEWPEEDK L 



Seq ID NO: 396 DNA sequence 
Nucleic Acid Accession #: NM_006S28 
Coding sequence: 57.. 764 

1 11 21 31 41 51 

I I I 1 I I 

GCCGCCAGCG GCTTTCTCGG ACGCCTTGCC CAGCGGGCCG CCCGACCCCC TGCACCATGG 60 

ACCCCGCTCG CCCCCTGGGG CTGTCGATTC TGCTGCTTTT CCTGACGGAG GCTGCACTGG 120 

GCGATGCTGC TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCTCCTG CCCCTAGACT 180 

ACGGACCCTG CCGGGCCCTA CTTCTCCGTT ACTACTACGA CAGGTACACG CAGAGCTGCC 240 

GCCAGTTCCT GTACGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGAGGCTT 300 

GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG 360 

TGGACGACCA GTGTGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

GTGAAAAATT CTTTTCCGGT GGGTGTCACC GGAACCGGAT TGAGAACAGG TTTCCAGATG 480 

AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 540 

AAGATGAGGG ACTGTGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600 

CCTGTGATGC TTTCACCTAT ACTGGCTGTG GAGGGAATGA CAATAACTTT GTTAGCAGGG 660 
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AGGATTGCAA ACGTGCATGT GCAAAAGCTT TGAAAAAGAA AAAOAAOATG CCAAAGCTTC 720 

GCTTT6CCAG TAGAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC TTAATATGTC 780 

ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGGTTGT ATCTGAAGAA TAATATGACA 840 

GCATGAGGAA ACAAATCATT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 900 

TTCAAAAATT TGGATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA GTCTACCATT 960 

TTTAATTTAT GGTTCAACTG TTTGTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC X020 

AAATATGACT CACTCATTTC TTGGGGTCGT ATTCCTGATT TCAGAAGAGG ATCATAACTG 1080 

AAACAACATA AGACAATATA ATCATGTGCT TTTAACATAT TTGAGAATAA AAAGGACTAG 1140 
CC 



Seq ID NO: 397 Protein sequence 
Protein Accession ft: NP_006519 

l ii 21 31 41 51 

I I I I I I 

MDPARPLGLS ILLLFLTEAA LGDAAQEPTG NNAEICLLPI* DYGPCRALLL RYYYDRYTQS 60 

CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNLSSM 120 

TCEKFFSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 180 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 

Seq ID NO j 398 DNA sequence 

Nucleic Acid Accession ft: NM_001508.1 

Coding sequence: 1..1361 

1 11 21 31 41 51 

I I I I I I 

ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 60 

CCCGAGTTTG AGGTGGCCAC CTGGATCAAA AT CACCCTTA TTCTGGTGTA CCTGATCATC 120 

TTCGTGATGG GCCTTCTGGG GAACAGCGTC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 180 

AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 240 

TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300 

ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360 

GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 420 

TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 480 

GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 540 

GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 600 

CAGCCCGAOA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG GACCGTGTTC 660 

CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720 

ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT GGCCGGGGGC 780 

ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 840 

ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 900 

ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GOACGAGGTC CTACTTCCGG 960 

GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020 

CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 1080 

TGCCGCCTGT CGCTGCAGCA CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 1140 

ACCACCGACA GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 1200 

TCTGCAAGGA GAACTGAGAA GATTTTCTTA AG CACTTTT C AGAGCGAGGC CGAGCCCCAG 1260 

TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320 
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTTT GA 



Seq ID NO: 399 Protein sequence 
Protein Accession ft: NP_001499.1 

1 ll 21 31 41 51 

I I I I I I 

MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLIIiVYLII FVMGIiLGNSV TIRVTQVLQK 60 
KGYLQKEVTD HMVSIACSDI LVFIilGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY 120 
ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL 180 
VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFW YLWLLSVAF 240 
MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TTIFLRLIW TIAVCWMPNQ 300 
IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 360 
CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420 
SKSQSLSLES LEPNSGAKPA NSAAENGFQB HEV 



Seq ID NO: 400 DNA sequence 

Nucleic Acid Accession ft: NM_006475.1 

Coding sequence: 28.. 2538 

1 11 21 31 41 51 

I I I I I* I 

AACAGAACTG CAACGGAGAG ACT CAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 

TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTCAT 120 

AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 180 

ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 

AAAACGACTG TTTT AT AT G A ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 

TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 360 

ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 

TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 480 

GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 

AAGAGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 

TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 

ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720 

CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780 

GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 

TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 900 

GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 

TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
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GGATGTOACG GTGACAGTAT AACAGTAAAT GGAATCAAAA 
GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT 
GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC 
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC 
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT 
TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT 
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA 
CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC 
AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA 
GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA 
ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG 
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA 
AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT 
ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG 
AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG 
CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC 
TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT 
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC 
TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG 

Seq ID NO: 401 Protein sequence 
Protein Accession »: NP_006466.l 



PCT/US02/12476 



TGGTGAACAA 
TAATTCCTGA 
CGGATCTTGT 
TGCTGGCACC 
TTAAATTAAT 
ACGGGCAAAT 
CTGTCTGCAT 
TTCACATATT 
AAGATAAGCG 
TGACACAACC 
CTAGTGAAGA 
ATCACCTGAC 
TAAAGACCAC 
ATGAATTGAA 
ATAAACTCCT 
ATAAATTAAT 
CCGTGACTGT 
TTGAAGGCAG 
TTGAAGGTGA 
ATGGAGAGCC 
CTGAAAAAGA 
TTTCTACTGG 
CCAAGGTCAC 
AAAGACTGCT 
GTTCTAGAAG 
TGTTTATACA 
TCAGGAACTG 
ATATTTTTTT 
AGGAATTGAA 
CTAACTTTGG 
AAGAGTTGTG 
ATAAAATGCA 
GGTGTACAAA 
GTAAATTATG 
TTGCAATGCA 
GAATTACTTC 
ACTTGGGAA 



AAAGGATATT 
TTCTGCCAAA 
GGCCCAATTA 
TGTGAATAAT 
TCTGCAGAAT 
ACTGGAAACC 
TGAAAATTCA 
CCGCGAGATC 
CTTTAGCACC 
TGGAGACTGG 
AAAAGAAATT 
ACCAGGAGTT 
ACAAGGAAGC 
ATCAAAAGAA 
CTATCCAGCA 
CAAATACATC 
CTATACAACT 
TCTTCAGCCT 
ACCTGAATTC 
AATTATTAAA 
GACACGAGAA 
AGGTGGAGAA 
CAAATTCATT 
TCAGGGAGAC 
ACGATTAAGG 
ACCCTAAGTC 
AAA CATC AG C 
TTCTGAATGA 
GAAAATATAA 
AATCCATTAG 
AACTGTTATC 
CGCAAGCCAT 
ACTTTTTATA 
TTATTTTTTA 
TATTTTTTAA 
AAATTGAGTA 



1 
I 

MIPFfcPMFSL 
KNWYKKSICG 
KLREEIEGKG 
NGMIIPSMYN 
IEAEDDLSSP 
MKYHILNTLQ 
IDQVIilPDSA 
VQRLLKLILQ 
RNGAIHIFRE 
FKGMTSEEXE 
TLLVNELKSK 
FKEIPVTVYT 
TEVIHGEPII 
QEEVTKVTKP 



11 
I 

LLLliIVNPIN 
QKTTVLYECC 
SFTYFAPSNE 
NLGLFINHYP 
RAAAITSDIIj 
CSESIMGGAV 
KQVIEIiAGKQ 
NHILKVKVGL 
IIKPAEKSLH 
ILIRDKNALQ 
ESDIMTTNGV 
TKIITKWEP 
KKYTKI IDGV 
IEGGDGHLFE 



21 

I 

ANNHYDKILA 
PGYMRMEGMK 
AWDNLDSDIR 
NGWTVNCAR 
EALGRDGHFT 
FETLEGNTIE 
QTTFTDLVAQ 
NELYNGQIEiE 
EKIiKQDKRFS 
NIILYHLTPG 
IHWDKLLYP 
KIKVIEGSLQ 
PVEITEKETR 
DEEIKRLLQG 



31 
I 

BSRIRGRDQG 
GCPAVLPIDH 
RGLESNVNVE 
IIHGNQIATN 
LPAPTNEAFE 
IGCDGDSITV 
LGLASALRPD 
TIGGKQLRVF 
TFLSLLEAAD 
VFIGKGFEPG 
ADTPVGNDQL 
PIIKTEGPTL 
EERIITGPEX 
DTPVRKLQAN 



41 
I 

PNVCALQQIL 
VYGTLGIVGA 
LLNALHSHMI 
GWHVIDRVL 
KLPRGVLERF 
NGIKMVNKKD 
GEYTLLAPVN 
VYRTAVCIEN 
LKELLTQPGD 
VTNILKTTQG 
LEILNKLIKY 
TKVKIEGEPE 
KYTRISTGGG 
KKVQGSRRRL 



51 
I 

GTKKKYFSTC 
TTTQRY9DAS 
NKRMLTKDLK 
TQIGTSIQDF 
MGDKVASEAL 
IVTNNGVIHL 
NAFSDDTLSM 
SCMEKGSKQG 
WTLFVPTNDA 
SKIPLKEVND 
IQIKFVRGST 
FRLIKEGETI 
ETEETLKKLL 
REGRSQ 



Seq ID NOi 402 dna sequence 

Nucleic Acid Accession #: NM_002416 

Coding sequence: 40.. 417 



1 
I 

ATCCAATACA 
TTCCTCTTGG 
AAGGGTCGCT 
GACCTTAAAC 
AAGAATGGAG 
AAGTGGGAGA 
AAGAAAGTTC 
ACCACTTCAC 
TTCCAAAGGA 
ATTACTCTGA 
TTGTTAAAGG 
TTAAGGCCAT 
CTCACAACAG 
TATCTGAGGC 
TGAAATTGAG 
CTACAGGCCT 
ACTGGAGATC 
TTCCATCTTG 
AAGTCAGCTC 
CATATGCTCT 
AGTGCTTTCT 
AAATAAACCT 
CACATGGGTG 



11 
I 

GGAGTGACTT 
GCATCATCTT 
GTTCCTGCAT 
AATTTGCCCC 
TTCAAACATG 
AACAGGTCAG 
TGAAAGTTCG 
CAATAAGTAT 
GGATGGCATA 
AATTGTAACT 
CTATGATTGT 
GATTTTAGCA 
CTGCCTGGAA 
ACATGTCAGC 
CTGGACCTCA 
CACACACAAT 
ACCAGTGTGT 
CCCGCTCAGG 
TTCTCCATCC 
GATTTATCTG 
TCTCCCAATT 
TTTTGGACAC 
AACACTCAAT 



21 
I 

GGAACTCCAT 
GCTGGTTCTG 
CAGCACCAAC 
AAGCCCTTCC 
TCTAAACCCA 
CCAAAAGAAA 
AAAATCTCAA 
TCTGTGTTAA 
TAATACAAAG 
AAAGTTAGAA 

ATACCCATGT 
GAGCAGCCCT 
AAGTCCTAAG 
CCAAGCTGCT 
GTGTCTGAGA 
GGCTTTCAGA 
CTGACCACTT 
TACCACAATG 
AGTCAACTCC 
CATCCTCACT 
ACAAATTATC 
GGTTAACTAA 



31 

1 

TCTATCACTA 
ATTGGAGTGC 
CAAGGGACTA 
TGCGAGAAAA 
GATTCAGCAG 
AAGCAAAAGA 
CGTTCTCGTC 
AAATGTTCTA 
GCTTATTAAT 
AGTTGATTTT 
CTACCACCCA 
CTACACAGAT 
AGGCTTCCAC 
CCTGTTAGCA 
GTGGCCATCA 
GATTCATGCT 
GCCTCCTTTC 
TATTTCTTTT 
CAGTGCCTTT 
TTTCTCATCT 
CAGTCCAGCT 
TTAAAACTCC 
TTCTTGGGTG 



41 

I 

TGAAGAAAAG 
AAGGAACCCC 
TCCACCTACA 
TTGAAATCAT 
ATGTGAAGGA 
ATGGGAAAAA 
AAAAGAAGAC 
TTTTAATTAT 
TTGACTAGAA 
AAGAATCCAA 
CCAGTTGAAT 
GTTCACCCAA 
GTACTGCAGC 
TGCTGGTGAG 
ACCTCTGTAT 
GATTGTTATT 
TGGCTTTGGA 
TGTTCCCCTT 
CTTCTCTCCA 
TGTCCCCAAC 
TAGTTCAAGT 
TGTTTCACTT 
TTTATCCTAT 



51 
I 

TGGTGTTCTT 
AGTAGTGAGA 
ATCCTTGAAA 
TGCTACACTG 
ACTGATTAAA 
ACATCAAAAA 
TACATAAGAG 
ACCGCTATCA 
AATTTAAAAC 
ACGTTAAGAA 
TTCATCATGC 
CCACATCCCA 
CTCCAGAGAG 
CCAAGCAGTT 
TTGAATCAGC 
GGGTATCACC 
AGCCATGTGA 
TGCTTCATTC 
GTGCACCTGT 
ACCCCACAGA 
CCTGCCTCTT 
GGTTCAGTAC 
CTCTCCAACC 
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1800 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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AGATTGTCAG CTCCTTGAGG QCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 

CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 

TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 

CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620 

GATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAGAA 1680 

GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACCG 1740 

AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 

CCAACCATAC AAAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860 

TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 

AGTTTTATTG TCCGTTTACT TGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACCA 1980 

TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040 

TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 

GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 

CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 2220 

TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 

AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 

GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 

TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 

AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 

Seq ID NO: 403 Protein sequence 
Protein Accession #: NP_002407 

1 11 21 31 41 51 

I I I I I I 

MKKSGVLFLD GIILLVIjIGV QGTPWRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60 

IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR 120 
QKKTT 

Seq ID NOi 404 DNA sequence 

Nucleic Acid Accession ft: NM_006670 

Coding sequence: 85.. 1347 

1 11 21 31 , 41 51 

I I I I I 

CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTCCCAGCCC 60 

AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCCGGG 120 

GACGGGCGTC TGCGGCTGGC GCGACTAGCG CTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180 
TCTCCCACCT CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG • 240 

TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 300 

CGCACAGTCA AGTGCGTTAA CCGCAATCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 360 

GTGCGCAACC TCTTCCTTAC CGGCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 420 

CGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GCGGCAGCCG CCTGGACGAG 480 

GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTCGACCT CAGCCACAAC 540 

CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TCGGGCAGCA ATGCCAGCGT CTCGGCCCCC 600 

AGTCCCCTTG TG GAACTG AT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660 

CGGAGCTTCG AGGGCATGGT GGTGGCGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720 

CGCCGCTTGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGCGGGATGT GCTGGCCCAA 780 

CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGAGCCT GACCTACGTG 840 

TCCTTCCGCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACAATGC CCTCAAGGTC 900 

CTTCACAATG GCACCCTGGC TGAGTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960 

AACAATCCCT GGGTCTGCGA CTGCCACATG GCAGACATGG TGACCTGGCT CAAGGAAACA 1020 

GAGGTAGTGC AGGGCAAAGA CCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080 

GTCCTCTTGG AACTCAACAG TGCTGACCTG GACTGTGACC CGATTCTTCC CCCATCCCTG 1140 

CAAACCTCTT ATGTCTTCCT GGGTATTGTT TTAGCCCTGA TAGGCGCTAT TTTCCTCCTG 1200 

GTTTTGTATT TGAACCGCAA GGGGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 1260 

AGGGATCACA TGGAAGGGTA TCATTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 1320 

AACCTCAGTT CTAACTCGGA- TGTCTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 1380 

CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440 

TAGATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 1500 

TTTCTCGGTG TGTTCTGTTA ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCCTC 1560 

TTCTTTTTCT TGGAACTCCT CAACACGTAT GGAGGGATTT TTCAGGTTTC AGCATGAACA 1620 

TGGGCTTCTT GCTGTCTGTC TCTCTCTCAG TACAGTTCAA GGTGTAGCAA GTGTACCCAC 1680 

ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT TATTCATAAA 1740 

TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 1800 

CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 1860 

AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 1920 

TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATGAAAATGT ACTGATTTTT 1980 

TTTTAATAAA CTGCATCGAG ATCCAACCGA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 2040 
ATTCTTAAAA GAA 

Seq ID NOx 405 Protein sequence 
Protein Accession #: NP_006661 

1 11 21 31 41 51 

I I I I I I 

MPGGCSRGPA AGDGRLRLAR LAL.VLLGWVS SSSPTSSASS PSSSAPFLAS AVSAQPPIjPD 60 

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLAVLPAGA FARRPPLAEL 120 

AALNLSGSRIi DEVRAGAFEH LPSLRQLDLS HNPLADLSPF AFSGSNASVS APSPLVELIL 180 

NHIVPPEDER QNR8FEGMW AALLAGRALQ GLRRLELASN HFLYLPRDVL AQLPSLRHLD 240 

LSNNSLVSLT YVSPRNLTHL ESLHLEDNAL KVLHNGTLAE LQGLPHIRVP LDNNPWVCDC 300 

HMADMVTWLK ETEWQGKDR LTCAYPEKMR NRVLLELNSA DLDCDPILPP SLQTSYVFLG 360 
IVLALIGAIF LLVLYLNRKG IKKWMHNIRD ACRDHMEGYH YRYEINADPR LTNLSSNSDV 

Seq ID NO: 406 DNA sequence 

Nucleic Acid Accession #: Eos sequence 
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Coding sequence: 1..927 

1 11 21 31 41 51 

■ I t I I I I 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGAGG GGCGTCTGCG GCTGGCGCGA 60 

CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 

TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 180 

CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 

AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 

AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 

AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 

CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 

AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 

CCCTGGGTCT GCGACTGCCA CATGG CAG AC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 

GTG CAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660 

TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720 

TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 

TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 

CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGGATGTCCT CGAGTGA 

Seq ID NO j 407 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 

111! 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH 
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL 
SYVFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD 
SSNSDVLE 

Seq ID NO: 408 DNA sequence 
Nucleic Acid Accession #: NM_Q00095.l 
Coding sequence: 26.. 2299 

1 11 21 31 41. 51 

1 I I I I 1 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 

CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGT TCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 

GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 

CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 

GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 

CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 

CTT CGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 

AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 

CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 

GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCGTGCG ACGACGACGA 1440 

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 

GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AG AT CG T G CA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

GGATGACGAC TATGCGGGCT TCAT CTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 

CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGG CTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 

CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

Seq ID NO: 409 Protein sequence 
Protein Accession #: NP_000086.1 

1 11 21 31 41 51 

I I I I I I 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 



41 51 

I I 

FSSSAPFLAS AVSAQPPLPD 60 

NQLASNHFLY liPRDVLAQLP 120 

NGTLAELQGL PHIRVFLDNN 180 

LELNSADLDC DPILPPSLQT 240 

HMEGYHYRYE INADPRLTNL 300 
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FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV 
NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG 
INBCETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG 
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGPPDEKL 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG 
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD 
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKIDVC 
PEGDAQIDPN WWLNQGREI VQTMNSDPGL AVGYTAFNGV 
FGYQDSSSFY WMWKQMEQT YWQANPFRAV AEPGIQLKAV 
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRPY 
RLGVFCFSQE NIIWANLRYR CNDT I PEDYE THQLRQA 

Seq 10 NO i 410 DNA sequence 

Nucleic Acid Accession #: NM_001S65.1 

Coding sequence : 67 . . 3 63 



GAGACATTCC 
AGCACCATGA 
ATTCAAGGAG 
CCTGTTAATC 
CGTGTTGAGA 
TCGAAGGCCA 
TAAAACCAGA 
CCTCTCCCAT 
GTTACACTAA 
GGTTAATGTT 
GCTCTACTGA 
ACCTTTCCCA 
TCAGAATCTC 
ACTTCATGGA 
CATACAATTC 
CTTATTTAAT 
TTTCAGTGTA 
TTTTAAAAAT 
TTTT CAAAT A 



11 

I 

TCAATTGCTT 
ATCAAACTGC 
TACCTCTCTC 
CAAGGTCTTT 
TCATTGCTAC 
TCAAGAATTT 
GGGGAGCAAA 
CACTTCCCTA 
AAGGTGACCA 
CAT CAT CCT A 
GGTGCTATGT 
TCTTCCAAGG 
AAATAACTAA 
CTTCCACTGC 
CAAACACATA 
GAAAGACTGT 
CATGGAATAA 
ACAGATAGAT 
AAAATGAGGT 



21 
I 

AGACATATTC 
GATT CTGATT 
TAGAACCGTA 
AGAAAAACTT 
AATGAAAAAG 
ACTGAAAGCA 
ATCGATGCAG 
CATGGAGTAT 
ATGATGGTCA 
AGCTATTCAG 
TCTTAGTGGA 
GTACTAAGGA 
AAGGTATGCA 
CATCCTCCCA 
CAGGAAGGTA 
ACAAAGTATA 
CATGTAATTA 
ATATGCTCTG 
ACTCTCCTGG 



31 
I 

TGAGCCTACA 
TGCTGCCTTA 
CGCTGTACCT 
GAAATTATTC 
AAGGGTGAGA 
GTTAGCAAGG 
TGCTTCCAAG 
ATGTCAAGCC 
CCAAATCAGC 
TAATAACTCT 
TGTTCTGACC 
ATCTTTCTGC 
ATCAAATCTG 
AGGGGCCCAA 
GAAATATCTG 
AGT CTTAGAT 
AGTACTATGT 
CATGTTACAT 
AAATATTAAG 



ACIQTESGGR 
PTHQGVGLAF 
CQRGAQRFCP 
RCPEPQCRKD 
EDKWGDACDN 
DGIGDACDNC 
HDGQGDACOD 
PENAEVTLTD 
DFEGTFHVNT 
KSSTGPGEQL 
EGPELVADSN 



41 
I 

GCAGAGGAAC 
TCTTTCTGAC 
GCATCAGCAT 
CTGCAAGCCA 
AGAGATGTCT 
AAATGTCTAA 
GATGGACCAC 
ATAATTGTTC 
TGCTACTACT 
ACCCTGGCAC 
CTGCTTCAAA 
TTTGGGGTTT 
CTTTTTAAAG 
ATTCTTTCAG 
AAAATGTATG 
GTATATATTT 
ATCAATGAGT 
AAGATAAATG 



CGPCPAGFTG 
AKANKQVCTD 
DG5P SECHEH 
NCVTVPNSGQ 
CRSQKNDDQK 
PQKSNPDQAD 
DDDNDGVPDS 
FRAFQTWLD 
VTDDDYAGFI 
RNALWHTGDT 
WLDTTMRGG 



51 
I 

CTCCAGTCTC 
TCTAAGTGGC 
TAGTAATCAA 
ATTTTGTCCA 
GAATCCAGAA 
AAGATCTCCT 
ACAGAGGCTG 
TTAGTTTGCA 
CCTGTAGGAA 
TATAATGTAA 
TATTTCCCTC 
ATCAGAATTC 
AATGCTCTTT 
TGGCTACCTA 
TGTAAGTATT 
CCTATATTGT 
AACAGGAAAA 
TGCTGAATGG 



Seq ID NOt 411 Protein sequence 
Protein Accession #$ NP_001556.1 

1 11 21 31 41 51 

I 1 I I I 1 

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 
E I I ATMKKKG EXRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 412 DNA sequence 

Nucleic Acid Accession #i XM_057014 

Coding sequence : 143 . . 874 



1 
I 

GGGAGGGAGA 
CGCGGCGGAG 
CGCTGCCCGG 
CCGCGGCCTC 
CCCCAAGGGG 
AATGTGCTTA 
CATTCCGGGT 
TCTGAGGGAA 
ATTGAATTAT 
AAATAGTGCT 
CTGTCAGCGT 
AGCTATAATT 
CACTTCTTCT 
CTGGGTTGGC 
TTCTCGCATC 
TTTTTTTATT 
CATCTGAATG 
TTTAAATCTA 
TGGTTAGAAT 
GGTCTTTTGT 
TGTACAATTT 
CAACCTTAAA 



11 
I 

GAGGCGCGCG 
CCAGACGCTG 
CAGCCGGGAG 
CTGCTGCTCC 
AAGCAAAAGG 
CAAGGGCCAG 
ACACCTGGGA 
AGCTTTGAGG 
GGCATAGATC 
CTAAGAGTTT 
TGGTATTTCA 
TATTTGGACC 
GTGGAAGGAC 
ACTTGTTCAG 
ATTATTGAAG 
ATGCCTTGGA 
AAAAGCAAAG 
GCATTATTCA 
ACTTTCTTCA 

GTAAATGTTA 
AAAAAAAAAA 



21 

I 

GGTGAAAGGC 
ACCACGTTCC 
CCATGCGACC 
TGCTGCTGCA 
CGCAGCTCCG 
CAGGAGTGCC 
TCCCAGGTCG 
AGTCCTGGAC 
TTGGGAAAAT 
TGTTCAGTGG 
CATTCAATGG 
AAGGAAGCCC 
TTTGTGAAGG 
ATTACCCAAA 
AACTACCAAA 
ATGGTTCACT 
CTAAATATGT 
TTTTGCTTCA 
TAGTCACATT 
AGTATAGCAT 
AGAATTTTTT 
AAAA 



31 
I 

GCATTGATGC 
TCTCCTCGGT 
CCAGGGCCCC 
GCTGCCCGCG 
GCAGAGGGAG 
TGGTCGAGAC 
GGATGGATTC 
ACCCAACTAC 
TGCGGAGTGT 
CTCACTTCGG 
AGCTGAATGT 
TGAAATGAAT 
AATTGGTGCT 
AGGAGATGCT 
ATAAATGCTT 
TAAATGACAT 
TTACAGACCA 
ATCAAAAGTG 
CTCTCAACCT 
TTTTAAAAAA 
TTATATCTGT 



41 

I 

AGCCTGCGGC 
CTCCTCCGCC 
GCCGCCTCCC 
CCGTCGAGCG 
GTGGTGGACC 
GGGAGCCCTG 
AAAGGAGAAA 
AAGCAGTGTT 
ACATTTACAA 
CTAAAATGCA 
TCAGGACCTC 
TCAACAATTA 
GGATTAGTGG 
TCTACTGGAT 
TAATTTTCAT 
TTTAAATAAG 
AAGTGTGATT 
GTTTCAATAT 
ATAATTTGGA 
ATATAAAAGC 
TAAATAAAAA 



51 

I 

GGCCTCGGAG 
TCCAGCTCCG 
CGCAGCGGCT 
CCTCTGAGAT 
TGTATAATGG 
GGGCCAATGG 
AGGGGGAATG 
CATGGAGTTC 
AGATGCGTTC 
GAAATGCATG 
TTCCCATTGA 
ATATTCATCG 
ATGTTGCTAT 
GGAATTCAGT 
TTGCTACCTC 
TTTATGTATA 
TCACACTGTT 
TTTTTTTAGT 
ATATTGTTGT 
TACCAATCTT 
TTATTTCCAA 



Seq ID NO: 413 Protein sequence 
Protein Accession #: XP_057014 



31 



41 



51 



1 11 21 

I 1 I 1 1 I 

KRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 
IiPX 
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60 
120 

ieo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10BO 



60 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 
180 
240 
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Seq ID HO: 414 DNA sequence 

Nucleic Acid Accession ft : XM_0B4007 

Coding sequence: 138.. 2405 

1 11 21 31 41 51 

I I 1 I 1 I 

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 

CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120 

GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 

AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 

ATCATCTACA ACAGCTTTTC TACCGCTATG GAQAAAATAA TTCTTTGTCA GTTGAAGGGT 360 

TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 

AGCATCACTC AGACCACGAG CATCACT CTG ACCATGATCA TCACTCCCAC CATAATCATG 540 

CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 

GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 

GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 

CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 

TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG • 960 

GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 

CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140 

TCAGTTT CCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 

AATTTCT CCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 

TACACCTTCT- TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380 

GTGCCTATTT TGATTCCACX3 TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 

TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 

AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680 

AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 

GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800 

TTCACCACCA TCATGACTAC CATCATATTC TCCAT CAT CA CCACCACCAA AACCACCATC 1860 

CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 

GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 

ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGG CAACAG 2160 

GAATTTTCAT TGGT CAT TAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATGTAG CCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 

GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 

AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 

TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 

TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 

TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 

TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 

AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 

AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2 940 

CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 

AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 

TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 

TATTGCCAAG TT AT AT AT CA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 

CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 

TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 

GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATAT CAGCTT G 
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MARKLSVILI LTFALSVTNP LHEIiKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60 

FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120 

EHHSDHDHHS HHNHAASGKN KRKAIiCPDED SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180 

SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 

NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 

RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLIiSF 360 

liVAIiAVGTItS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 

TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480 

EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 

HSHFHDTLGQ SDDL I HHHKD YHHILHHHHH QNHHPHSKSQ RYSREELKDA GVATLAWMVI 600 

MGDGLHNFSD GIAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 

NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYFFLQNA GML LGFG I MI* LISIFEHKIV FRINF 
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ATGCCCAAGC GCGCGCACTQ GGGGGCCCTC TCCGTGQTGC TQATCCTGCT TTGGGGCCAT 60 

CCGOGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 12 0 

TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 180 

AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240 

TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 300 

GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 360 

CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 420 

TTTATCCACC CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 480 

AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC 540 

AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600 

AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 660 

TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 720 

TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 780 

AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 840 

GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 900 

GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960 

AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 1020 

GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 1080 

GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 1140 

ATAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTC ATGC T CAGCAAAGAC 1200 

CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACAGGT 1260 

GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 1320 

CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 1380 

CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG 1440 

CCTAGTGGAG CTGTGCAAAG AGATCAGACt GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 1500 

TG CAACGTG A AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 1560 

CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 1620 

AGGATCAAGT CCATGGAGCC AT CTGACTCA GGCTTGTACC AGTGCATTGC TCAAGTGAGG 1680 

GATGAAATGG ACCGCATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 1740 

GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 1800 

GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 1860 

GATTTGG CTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 1920 

GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 1980 

CATTTTACGG TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040 

AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACAT CGTGGA GGATGAAGGG 2100 

GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160 

GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220 

AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280 

GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 2340 

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400 

GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 2460 

TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520 

TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2580 

GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 

AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700 

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760 

TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820 

ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2880 

GAGCCTCCAT TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2 940 

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000 

CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA CATCACAGTT ATTTGAGGAT 3060 

TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGG AC T GACAGACAAC 3120 

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 

GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360 

CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 

ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480 

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 

GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3 660 

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3 720 

CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780 

CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3 840 

ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3 900 

AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960 

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 42 00 

GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 42 60 

GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 

ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 

GCAT CTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 

CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 

TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 

GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 

CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4 980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 
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CCCAGCATTC CTAGTAAOTT TACTGACCGA 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA 
AGAATTCCTC ATTATTCCAA TGGAAGACTC 
CCACAGTTGG GAGTCACCCG GAGACCCCAG 
GAGAGAAAAG TTATTCCAGG TTCCTACAAC 
GACTTTGGCC CTCCGGCACC TCCGTTGTTG 
ACTAACTTAC AGAATATCCC TATGGTCTCT 
TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC 
CCTCCTGCAT CCAAATTCTG GTCTCTTGGG 
CAGACTGTGT CCGTCACCGC TGAGACAGAC 
CCAAAGCCTT TCGTTACTTG GACAAAGGTT 
AGGATACAAC GGTTTGAGGT TCTCAAGAAC 
CAAGATCGAG GCCAGTATAT GTGCACCGCC 
GTCTTGCTTT CGGTCACCGT GCAGCAACCT 
ACTGTCTACC TGGGAGACAC CATTGCAATG 
CAAATTTCCT GGATCTTCCC TGACAGGAGG 
CGCATCACCC TGCACGAAAA CCGGACCCTT 
GGCGTCTATA AGTGCGTGGC CAGCAATGCA 
CACGTGGCGG CACTGCCCCC CGTTATCCAC 
CCGGGGCTCA GCATTCACAT TCACTGCACT 
TGGGTGCTCG GGGACGGTAC CCAGATCCGC 
GTTTTCCCCA ACGGQACGCT CTACATCCGC 
GAGTGCX3TGG CCGCCAACCT GGTAGGCTCC 
CGTGCAGCAG CCAACGCGCG CATCACGGGC 
GGAGGAACCC TCAAGCTGGA CTGCAGCGCC 
AGGCTGCCGT CCAAGAGGAT GATCGACGCG 
TTTGCCAATG GGACCCTGGT GGTGAAATCA 
TGCGTAGCTC GAAATAAGGT TGGTGATGAC 
AAACCGGCCA AGATTGAACA CAAGGAGGAG 
CTGAAAGTGG ACTGTGTGGC CACCGGGCTT 
GACGGGAGTC TGGTGAACTC CTTCATGCAG 
TATGTCGTCT TCAACAATGG GACACTCTAC 
GACTACACCT GCTTTGCTGA AAATCAGGTC 
GTGGTGACAG CGCCCGCCAC CATCCGGAAC 
GGAGACGTGG TCACTGTAGC CTGTGAGGCC 
TTGTCCCCAA CCAACAAGGT GATCCCCACC 
GGCACTCTCC TTATTCAGAA AGCCCAGCGT 
AGGAACAGCG CGGGAGAGGA TAGGAAGACG 
AAGATCAACG GTAACCCCAA CCCCATCACC 
CGGAAACTGA TTGACTGCAA AGCTGAAGGC 
CCCGAGGGTG TGGTTCTGCC AGCTCCATAC 
GGTTCCCTGG ACATCAGGAG TTTGAGGAAG 
CGCAACGAGG GAGGGGAGGC GAGGTTGATC 
AAACCCATCT TCCACGACCC GATCAGCGAG 
AGCCTCAACT GCTCTGCCGC GGGGACCCCG 
GG CACOGATC TGCAGAGTGG A CAG CAGCTG 
CTACACATTA GCGGTCTCTC CTCGGTGGAC 
GCCGCTGGCC ACACGGAGAG GCTGGTCTCC 
AAGCAGTATC ATAACCTGGT CAGCATCATC 
CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC 
GGCCCCCAAA CCCTGGGACG CGTTTCTCTT 
GCCTCGGTGT TTGACAGGGG TACCTATGTA 
GTCACCAGCA TCOCCGTGAT TGTGATCGCC 
CCGGTCATCT ACACCCGGCC CGGGAACACC 
CCCAAAGCTG ACATCACGTG GGAGTTACCG 
GCTCGTCTGT ATGGAAACAG ATTTCTTCAC 
ACACAGAGAG ATGCCGGCTT CTACAAGTGC 
AAAACAACTT ACATCCACGT CTTCTGAAAT 
ACAACAAAGC GGGGTTTGTA AGGGAAGCCA 
GTCACAGTGC ATGGTGGCCT CTGGTGGGTT 
GTTGGGAAAA GGAAGCAATG CAGACACGAG 
CTTTTGTGTT TACATCATGC CAGGGGCTTC 
TTTCTTCTTT TGCAAATGCC ACTCGACTGC 
ACATTCATCA AAAATAAGCC ATAGACATGA 
TCACCTAGTT AACCTGCTGC AGTTTTTACA 
TCTTTCAGTT ATTTCCTCTG TCACTTCAAA 
CAGAGTGACT GATATATATA TATATATTTT 
TTTATATGAA AAAAGAAAAA CATTTCTTCC 
TATATATTTT TTCCTTTCAA ATCAGACGAT 
ATTAAAATTA ATAAATTATT GGTCTTTACA 
ATATAATTTT AAAAAATTTC TCTCCAACCT 
CCTTCTCCAG GAACCCTCCA GTGGGGAAGG 
TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT 
AACTTTACAG AATTGAATCT AGAGTCTTCC 
CTGGCTTGTC CATCTGGTCT AAGGTGGCTG 
CCATGAATAA TACACGACCT GTTATTTCCA 
ATACTGTACA TTTGATAATA AAATAATATT 
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MPKRAHWGAL SWLILLWGH PRVAIACPHP 
NLGPKSIQAL SBTSFAGLTK LELLMIHGNE 
QTLQGLSNLM RLHIDHNKIE PIHPQAFNGL 
RLSTIRHIiYL AENMVRTLPA SMLRNMPLLE 



AGAACTGACC AATTCAATGG TTACTCCAAA 5100 

AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160 

CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 

ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 

AGGATACATT CCCATAGCAC CTTCCATCTG 5340 

CACACTCCGC AGACCACGGG ATCACCCTCA 5400 

TCCACCCAGA GTTCTATCTC CTTTATAACA 5460 

CAG AG CAG CT CAAAGTTCTT TGCAGGAGGA 5520 

GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 

ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640 

TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 

GGTACCTTAG TGATACGGAA GGTTCAAGTA 5760 

AGCAACCTGC ACGGCCTGGA CAGGATGGTG 5820 

CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 

GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940 

GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 

TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 

GCCGGGGCGG ACAGCCTGGC CATCCGCCTG 6120 

CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 

GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 6240 

CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 

AACCTCGCGC CCAAGGACAG CGGGCGCTAT 6360 

GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420 

ACCTCCCCGC GGAGGACGGA CGTCAGGTAC 6480 

TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 

CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 

GTGACGGACA AAGATGCCGG AGATTACCTG 6660 

TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 

AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780 

CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 

TCGGATGACA GCGGTGGACG CACCAAGCGC 6900 

TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 

GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 

AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 

AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 

TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 

TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 

GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 

ACCGTGCGGG AGATAGCAGC CGGGGGCAGT 7380 

ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 7440 

TATGGAAACC GGATCACTGT CCATGGCAAC 7500 

AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560 

GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 

AAGATCACGG CCATGGCGGG CCACACCATC 7680 

ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 

CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 

GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860 

CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 

CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 

TGCAGGATGG AGACGGAGTA CGGCCCTTCG 8160 

TATCCTCCCC GGATCACCAG CGAGCCCACC 8220 

GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340 

CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

GTGGATTCCA GAATGATTGC TTAGGAACTG 8520 

GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8580 

TCAAGTTGAG GTTGATCTTG ATCTACAATT 8640 

AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 

ACAACACCTC ACTACCCCAT TGAAGACGCA 8880 

TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940 

ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 

AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

CCTTCAAATT CAGTCACCAC • TGTTATATTA 9300 

CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
CTCCCAAAAA AAAAA 
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CACYVPSEVH CTFRSIiASVP AGIARHVERI 60 

IPSIPDGALR DLSSLQVFKP SYNKLRVITG 120 

TSLRLLHLEG NLLHQLHPST FSTFTFLDYP 180 

NLYLQGNPWT CDCEMRWFLE WDAKSRG I LK 240 
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CKKDKAYEGG QLCAMCPSPK KLYKHEIHKL KDMTCLKPSI ESPLRQNRSR SIEEEQEQEE 300 

DGGSQLILEK FQLPQWSISL NMTDEHGNMV NLVCDIKKPM DVYKIHLNQT DPPDIDINAT 360 

VALDFECPMT RENYEKLWKL IAYYSEVPVK LHRELMLSKD PRVSYQYRQD ADEEALYYTG 420 

VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIE 490 

PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVIiPDGSI LKAPMDDPDS KFSILSSGWL S40 

RIKSMEPSOS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPCN 600 

ALAIPEAHLS WILPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 660 

HFTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDEENT SRRLLHPKDQ 720 

EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 780 

ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP FPAVSPPSAS PVQTVTSAEE 840 

SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT STPLEEWDD LSEKTEEITS 900 

TEGDLKGTAA PTLISEPYEP SPTLHTLDTV YEKPTHBETA TEGWSAADVG SSPEPTSSEY 960 

EPPLDAVSLA ESEPMQYFDP DLETKSQPDB DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 1020 

STIGEPGVPG QSHIiQGLTDN IHLVKSSLST QDTLLIKKGM KEMSQTLQGG NMLEGDPTHS 1080 

RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TPRQKVAPSS 1140 

TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETF STQPTQAPDI KISSpVESSL 1200 

VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPNK HRYTPSTVSS RASGSKPSPS 1260 

PENKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 1320 

SDGKEIKDDV ATNVDKHKSD ILVTGESITN AIPTSRSLVS TMGEFKEESS PVGFPGTPTW 1380 

NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KELEDVDFTS EFLSSLTVST PFHQEEAGSS 1440 

TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN HTPTAARMKE PASSSPSTIL 1500 

MSLGQTTTTK PALPSPRISQ ASRDSKENVP LMYVGNPETE ATPVNNEGTQ HMSGPNELST 1560 

PSSDRDAFNL STKLELEKQV FGSRSLPRGP DSQRQDGRVH ASHQI/TRVPA KPIIiPTATVR 1620 

LPEMSTQSAS RYPVTSQSPR HWTNKPEITT YPSGALPENK QFTTPRLSST TIPLPLHMSK 1680 

PSIPSKFTDR RTDQFNGYSK VFGNNN I PEA RNPVGKPPSP RIPHYSNGRL PPFTNKTLSP 1740 

PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTPHL DFGPPAPPLL HTPQTTGSPS 1800 

TNLQNIPMVS STQSSISFIT SSVQSSGSFH QSSSKFFAGG PPASKFWSLG EKPQILTKSP 1860 

QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLkN GTLVIRKVQV 1920 

QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QILASHYQDV TVYLGDTIAM ECIAKGTPAP 1980 

QISWIFPDRR VWQTVSPVES RITLHENRTL SIKEASFSDR GVYKCVASNA AGADSLAIRL 2040 

HVAALPPVIH QEKLENISLP PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFLHGNLF 2100 

VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 2160 

GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIKV FANGTLWKS VTDKDAGDYL 2220 

CVARNKVGDD YWLKVDWM KPAKI EHKEE NDHKVFYGGD LKVDCVATGL PNPEISWSLP 2280 

DGSLVNSFMQ SDDSGGRTKR YWFNNGTLY FNEVGMREEG DYTCFAENQV GKDEMRVRVK 2340 

WTAPATIRN KTYLAVQVPY GDWTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 2400 

GTLLIQKAQR SDSGNYTCIiV RNSAGEDRKT VWIHVNVQPP KINGNPNPIT TVREIAAGGS 2460 

RKLIDCKAEG IPTPRVLWAF PEGWLPAPY YGMRITVHGN GSLDIRSLRK SDSVQLVCMA 2520 

RNEGGEARLI VQLTVLEPME KPIFHDPISE K I TAMAGHT I SLNCSAAQTP TPSLVWVLPN 2580 

GTDLQSGQQL QRFYHKAEGM LHISGLSSVD AGAYRCVARN AAGHTERLVS LKVGLKPEAN 2640 

KQYHNLVSII NGETLKLPCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 2700 

ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 2760 

PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGFYKC MAKNILGSDS 2820 
KTTYIHVF 

Seq ID NO i 41B DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence : 1 . . 5001 

1 11 21 31 41 51 

1 1 I I I I 

ATGCCAGGCA CAAAACTAAC CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60 

ACCTCTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCCGGGT TATGTCATCT 120 

CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 180 

TCAAGACAGT ACACCGTGCG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG 240 

CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 300 

GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 360 

ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 420 

AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGCTACCAG AGACTGAGGG GAAAGTGAAA 480 

GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGCCAAATCA 540 

TTTCAGAATA CATTCTTTCA TACGCCCCGG CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 

CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG GAACGCTATC 660 

TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 780 

ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 840 

CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGCGGCTT TTCCTTCATT 900 

ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 

GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 1020 

CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 

CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 

TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 

TCAGACACCX: AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 1320 

GTTGCTCCCG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440 

TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1500 

GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GCGCAAAGCC AGCCTCGCCG 1680 

GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 
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GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 

GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 

GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 

GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520 

TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2580 

ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC 2640 

CCGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 

ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG 2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCCGCGCAC 2880 

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 

AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060 

GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGOGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 

CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360 

GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420 

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 

TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCC 3540 

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600 

GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660 

GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 3720 

CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 37 80 

GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG GGCAGGGGCG ACATGGCACA 3840 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGQTQGQC 3900 

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 

ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 40 BO 

CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 

ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAC 420 0 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 

GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 4320 

TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA CCACCACTGA GCCTTCGACC 4380 

ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA TCAGTTCCTT TCCTGAAGAA 4440 

GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGAOGTA CCTAAATAAA 4500 

GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 

GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620 

AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 

GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 

ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC ACTTGCCCAT TGAGAACCTA 4800 

AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860 

ATCAGCCCTT CGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920 

CCAGGCGGTG AGCTATCTGG AT CCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4980 

GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA GTTGTTCTTT 5040 

GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 5100 

TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160 

GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 

AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 

TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG GTAATCACAG GACCGTCATG 5340 

CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 

CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 

CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 

TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTTTTTTTTG TTTGTTTGTA ATAGCACATC 5580 

CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT CCCAGACTTT TTAGGCATGA 5640 

AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA 5700 

* TGCTACATGC TTTCTGTTTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 5760 

TCAGGTCCCT TTGTATGCAG TAGAAAGGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 5820 

ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC TAGAATAAAT GTATCAAATC 5880 

TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA TGCATATACA TATCCACACT 5940 

TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTGTACT TGTTCACCAA AAAAAAAAAA 6000 



Seq ID NO: 419 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MPGTKLTRTG APADYRVILK TSQEDELDVP DDISVRVMSS QSVLVSWVDP VLEKQKKWA 60 

SRQYTVRYRE KGEIARWDYK QIANRRVLXE NLIPDTVYEP AVRISQGERD GKWSTSVFQR 120 

TPESAPTTAP ENLNVWPVNG KPTWAASWD ALPETEGKVK VCLLDTGLFS VSSFQPSAKS 180 

FQNTPFHTPR LSNHLEQSPS PILETIiLLPW WMVCSLGNAI FSKSGPQTGE AWDLTPKPSL 240 

SLCQQECSCT QKDFSCLAYL IDIQTKQVNK DPQLEGSVFG PCFLFYFLTF MLDIGGPSFI 300 

MCYEDPVSSIi TGNSLKSVAA SKADVQQNTE DNGKPEKPEP SSPSPRAPAS SQHPSVPASP 360 

QGRNAKDLLL DLKNKILANG GAPRKPQLRA KKAEELDLQS TEITGEEELG SREDSPMSPS 420 

DTQDQKRTLR PPSRHGHSW APGRTAVRAR MPALPRREGV DKPGPSLATQ PRPGAPPSAS 480 

ASPAHHASTQ GTSHRPSLPA SLNDNDLVDS DEDERAVGSL HPKGAFAQPR PALSPSRQSP 540 

SSVLRDRSSV HPGAKPASPA RRTPHSGAAE EDSSASAPPS RLSPPHGGSS RLLPTQPHLS 600 

SPLSKGGKDG EDAPATNSNA PSRSTMSSSV SSHI»SSRTQV SEGAEASDGE SHGDGDREDG 660 

GRQAEATAQT LRARPASGHF HLLRHKPFAA NGRSPSRFSI GRGPRLQPSS SPQSTVPSRA 720 

HPRVPSHSDS HPKLSSGIHG D EEDEKPIiP A TWNDHVPSS SRQPISRGWE DLRRSPQRGA 780 

SLHRKEPIPE NPKSTGADTH PQGKYSSLAS KAQDVQQSTD ADTEGHSPKA QPGSTDRHAS 840 

PAR PPAARSQ QHPSVPRRMT PGRAPEQQPP PPVATSQHHP GPQSRDAGRS PSQPRIiSLTQ 900 

AGRPRPTSQG RSHSSSDPYT ASSRGMLPTA LQNQDEDAQG SYDDDSTEVE AQDVRAPAHA 960 
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ARAXEAAASL PKHQQVBSPT GAGAGGDHRS QRGHAASPAR PSRPGGPQSR ARVPSRAAPO 1020 

KSEPPSKRPL SSKSQQSVSA EDEEEEDAGF PKGGKEDLLS SSVPKWPSSS TPRGGKDADG 1080 

SLAKEEREPA IALAPRGGSL. APVKRPLPPP PGSSPRASHV PSRPPPRSAA TVSPVAGTHP 1140 

WPRYTTRAPP GHFSTTPMLS LRQRMMHARF RNPLSRQPAR PSYRQGYNGR PNVEGKVLPG 1200 

SNGKPNGQRI INGPQGTKWV VDLDRGLVLN AEGRYLQDSH GNPLRIKLGG DGRTIVDLEG 1260 

TPWSPDGLP LPGQGRHGTP LANAQDKPIL SLGGKPLVGL EVI KKTTHPP TTTMQPTTTT 1320 

TPLPTTTTPR PTTATTMQPT TTTTPLPTTT PRPTTATTRR TTTRRPTTTV RTTTRTTTTT 1380 

TPKPTTPIPT CPPGTLBRHD DDGNLIMSSK GIPECYAEED EFSGLETDTA VPTEEAYVIY 1440 

DEDYEFETSR PPTTTEPSTT ATTPRVI PEE GAISSFPEEE FDLAGRKRFV APYVTYLNKD 1500 

PSAPCSLTDA LDHFQVDSLD EIIPNDLKKS DLPPQHAPRN ITWAVEGCH SFVIVDWDKA 1560 

TPGDLVTGYL VYSASYEDFI RNKFSTQASS VTHLPIENLK PNTRYYFKVQ AQNPHGYGPI 1620 
SPSVSFVTES DNPLLWRPP GGELSGSHSL SNMIPATRTA MDGNM 

Seq ID NO i 420 DNA sequence 
Nucleic Acid Accession #: NM_022743 
Coding sequence: 128.. 1237 

1 11 21 31 41 51 

I I I I I I 

GTGGATTTTA GAGATACCTC CCCTCCTTCT GCTCAGCTGC CTTGCAGTAA TTAAACTCTT 60 

TCTCTGCTGC AACACCCCTA CTGTTCTCCG TGTATTGGCT TTTCTGGGCA GCAGGAAGGA 120 

AAAG CTGATG CGATGCTCTC AGTGCCGCGT CGCCAAATAC TGTAGTGCTA AGTGTCAGAA 180 

AAAAGCTTGG CCAGACCACA AGOGGGAATG CAAATGCCTT AAAAGCTGCA AACCCAGATA 240 

TCCTCCAGAC TCCGTTCGAC TTCTTGGCAG AGTTGTCTTC AAACTTATGG ATGGAGCACC 300 

TTCAGAATCA GAGAAGCTTT ACTCATTTTA TGATCTGGAG TCAAATATTA ACAAACTGAC . 360 

TGAAGATAAG AAAGAGGGCC TCAGGCAACT CGTAATGACA TTTCAACATT TCATGAGAGA 420 

AGAAATACAG GATGCCTCTC AGCTGCCACC TGCCTTTGAC CTTTTTGAAG CCTTTGCAAA 480 

AGTGATCTGC AACTCTTTCA CCATCTGTAA TGCGGAGATG CAGGAAGTTG GTGTTGGCCT 540 

ATATCCCAGT ATCTCTTTGC TCAATCACAG CTGTGACCCC AACTGTTCGA TTGTGTTCAA 600 

TGGGCCCCAC CTCTTACTGC GAGCAGTCCG AGACATCGAG GTGGGAGAGG AGCTCACCAT 660 

CTGCTACCTG GATATGCTGA TGACCAGTGA GGAGCGCCGG AAGCAGCTGA GGGACCAGTA 720 

CTGCTTTGAA TGTGACTGTT TCCGTTGCCA AACCCAGGAC AAGGATGCTG ATATGCTAAC 780 

TGGTGATGAG CAAGTATGGA AGGAAGTTCA AGAAT CCCTG AAAAAAATTG AAGAACTGAA 840 

GGCACACTGG AAGTGGGAGC AGGTTCTGGC CATGTGCCAG GCGATCATAA GCAGCAATTC 900 

TGAACGGCTT CCCGATATCA ACATCTACCA GCTGAAGGTG CTCGACTGCG CCATGGATGC 960 

CTGCATCAAC CTCGGCCTGT TGGAGGAAGC CTTGTTCTAT GGTACTCGGA CCATGGAGCC 1020 

ATACAGGATT TTTTTCCCAG GAAGCCATCC CGTCAGAGGG GTTCAAGTGA TGAAAGTTGG 1080 

CAAACTGCAG CTACATCAAG GCATGTTTCC CCAAGCAATG AAGAATCTGA GACTGGCTTT 1140 

TGATATTATG AGAGTGACAC ATGGCAGAGA ACACAGCCTG ATTGAAGATT TGATTCTACT 1200 

TTTAGAAGAA TGCGACGCCA ACATCAGAGC ATCCTAAGGG AACGCAGTCA GAGGGAAATA 1260 

CGGCGTGTGT CTTTGTTGAA TGCCTTATTG AGGTCACACA CTCTATGCTT TGTTAGCTGT 1320 

GTGAACCTCT CTTATTGGAA ATTCTGTTCC GTGTTTGTGT AGGTAAATAA AGGCAGACAT 13 80 

GGTTTGCAAA CCACAAGAAT CATTAGTTGT AGAGAAGCAC GATTATAATA AATTCAAAAC 1440 
ATTTGGTTGA GGATGCCAAA AAAAAAAAAA AAAAAAA 

Seq ID NO i 421 Protein sequence 
Protein Accession ft: NP_073580 

1 n 21 31 41 51 

I 1 I I 1 I 

MRCSQCRVAK YCSAKCQKKA WPDHKRECKC LKSCKPRYPP DSVRLLGRW FKLMDGAPSE 60 

SEKLYSFYDL ESNINKXtTED KKEGLRQLVM TFQHFMREEI QDASQLPPAF DLFEAFAKVI 120 

CNSFTICNAE MQEVGVGLYP SISLLNHSCD PNCSIVFNGP HLLLRAVRDI EVGEELTICY 180 

LDMLMT9EER RKQLRDQYCF ECDCFRCQTQ DKDADMLTGD EQVWKEVQES LKKIEELKAH 240 

WKWEQVLAMC QAIISSNSER LPD INI YQUC VLDCAMDACI NLGIiLEEALF YGTRTMEPYR 300 

IPPPGSHPVR GVQVMKVGKL QLHQGMPPQA MKNLRLAFDI MRVTHGREHS LIEDLILLIiE 360 
ECDANIRAS 

Seq ID NO: 422 DNA sequence 

Nucleic Acid Accession #t NM_003014.2 

Coding sequencer 238.. 648 

1 11 21 31 41 51 

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 60 

CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 120 

AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 180 

GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCATG 240 

TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGQ CGTGCGCGGC 300 

GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 360 

ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 420 

GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 480 

GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 540 

CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 600 

AGCCTGGCCT GCGACGAGCT GCCTGTCTAT GACCGTGGCG TGTGCATTTC GCCTGAAGCC 660 

ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 720 

CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 780 

AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840 

AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 900 

GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 960 

TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 1020 

CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 1080 

AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 1140 

AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200 

GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 1260 

AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 1320 

GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 1380 



343 
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WO 02/086443 

ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG 
GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG 
GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG 
CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC 
AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT 
TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA 
AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT GTTTATGGTC 
TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG 
TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA 
AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT G T CTGGATTC 
GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC 
ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA 
ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT 
AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTGT 
TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG 
AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATACT 
AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC 
ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT 
AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT 
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT 
GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT 
CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA 
TCTCATTTCT AACAGCTGTG TT AT ATT CCA TAGTATGCAT TACTCAACAA 
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC 

Seq ID NOt 423 Protein sequence 
Protein Accesaion ft: NP_0 03 005.1. 



PCT/US02/12476 



1 

I 

MFLSILVALC 
YEELVDVNCS 
ESLACDELPV 
KKVKPTLATY 
SSCQCPHILP 
KKKTAGRTSR 



11 
I 

LWLHlALrGVR 
AVLRFFFCAM 
YDRGVCISPE 
LSKNYSYVIH 
HQDVLIMCYE 
SNPPKPKGKP 



21 
I 

GAPCEAVRIP 
YAPICTLEFL 
AIVTDLPEDV 
AKIKAVQRSG 
WR SRMMIrLEN 
PAPKPASPKK 



31 

I 

MCRHMPWNIT 
HDPIKPCKSV 
KWIDITPDMM 
CNEVTTWDV 
CLVEXWRDQL 
NIKTRSAQKR 



Seq ID NOt 424 DNA sequence 
Nucleic Acid Accession ft: BC010423 
Coding sequence: 248.. 1780 



1 

I 

CACAGCGTGG 
AGCTACGGCT 
CAAGTGCGAG 
TCTGCAGCCG 
TTCAACCATG 
GCTGCTACTG 
CGTGGTAACT 
CGGCGAGCAA 
ACTAGCGCTA 
GGAGCAGCCG 
GCAGGCGGAT 
GGCGCGGCTG 
ACTAGAAGAG 
CCCCAGCGTG 
CTCCCGCTCT 
GCAGCCACTG 
CATCCTCCAC 
GTGGCACATT 
CTCATACAAC 
CACTTTGGGC 
CAATGAGTTC 
CTCTGGGAAG 
ACTCTTGTTC 
GGCCCAGCAG 
CCGGAGGCTG 
GAGAGCCGAG 
AGAGCCCGAG 
TGAACTGCTG 
CAAACAGGCC 
CAATGGCATC 
CTAGGCCTGG 
ACACCCCCAT 
AACCCTTCTG 
CACTGTGTGT 
TGACTGTCCG 
AAGTGAACTG 
GTTTGGCGTG 
CAGACCCCAG 
CAGACCCAGG 
TCTCCTACCA 
GAGGCTTGAA 
ACATATTTTC 
ACTTTTAATT 
TTTTATTTTT 



11 
I 

GAAGCAGCTC 
GGGTGTGTAG 
AGGCAAGAAC 
GCTCCCAGGG 
CCCCTGTCCC 
CTGGCATCAT 
GTGGTGCTGG 
GTGGGGCAAG 
CTGCACTCCA 
CCGCCCCCAC 
GAGGGCGAGT 
CGGCTCCGAG 
GGCCAGGGCC 
ACCTGGGACA 
GCTGCCGTCA 
ACTTGTGTGG 
GTGTCCTTCC 
GGCAGAGAAG 
TGGACACGGC 
TTTCCCCCAC 
TCCTCAAGGG 
CAGGTGGACC 
TGCCTTCTGG 
ATGACCCAGA 
CATTCCCATC 
GGCCACCCTG 
GGCCGCAGTT 
TCTCCAGGCT 
ATGAACCATT 
TACATCAATG 
CTCCTTCTGT 
TTCTTGCGGA 
TTCATCGGGA 
GTGCATGTGT 
TGGAGGGGTG 
TGGTGTATGT 
TGTGTCATGT 
AGCAGTATTA 
TGTGCGGGCA 
CTTCGGAGCC 
CTGTTACAGA 
TGTAAATATA 
TTTTTCTTTT 
ATTTTTTTTT 



21 
I 

TGGGGGAGCT 
AACGGGGCCG 
TCTGCAGCTT 
AGATCTCGGT 
TGGGAGCCGA 
TTACAGGCCG 
GCCAGGACGC 
TGG CATGGGC 
AATACGGGCT 
GCAACCCCCT 
ACGAGTGCCG 
TGCTGGTGCC 
TGACCCTGGC 
CGGAGGTCAA 
CCTCAGAGTT 
TGTCCCATCC 
TTGCTGAGGC 
GAGCTATGCT 
TGGATGGGCC 
TGACCACTGA 
ATTCTCAGGT 
TAGTGTCAGC 
TGGTGGTGGT 
AATATGAGGA 
ACACGGACCC 
ATAGTCTCAA 
ACTCCACGCT 
CTGGGCGGGC 
TTGTTCAGGA 
GGCGGGGACA 
TGACATGGGA 
AGATGCTCCC 
GGGCTCCACC 
GCCTGTGTGA 
ACTGTGTCCG 
GCCACGGGAT 
GGCTGTGTGT 
ATGATGCAGA 
TAGCTGGAGC 
ATGGGGGCAA 
AGCCCTCTGC 
CATGCGCCGG 
TTTTTTCTTG 
AGAGTTTGAG 



31 
I 

CGGAGCTCCC 
GGGCTGGGGC 
CCTGCCTTCT 
GGAACTTCAG 
GATGTGGGGG 
OTGCCCCGCO 
AAAACTGCCC 
TCGGGTGGAC 
TCATGTGAGC 
GGACGGCTCA 
GGTCAGCACC 
TCCCCTGCCC 
AGCCTCCTGC 
AGGCACAACG 
CCACTTGGTG 
TGGCCTGCTC 
CTCTGTGAGG 
CAAGTGCCTG 
TCTGCCCAGT 
GCACAGCGGC 
CACTGTGGAT 
CTCGGTGGTG 
GGTGCTCATG 
GGAGCTGACC 
CAGGAGCCAG 
GGACAACAGT 
GACCACGGTG 
CGAGGAGGAG 
GAATGGGACC 
CCTGGTCTGA 
GATTTTAGCT 
CATCCCACTG 
AATTGAGTCT 
GTGTTGACTG 
TGGTGTGTAT 
TTGAGTGGTT 
GACCTCTGCC 
GGTTGGAGGA 
TGGAATCTGC 
GTGTGAAGCA 
CCTCTGGTGG 
GAGCTTCTTG 
CCCTTTCCAT 
TCCAGCCTGG 



41 
I 

RMPNHLHHST 
CQRARDDCEP 
VQERPLDVDC 
KEIFKSSSPI 
SKRSIQWEER 
TNPKRV 



41 
I 

GATCACGGCT 
TGGGTCCCCT 
GGGTCAGTTC 
AAACGCTGGG 
CCTGAGGCCT 
GGTGAGCTGG 
TGCTTCTACC 
GCGGGCGAAG 
CCGGCTTACG 
GTGCTCCTGC 
TTCCCCGCCG 
TCACTGAATC 
ACAGCTGAGG 
TCCAGCCGTT 
CCTAGCCGCA 
CAGGACCAAA 
GGCCTTGAAG 
AGTGAAGGGC 
GGGGTACGAG 
ATCTACGTCT 
GTTCTTGACC 
GTGGTGGGTG 
TCCCGATACC 
CTGACCAGGG 
CCGGAGGAGA 
AGCTGCTCTG 
AGGGAGATAG 
GAAGATCAGG 
CTACGGGCCA 
CCCAGGCCTG 
CATCTTGGGG 
ACTGCTTGAC 
CTCCCACCAT 
ACTGTGTGTG 
TATGCTGTCA 
GCGTGGGCAA 
TGAAAAAGCA 
GAGAGGTGGA 
CTCCGGTGTG 
GCCAGTCCCT 
CCTCTGGGCC 
CAGGAATACT 
TAGTTGTATT 
ACGATATAGC 



CTATGCTTCA 


1440 


TACAGAAGGT 


1500 


TGTGCATACT 


1560 


ATTGTAAAAT 


1620 


TACCTTTTGA 


1680 


TGTTTTTAAG 


1740 


TGCAGAAGGA 


1800 


AAAATTATAA 


1860 


AAACAAAAAT 


1920 


CTGTTTTTTG 


1980 


AGCACCAGAA 


2040 


ATGCTCAAGT 


2100 


CCACCCTGAG 


2160 


TTTTCTTCAT 


2220 


TACAGAGAGG 


2280 


TTAATCAGAA 


2340 


AATTGGAGGC 


2400 


GTATTTTATA 


2460 


CCTACTCTCC 


2520 


GTTTGCTTAT 


2580 


GTTTCATAAC 


2640 


GCAGAAACCA 


2700 


ACTGTTGTGC 


2760 


ACCGGAATTC 




51 




1 

QENAILAIBQ 


60 


LMKMYNHSWP 


120 


KRLSPDRCKC 


180 


PRTQVPLITN 


240 


LQEQRRTVQD 


300 


51 




| 

TCTTGGGGGT 


60 


AGTGGAGACC 


120 


CTTATTCAAG 


180 


CAGTCTGCCT 


240 


GGCTGCTGCT 


300 


AGACCTCAGA 


360 


GAGGGGACTC 


420 


GCGCCCAGGA 


480 


AGGGCCGCGT 


540 


GCAACGCAGT 


600 


GCAGCTTCCA 


660 


CTGGTCCAGC 


720 


GCAGCCCAGC 


780 


CCTTCAAGCA 


840 


GCATGAATGG 


900 


GGATCACCCA 


960 


ACCAAAATCT 


1020 


AGCCCCCTCC 


1080 


TGGATGGGGA 


1140 


GCCATGTCAG 


1200 


CCCAGGAAGA 


1260 


TGATCGCCGC 


1320 


ATCGGCGCAA 


1380 


AGAACTCCAT 


1440 


GTGTAGGGCT 


1500 




1560 


AAACACAGAC 


1620* 


ATGAAGGCAT 


1680 


AGCCCACGGG 


1740 


CCTCCCTTCC 


1800 


GCCTCCTTAA 


1660 


CTTTACCTCC 


1920 


GCATGCAGGT 


1980 


TGTGGAGGGG 


2040 


TATCAGAGTC 


2100 


CACTGTCAGG 


2160 


GGTATTTTCT 


2220 


GACTGTGGCT 


2280 


AGGGAACCTG 


2340 


GGGTCAGCCA 


2400 


TGCTGCATGT 


2460 


GCTCCGAATC 


2520 


TTTTATTTAT 


2580 


CAGACCCTGT 


2640 



J 

344 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 
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CTGTAAAAAA ACCAAAACCC AAAAAAAAAA AAAAAAAAAA 

Seq ID NOt 425 Protein sequence 
Protein Accession #: AAH10423 



PCT/US02/12476 



1 

I 

MPLSLGAEMW 
QVGQVAWARV 
DEGBYECRVS 
VTWDTEVKGT 
HVSFLAEASV 
GFPPLTTEHS 
FCLLVWWL 
EGHPDSLKDN 
AMNHPVQENG 



11 

I 

GPEAWLLLLL 
DAGEGAQELA 
TFPAGSFQAR 
TSSRSFKHSR 
RGLEDQNLWH 
GIYVCHVSNE 
MSRYKRRKAQ 
SSCSVMSEEP 
TLRAKPTGNG 



21 
I 

LLASFTGRCP 
LLHSKYGLHV 
LRLRVLVPPL 
SAAVTSEFHL 
IGRBGAMLKC 
FSSRDSQVTV 
QMTQKYEEEL 
EGRSYSTLTT 
IYINGRGHLV 



31 

I 

AGELETSDW 
S PAY EGRVEQ 
PSLNPGPALE 
VPSRSMNGQP 
LSEGQPPPSY 
DVLDPQEDSG 
TLTRENSIRR 
VREIETQTEL 



41 

I 

TWLGQDAKL 
PPPPRNPLDG 
EGQGLTLAAS 
LTCWSHPGL 
NWTRLDGPLP 
KQVDLVSASV 
LHSHHTDPRS 
LSPGSGRAEE 



51 
1 

PCFYRGDSGE 
SVLLRNAVQA 
CTAEGSPAPS 
LQDQRITHIL 
SGVRVDGDTL 
WVGVIAALL 
QPEESVGLRA 



Seq ID NO i 426 DNA sequence 

Nucleic Acid Accession #: NM_003474.2 

Coding sequence: 37.. 3036 



1 
I 

CACTAACGCT 
TCAAGGCTGG 
CTTTTTTAAA 
CCGGAGCTGA 
GCCCGCGTGG 
GCGACGATGG 
GCCGGTGCTC 
GCTGATGAAG 
TTCGACTCCA 
CTGATCATAA 
TATCTGCAAG 
TGTTACTACC 
TCTGGTCTCA 
AGTGCAACCA 
TGTGGATCAC 
CAGACATGGG 
GTGATCGTGG 
CAGCGATTAA 
ATCGTGTTGG 
CCATTCACCA 
TCCCATGACA 
GCCCCAATCA 
GACAATCCCC 
AATCATGACA 
ATCATGAACG 
GACTTGGAGA 
AGGGAGTCTT 
GACTGTGGGG 
AAGCCGGACG 
GGAACAGCGT 
AGCCCTCACT 
GGCTACTGCT 
CCAGGTGCTA 
TATGGCAACT 
AAATGTGGAA 
GTTTCCATAG 
CACGTGTACT 
GCAGATGGAA 
GAGTGTGCAA 
GAGGCCCACT 
GGCCCCATCC 
TGTCTTCTTG 
TTTACAAATA 
CGTGGCTTCC 
CCGCCAGATT 
GACATCAGCA 
CTTCCTCCCC 
AAGCCTGCAC 
CCTGCAGATC 
CAATGGGAGA 
GTGCCCAGAT 
TGAAGACAGA 
GGATTTTTTT 
GCTGTGCTGT 
GCAGAATGTT 
CCATGGCAGG 
ATGGGATTCT 
CCAACTACCC 
CtfCAGTTGAT 
TGTGTTTGGC 
ACACCTGGGA 
AGGAATCTTA 
TGACCCTGAG 
GGTGCTTGAT 



11 
I 

CTTCCTAGTC 
CTTGTGCCAG 
AATGAAAGGC 
CTCGCCGAGG 
GATGGTGCAG 
CAGCGCGCCC 
TGCTCGCGCC 
TTGTCAGTGC 
AGAATCATCC 
ATCTGGAAAG 
ACGGTACTGA 
ATGGACATGT 
GGGGACTTAT 
ACAGATACAA 
ATCACAACAC 
CAAGAAGGCA 
CAGACAACCG 
TAGAGATTGC 
TAGGCGTGGA 
GCCTCCATGA 
ATGCGCAGCT 
TGAGCATGTG 
TTGGTGGAGC 
CACTGGACAG 
CTTCCACCGG 
CCAGCCTGGA 
TCGGGGGCCA 
AGCCAGAGGA 
CTGTGTGCGC 
GCAGGGACTC 
GCCCAGCCAA 
ACAATGGCAT 
AACCTGCCCC 
GTGGCAAAGT 
AAATCCAGTG 
AAACAAACAT 
TGGGCGATGA 
AAATCTGCCT 
TGCAGTGCCA 
GGGCACCTCC 
GGCAAGCAGA 
CTGCCGGATT 
AGAAGACCAC 
AACCCTGTCA 
CCTACCCACC 
GACCCCTCAA 
TCCACCGGGC 
TTAGGCAGGC 
CTCTGGCCAG 
CTGGGCTCCG 
CCACCCACAC 
AGTTTGCACT 
TAATGTTTAA 
GCTATGGTGC 
GATTACAGTG 
AAGGCTTGTT 
GGACAGGATG 
CCAGCTGTGC 
TTTCTGGATT 
TTTCAGGGAG 
GAAATCTGGC 
AGGTGTAGCC 
CTGACCAGCC 
AGAAATGCCA 



21 

I 

CCCGGGCCAA 
AACGGCGCG C 
TAGAAGAGCT 
CAGGAAATCC 
CGCTCGCCGC 
GCTGCCCGTG 
CTGCGAGGCC 
CTCTGTTCGG 
AGAAGTGCTG 
AAATGAAGGT 
TGTCTCCCTC 
ACGGGGATAT 
TGTGTTTGAA 
ACTCTTCCCA 
ACCAAACCTC 
TAAAAGAGAG 
AGAGTTTCAG 
TAATCACGTT 
AGTGTGGAAT 
ATTTCTGGAC 
TGTCAGTGGG 
CACGGCAGAC 
CGTGACCCTG 
GGGCTGTAGC 
GTACCCATTT 
GAAAGGAATG 
GAAGTGTGGG 
ATGTATGAAT 
ACATGGGCTG 
CAGCAACTCC 
CGTGTACCTG 
CTGCCAGACT 
TGGGATCTGC 
CTCGAAGAGT 
TCAAGGAGGT 
CCCCCTGCAG 
CATGCCGGAC 
GAATCGTCAA 
CGGCAGAGGG 
CTTCTGTGAC 
TAACCAAGGT 
TGTGGTTTAT 
CATTGAAAAA 
GGCTCACCTC 
GAAGGACAAT 
CGGCCTGAAT 
CCCACGTGCA 
CCAGGGGACG 
AACAACTCGG 
CCTGGCACCC 
CGCCTATATT 
ATCTTTCAGC 
AACATCATTA 
TCTGTCTACT 
CAGTGCGCTG 
GTGCTTTTAG 
TGTTTGCTTT 
TTATGGTACC 
CCCCATCTCA 
GCCCTGTGCC 
TTCTGGCCAG 
ACACCAGGAT 
GTGAGCATGT 
AGCACTTCTT 



31 
I 

CTOGGACAGT 
GCGCGACGCA 
CAGCGGCGGC 
CTCCGGTCGC 
CGGGCCCGAG 

CGAGGGGTGA 
AGTGGGGACC 
AATATTCGAC 
CTCATTGCCA 
GCTCGAAATT 
TCTGATTCAQ 
AATGAAAGCT 
GCGAAGAAGC 
GCTGCAAAGA 
ACCCTCAAGG 
AGGCAAGGAA 
GACAAGTTTT 
GACATGGACA 
TGGAGGAAGA 
GTTTATTTCC 
CAGTCTGGGG 
GCACATGAGC 
TGTCAAATGG 
CCCATGGTGT 
GGGGTGTGCC 
AACAGATTTG 
CGCTGCTGCA 
TGCTGTGAAG 
TGTGACCTCC 
CACGATGGGC 
CACGAGCAGC 
TTTGAGAGAG 
TCCTTTGCCA 
GCCAGCCGGC 
CAAGGAGGCC 
CCAGGGCTTG 
TGTCAAAATA 
GTGTGCAACA 
AAGTTTGGCT 
TTAACCATAG 
CTCAAAAGGA 
CTAAGGTGTG 
GGCCACCTTG 
CCCAGGAGAT 
GTCCCTCAGC 
CCTAGCGTCC 
TGTAAGCCAA 
CTCACTCATG 
CTCAGACCTG 
AAGTGAGAAG 
TCCAGTTGGA 
CTATAAGAAC 
TGCACAGGTA 
TAGTAGGCAT 
TATTTTAGTG 
CTGATCAAGG 
AGATGCAGCT 
GGCCAGAGCC 
CCTTGACAAC 
GAAGCTTTGG 
AGAGACTGGA 
TTGGAAGGGG 
TTTCTCGCTG 



41 

1 

TTGCTCATTT 
CGCACACACA 
GCGGGCCGTG 
GACGCCCGGC 
AGCTGCTGCA 
GCGCCCTCCT 
GCTTATGGAA 
TCTGGATCCC 
TACAACGGGA 
GCAGTTTCAC 
ACACGGTAAT 
CAGTCAGTCT 
ATGTCTTAGA 
TGAAAAGCGT 
ATGTGTTTCC 
CAACTAAGTA 
AAGATCTGGA 
ACAGACCACT 
AATGCTCTGT 
TGAAGCTTCT 
AAGGGACCAC 
GAATTGTCAT 
TGGGCCACAA 
CGGTTGAGAA 
TCAGCAGTTG 
TGTTTAACCT 
TGGAAGAAGG 
ATGCCACCAC 
ACTGCCAGCT 
CAGAGTTCTG 
ACTCATGTCA 
AGTGTGTCAC 
TCAATTCTGC 
AATGCGAGAT 
CAGTCATTGG 
GGATTCTGTG 
TGCTTGCAGG 
TTAGTGTCTT 
ACAGGAAGAA 
TTGGAGGAAG 
GAATTCTGGT 
AGACCTTGAT 
TGCGCCCTTC 
GAAAAGGCCT 
TGCTGCAGTG 
CCCAGTCAAC 
CTGCCAGACC 
ACCCCCCTCA 
CCTTGGCCAG 
CTCCACAATA 
CCGACACCTT 

TTTGAGCTAC 
CTTGTAAATT 
TTTTACCATC 
AACTTGAAAT 
CCTTATTGGA 
CAAGAGATCC 
AAGGGGCTTC 
TGGCAGGCAG 
TGAGAACCTG 
ACACTAGACA 
TCTGTAGTGT 
TCCTTTCTAG 



51 
I 

ATTGCAACGG 
CGGGGGGAAA 
CGCGAGGGCT 
CCCGCTCGGC 
CTGAAGGCCG 
GCTCGCCCTG 
CGAAGGAAGA 
AGTGAAGAGC 
AAGCAAAGAA 
GGAAACCCAC 
TCTGGGTCAC 
CAGCACGTGT 
ACCAATGAAA 
CCGGGGATCA 
ACCACCCTCT 
TGTGGAGCTG 
AAAAGTTAAG 
GAACATTCGG 
AAGTCAGGAC 
ACCTCGCAAA 
CATCGGCATG 
GGACCATTCA 
TTTCGGGATG 
AGGAGGCTGC 
CAGCAGGAAG 
GCCGGAAGTC 
AGAGGAGTGT 
CTGTACCCTG 
GAAGCCTGCA 
CACAGGGGCC 
GGATGTGGAC 
ACTCTGGGGA 
AGGTGATCCT 
GAGAGATGCT 
TACCAATGCC 
CCGGGGGACC 
CACAAAGTGT 
TGGGGTTCAC 
CTGCCACTGC 
CACAGACAGC 
GACCATCCTG 
ACGACTGCTG 
CCGGCCACCC 
GATGAGGAAG 
TCAGAATGTT 
TCAGCGAGTG 
CCTGCCAGCC 
GAAGCCTCTG 
GACCCCAGGA 
TCCACACCAA 
TTTTCAACAG 
CCAACTTTTA 
TGCCGTCAGT 
ATTAATTTAT 
ACTGAGTTTT 
ATCCTGCTTG 
AAGCAGTCCC 
CAAGTAGAAT 
AGGTCCAGGC 
GCTCCCAGGG 
GGTTGCAGAC 
AGCCAGAACT 
CACTCAAGGC 
AGCACTGCCA 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 
13B0 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2B80 
2940 
3000 
3060 
3120 
31B0 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 



345 



WO 02/086443 

CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 

CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 

CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 

TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 

AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 

CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 

ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 

CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 

CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 

CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 

TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 

TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 

TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 

TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 

AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 

TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 

AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 
ACCAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 427 Protein sequence 
Protein Accession #; NP_003465 

1 11 21 31 41 51 

| ! I I I I 

MAARPLPVSP ARALLLAIiAG ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWIPVKSFD 60 

SKNHPEVLNI RLQRESKELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120 

YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180 

SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 

LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 

DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 

DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 

ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 

AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 

IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVUVGTKCAD GKICLNRQCQ NISVFGVHEC 660 

AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 

LAAGFWYIiK RKTLIRLLFT NKKTTIEKIiR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 

DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAP3 VPARPLPAKP 840 

AXjRQAQGTCK PNPPQKPLPA DP LARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 

Seq ID NO i 428 DNA sequence 
Nucleic Acid Accession #> NM_003714 
Coding sequence i 13 5.. 1043 

1 11 21 31 41 51 

I I I I I 

GAGGAGGAGG GAAAAGGCGA GCAAAAAGGA AGAGTGGGAG GAGGAGGGGA AGCGGCGAAG 60 

GAGGAAGAGG AGGAGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGAGGT 120 

TAATACCAAG AACCATGTGT GCCGAGCGGC TGGGCCAGTT CATGACCCTG GCTTTGGTGT 180 

TGGCCACCTT TGACCCGGCG CGGGGGACCG ACGCCACCAA CCCACCCGAG GGTCCCCAAG 240 

ACAGGAGCTC CCAGCAGAAA GGCCGCCTGT CCCTGCAGAA TACAGCGGAG ATCCAGCACT 300 

GTTTGGTCAA CGCTGGCGAT GTGGGGTGTG GCGTGTTTGA ATGTTTCGAG AACAACTCTT 360 

GTGAGATTCG GGGCTTACAT GGGATTTGCA TGACTTTTCT GCACAACGCT GGAAAATTTG 420 

ATGCCCAGGG CAAGTCATTC ATCAAAGACG CCTTGAAATG TAAGGCCCAC GCTCTGCGGC 480 

ACAGGTTCGG CTGCATAAGC CGGAAGTGCC CGG CCATCAG GGAAATGGTG TCCCAGTTGC 540 

AGCGGGAATG CTACCTCAAG CACGACCTGT GCGCGGCTGC CCAGGAGAAC ACCCGGGTGA 600 

TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACGA ACCCTACGTG G AC CT CGTGA 660 

ACTTGCTGCT GACCTGTGGG GAGGAGGTGA AGGAGGCCAT CACCCACAGC GTGCAGGTTC 720 

AGTGTGAGCA GAACTGGGGA AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCGGCCATCC 780 

AGAAGCCTCC CACGGCGCCC CCCGAGCGCC AGCCCCAGGT GGACAGAACC AAGCTCTCCA 840 

GGGCCCACCA CGGGGAAGCA GGACATCACC TCCCAGAGCC CAGCAGTAGG GAGACTGGCC 900 

GAGGTGCCAA GGGTGAGCGA GGTAGCAAGA GCCACCCAAA CGCCCATGCC CGAGGCAGAG 960 

TCGGGGGCCT TGGGGCTCAG GGACCTTCCG GAAGCAGCGA GTGGGAAGAC GAACAGTCTG 1020 

AGTATTCTGA TATCCGGAGG TGAAATGAAA GGCCTGGCCA CGAAATCTTT CCTCCACGCC 1080 

GTCCATTTTC TTATCTATGG ACATTCCAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140 

ACGCAGGATT CTGTGGGGAC TGTGGACTTC ATCGAGGTGT GTGTTCGCGG AACGGACAGG 1200 

TGAGATGGAG ACCCCTGGGG CCGTGGGGTC TCAGGGGTGC CTGGTGAATT CTGCACTTAC 1260 

ACGTACTCAA GGGAGCGCGC CCGCGTTATC CTCGTACCTT TGTCTTCTTT CCATCTGTGG 1320 

AGTCAGTGGG TGTCGGCCGC TCTGTTGTGG GGGAGGTGAA CCAGGGAGGG GCAGGGCAAG 1380 

GCAGGGCCCC CAGAGCTGGG CCACACAGTG GGTGCTGGGC CTCGCCCCGA AGCTTCTGGT 1440 

GCAGCAGCCT CTGGTGCTGT CTCCGCGGAA GTCAGGGCGG CTGGATTCCA GGACAGGAGT 1500 

GAATGTAAAA ATAAATATCG CTTAGAATGC AGGAGAAGGG TGGAGAGGAG GCAGGGGCCG 1560 

AGGGGGTGCT TGGTGCCAAA CTGAAATTCA GTTTCTTGTG TGGGGCCTTG CGGTTCAGAG 1620 

CTCTTGGCGA GGGTGGAGGG AGGAGTGTCA TTTCTATGTG TAATTTCTGA GC CATTG TAC 1680 

TGTCTGGGCT GGGGGGGACA CTGTCCAAGG GAGTGGCCCC TATGAGTTTA TATTTTAACC 1740 

ACTGCTTCAA ATCTCGATTT CACTTTTTTT ATTTATCCAG TTATATCTAC ATATCTGTCA 1800 

TCTAAATAAA TGGCTTTCAA ACAAAGCAAC TGGGTCATTA AAACCAGCTC AAAGGGGGTT 1860 

TAAAAAAAAA AAAACCAGCC CATCCTTTGA GGCTGATTTT TCTTTTTTTT AAGTTCTATT 1920 

TTAAAAGCTA TCAAACAGCG ACATAGCCAT ACATCTGACT GCCTGACATG GACTCCTGCC 1980 

CACTTGGGGG AAACCTTATA CCCAGAGGAA AATACACACC TGGGGAGTAC ATTTGACAAA 2040 

TTTCCCTTAG GATTTCGTTA TCTCACCTTG ACCCTCAGCC AAGATTGGTA AAGCTGCGTC 2100 

CTGGCGATTC CAGGAGACCC AGCTGQAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160 

GAAAGAAGAG AATGAAGACT ACTTAGTAAT TCCCATCAGG AAATGCTGAC CTTTTACATA 2220 
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AAATCAAGGA OACTGCTGAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA 
TTTAGCAATA AGQAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAOAGAGA 
GGGAGAGGAA GAAAAGAGAG AGAGAAAAGA GCCTCGTGCC 



PCT/US02/12476 



2260 
2340 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO i 429 Protein sequence 
Protein Accession #: NP 003705 



1 
I 

MCAERLGQFM 
GDVGCGVPEC 
ISRKCPAIRE 
CGEEVKEAIT 
EAGHHLPEPS 
RR 



11 
I 

TLALVLATFD 
FENNSCEIRG 
MV9QU3RECY 
HSVQVQCEQN 
SRETGRGAKG 



21 

I 

PARGTDATNP 
LHGICMTFLH 
LKHDLCAAAQ 
WGSLCSILSF 
ERGSKSHPNA 



31 .41 51 

I I I 

PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 
NAGKFDAQGK SFIKDALKCK AHALRHRFGC 
ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 
CTSAIQKPPT APPERQPQVD RTKLSRAHHG 
HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 



Seq ID NO: 430 DNA sequence 

Nucleic Acid Accession ft: NM_005940 

Coding sequence i 23.. 1489 



l 
I 

AAGCCCAGCA 
CGCCCTCCTG 
TCTGCCGCCG 
AGCCCTGCCC 
CAGCCTCAGG 
CCGACAGAAG 
GATCCTTCGG 
CCTAAAGGTA 
TGACATCATO 
TGGGGGCATC 
CGACTATGAT 
AGCCCATGAA 
GTCCGCCTTC 
TCAACACCTA 
CCAGGCTGGG 
CTGTGAGGCC 
GGGCTTTGTG 
TCGCCACTGG 
CATTTGGTTC 
CCCCGCACCC 
GGGTCCCGAG 
CAGCACCCGG 
CTCTGAGATC 
CCTCTACTGG 
GGGTCCTGAC 
ATGCCCTCAG 
ATCTTTGTGG 
GGTGGGGTAC 
AGCGACTGTC 
GGGACCCGCT 
GTAGCACCAT 
TCCTTCCAGG 
TGAGCAACTG 
ATCTGTCTGC 
GTTCACAGTC 
CAACATACCT 
ATCCTCCAAA 
TTTTTAAACT 



11 

I 

GCCCCGGGGC 
CCCCCGATGC 
GACGTCCACC 
AGTAGCCCGG 
CCTCCCCGCT 
AGGTTCGTGC 
TTCCCATGGC 
TGGAGCGATG 
ATCGACTTCG 
CTGGCCCATG 
GAGACCTGGA 
TTTGGCCACG 
TACACCTTTC 
TATGGCCAGC 
ATAGACACCA 
TCCTTTGAOG 
TGGCGCCTCC 
CAGGGACTGC 
TTCCAAGGTG 
CTCACCGAGC 
AAGAACAAGA 
CGTGTAGACA 
GACGCTGCCT 
AAGTTTGACC 

GGGTGCTGAC 
CTGTGGGCAC 
AACCACCATG 
TCAGACTGGG 
ATGCAGGTCC 
GGCAGGACTG 
GGCTGGCACT 
GGCTGTAGGG 
CTTCTGGCTG 
AAATGGGGAG 
CAATCCTGTC 
GCCATTGTAA 
GAGGATTGTC 



21 

I 

GGATGGCTCC 
TGCTGCTGCT 
ACCT CCATGC 
CACCTGCCCC 
GTGGCGTGCC 
TTTCTGGCGG 
AGTTGGTGCA 
TGACGCCACT 
CCAGGTACTG 
CCTTCTTCCC 
CTATCGGGGA 
TGCTGGGGCT 
GCTACCCACT 
CCTGGCCCAC 
ATGAGATTGC 
CGGTCTCCAC 
GTGGGGGCCA 
CCAGCCCTGT 
CTCAGTACTG 
TGGGCCTGGT 
TCTACTTCTT 
GTCCCGTGCC 
TCCAGGATGC 
CTGTGAAGGT 
GTGCCGAGCC 
CCCTGCCAGG 
CAGGCATGGG 
ACAACTGCCG 
CAGGGAGGCT 
TGGCAAACCT 
GGGGAACTGG 
GAAGCAAGGG 
CAGGGCCACT 
ACAATCCTGG 
GGGTATTCTT 
CCAGGCCGGA 
ATGTGTGTAC 
ATTAAACACA 



31 
I 

GGCCGCCTGG 
GCTCCAGCCG 
CGAGAGGAGG 
TGCCACGCAG 
CGACCCATCT 
GCGCTGGGAG 
GGAGCAGGTG 
CACCTTTACT 
GCATGGGGAC 
CAAGACTCAC 
TGACCAGGGC 
GCAGCACACA 
GAGTCTCAGC 
TGTCACCTCC 
ACCGCTGGAG 
CATCCGAGGC 
GCTGCAGCCC 
GGACGCTGCC 
GGTGTACGAC 
GAGGTTCCCG 
CCGAGGCAGG 
CCGCAGGGCC 
TGATGGCTAT 
GAAGGCTCTG 
TGCCAACACT 
CCACGAATAT 
ACTGAGCCCA 
GGAGGGCCAC 
TTGGCATGAC 
GGCTGCCCTG 
AGTGTCCTTG 
TGCTGGGGCC 
TCCTGAGGTC 
AAATCTGTTC 
CATGCAGGAG 
TCCTCCTGAA 
AGTGTGTATA 
GTTGTTTTCT 



41 
I 

CTCCGCAGCG 
CCGCCGCTGC 
GGGCCACAGC 
GAAGCCCCCC 
GATGGGCTGA 
AAGACGGACC 
CGGCAGACGA 
GAGGTGCACG 
GACCTGCCGT 
CGAGAAGGGG 
ACAGACCTGC 
ACAGCAGCCA 
CCAGATGACT 
AG QACCCCAG 
CCAGACGCCC 
GAGCTCTTTT 
GGCTACCCAG 
TTCGAGGATG 
GGTGAAAAGC 
QTCCATGCTG 
GACTACTGGC 
ACTGACTGGA 
GCCTACTTCC 
GAAGGCTTCC 
TTCCTCTGAC 
CAGGCTAGAG 
TGTCTCCTGC 
GCAGGTCGTG 
TTAAGAGGAA 
TCTCATCCCT 
CTGTATCCCT 
CCATGGCCTT 
AGGTCTTGGT 
TCCAGAATCC 
ACCCCAGGCC 
GCCCTT TTCG 
AACCTTCTTC 



51 
I 

CGGCCGCGCG 
TGGCCCGGGC 
CCTGGCATGC 
GGCCTGCCAG 
GTGCCCGCAA 
tCACCTACAG 
TGGCAGAGGC 
AGGGCCGTGC 
TTGATGGGCC 
ATGTCCACTT 
TGCAGGTGGC 
AGGCCCTGAT 
GCAGGGGCGT 
CCCTGGGCCC 
CGCCAGATGC 
TCTTCAAAGC 
CATTGGCCTC 
CCCAGGGCCA 
CAGTCCTGGG 
CCTTGGTCTG 
GTTTCCACCC 
GAGGGGTGCC 
TGCGCGGCCG 

CATGGCTTGG 
ACCCATGGCC 
AGGGGGATGG 
GTCACCTGCC 
GGGCAGTCTT 
GTCCCTCAGG 
GTTGTGAGGT 
CAGCCCTGGC 
AGGTGCCTGC 
AGGCCAAAAA 
CTGGAGGCTG 
CAGCACTGCT 
TTCTTTTTTT 



Seq ID NO: 431 Protein sequence 
Protein Accession #: NP_00593l 



l 
I 

MAPAAWLRSA 
PAPATQEAPR 
LVQEQVRQTM 
PFPKTHREGD 
YPLSLSPDDC 
VSTIRGELFP 
QYWVYDGEKP 
PVPRRATDWR 
AEPANTFL 



11 
I 

AARALLPPML 
PASSLRPPRC 
AEALKVWSDV 
VHFDYDETWT 
RGVQHLYGQP 
PKAGFVWRLR 
VLGPAPLTEL 
GVPSEIDAAF 



21 
1 

LLLLQPPPLL 
GVPDPSDGLS 
TPLTFTEVHE 
IGDDQGTDLL 
WPTVTSRTPA 
GGQLQPGYPA 
GLVRPPVHAA 
QDADGYAYFL 



31 
I 

ARALPPDVHH 
ARNRQKRFVL 
GRADIMIDFA 
QVAAHEFGHV 
LGPQAGIDTN 
LASRHWQGLP 
LVWGPEKNKI 
RGRLYWKFDP 



41 
I 

LHAERRGPQP 
SGGRWEKTDL 
RYWHGDDLPF 
LGLQHTTAAK 
EIAPLEFDAP 
SPVDAAFEDA 
YFFRGRDYWR 
VKVKALEGFP 



51 

I 

WHAAIiPSSPA 
TYRILRFPWQ 
DGPGGILAKA 
ALMSAFYTFR 
PDACEASFDA 
QGHIWFFQGA 
FHPSTRRVDS 
RLVGPDFFGC 



Seq ID NO: 432 DNA sequence 

Nucleic Acid Accession ft: NM.024022 

Coding sequence: 202.. 1563 



1 11 21 

I I I 

ACCGGGCACC GGACGGCTCG GGTACTTTCG 
GGAAAGGGCT GTGTTTATGG GAAGCCAGTA 
CCATCTACAT TTTTGGGACT CGGGAATTAT 
AGAGGTCCTG AAATAGTCAC CATGGGGGAA 
TCATTCCGAT CGCTTTTTGG CCTTGATGAT 



31 41 51 

I I I 

TTCTTAATTA GGTCATGCCC GTGTGAGCCA 

ACACTGTGGC CTACTATCTC TTCCGTGGTG 

GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 

AATGATCCGC CTGCTGTTGA AGCCCCCTTC 
TTGAAAATAA GTCCTGTTGC ACCAGATGCA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
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GATGCTGTTQ CTGCACAGAT CCTGTCACTQ CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAQA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCOGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT ( TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG' AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AG CAAGT ACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380 

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCCCCTGGAC TCCCGTGTAQ GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

GCTGCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGO TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2040 

CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC AT CTCT AGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 24 00 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAA 

Seg ID NO: 433 Protein sequence 
Protein Accession #t NP_076927 

1 n 21 31 41 51 

I I I 1 1 I 

MGENDPPAVE APFSFRSLFG LDDLK I S P VA PDADAVAAQ I LSLLPLKFFP IIVIGIIAIiI 60 

LAIiAIGLGIH FOCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 

TAASWKTMCS DDWKGHYAMV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 

VTAIiHHSVYV RBGCASGHW TLQCTACGHR RGYS9RIVGG NMSIiLSQWPW QASLQFQGYH 240 

LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSIiLDN PAPSHLVEKI VYHSKYKPKR 300 

LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360 

VPLISNKICN HRDVYGGIIS PSMIiCAGYLT GGVDSCQGD9 GGPLVCQERR LWKLVGATSF 420 
GIGCAEVNKP GVYTRVTSFI* DWIHEQMERD LKT 

Seq ID NO: 434 DNA sequence 

Nucleic Acid Accession #: NM_000493.2 

Coding sequence: 97.. 2 13 9 

1 11 21 31 41 51 

I I I I I I 

CACCTTCTGC ACTGCTCATC TGGGCAGAGG AAGCTTCAGA AAGCTGCCAA GGCACCATCT 60 

CCAGGAACTC CCAGCACGCA GAATCCATCT GAGAATATGC TGCCACAAAT ACCCTTTTTG 120 

CTGCTAGTAT CCTTGAACTT GGTTCATGGA GTGTTTTACG CTGAACGATA CCAAATGCCC 180 

ACAGGCATAA AAGGCCCACT ACCCAACACC AAGACACAGT TCTTCATTCC CTACACCATA 240 

AAGAGTAAAG GTATAGCAGT AAGAGGAGAG CAAGGTACTC CTGGTCCACC AGGCCCTGCT 300 

GGACCTCGAG GGCACCCAGG TCCTTCTGGA CCACCAGGAA AACCAGGCTA CGGAAGTCCT 3 60 

GGACTCCAAG GAGAGCCAGG GTTGCCAGGA CCACCGGGAC CATCAGCTGT AGGGAAACCA 420 

GGTGTGCCAG GACTCCCAGG AAAACCAGGA GAGAGAGGAC CATATGGACC AAAAGGAGAT 480 

GTTGGACCAG CTGGCCTACC AGGACCCCGG GGCCCACCAG GACCACCTGG AATCCCTGGA 540 

CCGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGG GACCCACAGG AGCCCCAGGA 600 

CCCAGGGGCT TTCCTGGAGA AAAGGGTGCA CCAGGAGTCC CTGGTATGAA TGGACAGAAA 660 

GGGGAAATGG GATATGGTGC TCCTGGT CGT CCAGGTGAGA GGGGTCTTCC AGGCCCTCAG 720 

GGTCCCACAG GACCATCTGG CCCTCCTGGA GTGGGAAAAA GAGGTGAAAA TGGGGTTCCA 780 

GGACAGCCAG GCATCAAAGG TGATAGAGGT TTTCCGGGAG AAATGGGACC AATTGGCCCA 840 

CCAGGTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAGCT 900 

GCTGGAGCCC CAGGCCAGCC AGGGATTCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960 

ATAGCTGGGC CCCCAGGGCC' TCCTGGCTTT GGGAAACCAG GCTTGCCAGG CCTGAAGGGA 1020 

GAAAGAGGAC CTGCTGGCCT TCCTGGGGGT CCAGGTGCCA AAGGGGAACA AGGGCCAGCA 1080 

GGTCTTCCTG GQAAGCCAGG TCTGACTGGA CCOCCTGGGA ATATGGGACC CCAAGGACCA 1140 

AAAGGCATCC CGGGTAGCCA TGGTCTCCCA GGCCCTAAAG GTGAGACAGG GCCAGCTGGG 1200 

CCTGCAGGAT ACCCTGGGGC TAAGGGTGAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 1260 

GGGTACCCAG GAAAACCAGG TCTCGATGGT CCTAAGGGTA ACCCAGGGTT ACCAGGTCCA 1320 

AAAGGTGATC CTGGAGTTGG AGGACCTCCT GGTCTCCCAG GCCCTGTGGG CCCAGCAGGA 13 80 

GCAAAGGGAA TGCCCGGACA CAATGGAGAG GCTGGCCCAA GAGGTGCCCC TGGAATACCA 1440 

GGTACTAGAG GCCCTATTGG GCCACCAGGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT 1500 

CCAGGAAGTC CCGGTCCTCC TGGCCCAGCT GGCATAGGAA CTAAGGGCCT CAATGGACCC 1560 

ACCGGGCCAC CAGGGCCTCC AGGTCCAAGA GGCCACTCTG GAGAGCCTGG TCTTCCAGGG 1620 

CCCCCTGGGC CTCCAGGCCC ACCAGGTCAA GCAGTCATGC CTGAGGGTTT TATAAAGGCA 1680 

GGCCAAAGGC CCAGTCTTTC TGGGACCCCT CTTGTTAGTG CCAACCAGGG GGTAACAGGA 1740 
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ATGCCTGTGT CTGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGGAACTCCC 1800 

ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTGGAATC I860 

TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 1920 

CATGTTTGGG TAGGCCTGTA TAAGAATGGC ACCCCTGTAA TGTACACCTA TGATGAATAC 1980 

ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC 2040 

CAGGTGTGGC TCCAGCTTCC CAATGCCGAG TCAAATGGCC TATACTCCTC TGAGTATGTC 2100 

CACTCCTCTT TCTCAGGATT CCTAGTGGCT CCAATGTGAG TACACCCCAC AGAGCTAATC 2160 

TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA ATGCATATGG 2220 

AGGTAGGCTG AAAAGAATGT AATTTTTATT TTCTGAAATA CAGATTTGAG CTATCAGACC 2280 

AACAAACCTT CCCCCTGAAA AGTGAGCAGC AACGTAAAAA CGTATGTGAA GCCTCTCTTG 2340 

AATTTCTAGT TAGCAATCTT AAGGCTCTTT AAGGTTTTCT CCAATATTAA AAAATATCAC 2400 

CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 2460 

TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATGTGAAA TTTGATTTGA GAAACTCGGC 2S20 

ATTTCCTTTT TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2S80 

AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTCAAATT TAAAAGACAC 2640 

TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2700 

CAATATCTAT TCAAATATAC AGGTGCATAT ATACTTGTTA AAGCTCTTAT ATAAAAAAGC 2760 

CCCAAAATAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2820 

CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAGCA TAACTTGGAA ACAGGTATCT 2880 

GACCTATTCT TATTTAGTTA ACACAAGTGT GATTAATTTG ATTTCTTTAA TTCCTTATTG 2940 

AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACATGTAC CTTGTGCCTC 3000 

CCATTCAAGT GAAGTTATAA TTTACACTGA GGGTTTCAAA ATT CG ACT AG AAGTGGAGAT 3060 

ATATTATTTA TTTATGCACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAGCTG 3120 

TGCCTCACTT ATTAAAGCAC AAAATGTTTT ACCTACTCCT TATTTACGAC ACAATAAAAT 3180 

AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 
TTCTTT CAAG GCTTTTCATT CGACACAATA AAATAACATC AATAG 

Seq ID N0» 435 Protein sequence 
Protein Accession #: NP__000484.2 

1 ll 21 31 41 51 

I I I I- I I 

MLPQIPFLIiL vslnlvhgvf yaeryqmptg irgplpntkt qffipytiks KGIAVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 

GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 

VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240 

GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGL PGAPG I A GPPGPPGFGK 300 

PGLPGLKGER GPAGLPGGPG AXGEQGPAGI/ PGXPGLTGPP GNMGPQGPKG IPGSHGI*PGP 360 

KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420 

PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 

ATKGLNGPTG PPGPPGPRGH SGEPGLPGPP GPPGPPGQAV MPEGF1KAGQ RPSLSGTPLV 540 

SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600 

YHVHVKGTHV WVGIiYKNGTP VMYTYDEYTK GYLDQASGSA IIDL.TENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 436 DNA sequence 
Nucleic Acid Accession #: XM_062811 
Coding sequence: 1..888 

1 11 21 31 41 51 

t | I I I I 

ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC TCATCCTGGA ACGCCGCTTC GCTCCTGCAG 60 

CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA GCGGCGAGTA CTGCCACGGC 120 

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 180 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 240 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 300 

CGGGCGGACA AAGACGGCCC CGACGGCTCG GCAGTGCCCA TCTACGTGCC GTTCCTCATT 360 

GTTGGCTCCG TGTTTGTCGC CTTTATCATC TTGGGGTCCC TGGTGGCAGC CTGTTGCTGC 420 

AGATGTCTCC GGCCTAAGCA GGATCCCCAG CAGAGCCGAG CCCCAGGGGG TAACCGCTTG 480 

ATGGAGACCA TCCCCATGAT CCCCAGTGCC AGCACCTCCC GGGGGTCGTC CTCACGCCAG 540 

TCCAGCACAG CTGCCAGTTC CAGCTCCAGC GCCAACTCAG GGGCCCGGGC GCCCCCAACA 600 

AGGTCACAGA CCAACTGTTG CTTGCCGGAA GGGACCATGA ACAACGTGTA TGTCAACATG 660 

CCCACGAATT TCTCTGTGCT GAACTGTCAG CAGGCCACCC AGATTGTGCC ACATCAAGGG 720 

CAGTATCTGC ATCCCCCATA CGTGGGGTAC ACGGTGCAGC ACGACTCTGT GCCCATGACA 780 

GCTGTGCCAC CTTTCATGGA CGGCCTGCAG CCTGGCTACA GGCAGATTCA GTCCCCCTTC 840 
CCTCACACCA ACAGTGAACA GAAGATGTAC CCAGCGGTGA CTGTATAA 

Seq ID NOi 437 Protein sequence 
Protein Accession Ih XP_062811 

1 n 21 31 41 51 

I I I I I I 

MWGARRSSVS SSWNAASLLQ LLLAALIiAAG ARASGEYCHG WI*DAQGVWRI GFQCPERFDG 60 

GDATICCGSC ALRYCCSSAE ARLDQGGCDN DRQQGAGEPG RADKDGPDGS AVPIYVPFLI 120 

VGSVFVAFII LGSLVAACCC RCLRPXQDPQ QSRAPGGNRL METIPMIPSA STSRGSSSRQ 180 

SSTAASSSSS ANSGARAPPT RSQTNCCLPE GTMNNVYVNM PTNFSVLNCQ QATQIVPHQG 240 
QYLHPPYVGY TVOHDSVPMT AVPPFMDGI»Q PGYRQIQSPF PHTNSEQKMY PAVTV 

Seq ID NO: 438 DNA sequence 

Nucleic Acid Accession NM__004004.1 

Coding sequence : 1 . . 681 

1 11 21 31 41 51 

I I I I I ! 

ATGGATTGGG GCACGCTGCA GACGATCCTG GGGGGTGTGA ACAAACACTC CACCAGCATT 60 

GGAAAGATCT GGCTCACCGT CCTCTTCATT TTTCGCATTA TGATCCTCGT TGTGGCTGCA 120 

AAGGAGGTGT GGGGAGATGA GCAGGCOGAC TTTGTCTGCA ACACCCTGCA GCCAGGCTGC 180 
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AAGAACGTGT GCTACGATCA CTACTTCCCC ATCTCCCACA TCCGGCTATG GGCCCTGCAG 240 

CTGATCTTCG TGTCCAGCCC AGCGCTCCTA GTGGCCATGC ACGTGGCCTA CCGGAGACAT 300 

GAGAAGAAGA GGAAGTTCAT CAAGGGGGAG ATAAAGAGTG AATTTAAGGA CATCGAGGAG 360 

ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC AAGCAGCATC 420 

TTCTTCCGGG TCATCTTCGA AGCCGCCTTC ATGTACGTCT TCTATGTCAT GTACGACGGC 480 

TTCTCCATGC AGCGGCTGGT GAAGTGCAAC GCCTGGCCTT GTCCCAACAC TGTGGACTGC 540 

TTTGTGTCCC GGCCCACGGA GAAGACTGTC TTCACAGTGT TCATGATTGC AGTGTCTGGA 600 

ATTTGCATCC TGCTGAATGT CACTGAATTG TGTTATTTGC TAATTAGATA TTGTTCTGGG 660 
AAGTCAAAAA AGCCAGTTTA A 

Seq ID NO: 439 Protein sequence 
Protein Accession #: NP_003 995.1 

1 11 21 31 41 SI 

MDWGTLQTIL GGVNKHST9I GKIWLTVLFI FRIMILWAA KEVWGDEQAD FVCNTLQPGC 60 
KNVCYDHYPP ISHIRLWALQ LIFVSSPALL VAMHVAYRRH EKKRKPIKGE IKSBFKDIEE 120 
IKTQKVRIEG SLWWTYTSSI FPRVIFEAAF MYVFYVMYDG FSMQRLVKCN AWPCPNTVDC 180 
PVSRPTEKTV FTVPMIAVSG ICILLNVTEL CYLLIRYCSG KSKKPV 



Seq ID NO» 440 DNA sequence 

Nucleic Acid Accession «: XM_061091.1 

Coding sequence* 1..2481 

1 U 21 31 41 51 

1 1 I I I I 

ATGCCAAATA CTTCAGGAAC AACCAGGATT GAAATTTGGC TTCTCCAAGA GCCGCCCGGG 60 

CACCGAGCGC TGGTCGCCGC TCTCCTTCCG GTGAGTCCCA GCCCCGAGTT GGCTCTGGCG 120 

CCCGGGTACC CGCCAGTGCC GGCTGCCGAT GACCGATTCA CGCTCCCGAT GATTGGAGGT 180 

CAGATGCATG GTGAGAAGGT AGATCTCTGG AGCCTTGGTG TTCTTTGCTA TGAATTTTTA 240 

GTTGGGAAGC CTCCTTTTGA GGCAAACGAA GTCCATGTAA GCAAAGAAAC CATCGGGAAG 300 

ATTTCAGCTG CCAGCAAAAT GATGTGGTGC TCGGCTGCAG TGGACATCAT GTTTCTGTTA 360 

GATGGGTCTA ACAGCGTCGG GAAAGGGAGC TTTGAAAGGT CCAAGCACTT TGCCATCACA 420 

GTCTGTGACG GTCTGGACAT CAGCCCCGAG AGGGTCAGAG TGGGAGCATT CCAGTTCAGT 480 

TCCACTCCTC ATCTGGAATT CCCCTTGGAT TCATTTTCAA CCCAACAGGA AGTGAAGGCA 540 

AGAATCAAGA GGATGGTTTT CAAAGGAGGG CGCACGGAGA CGGAACTTGC TCTGAAATAC 600 

CTTCTGCACA GAGGGTTGCC TGGAGGCAGA AATGCTTCTG TGCCCCAGAT CCTCATCATC 660 

GTCACTGATG GGAAGTCCCA GGGGGATGTG GCACTGCCAT CCAAGCAGCT GAAGGAAAGG 720 

GGTGTCACTG TGTTTGCTGT GGGGGTCAGG TTTCCCAGGT GGGAGGAGCT GCATGCACTG 780 

GCCAGCGAGC CTAGAGGGCA GCACGTGCTG TTGGCTGAGC AGGTGGAGGA TGCCACCAAC 840 

GGCCTCTTCA GCACCCTCAG CAGCTCGGCC ATCTGCTCCA GCGCCACGCC AGCTGGGAGC 900 

CCCGAGCTTG TCTTCATGGA GCGGTTAATG GGCATCTCTC TGATAGGCCC CTGTGACTCG 960 

CAGCCCTGCC AGAATGGAGG CACATGTGTT CCAGAAGGAC TGGACGGCTA CCAGTGCCTC 1020 

TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC TGTGCCCTGA AGCTGAGCCT GGAATGCAGG 1080 

GTCGACCTCC TCTTCCTGCT GGACAGCTCT GCGGGCACCA CTCTGGACGG CTTCCTGCGG 1140 

GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG GCCGTGCTGA GCGAGGACTC TCGGGCCCGA 1200 

GTGGGTGTGG CCACATACAG CAGGGAGCTG CTGGTGGCGG TGCCTGTGGG GGAGTACCAG 1260 

GATGTGCCTG ACCTGGTCTG GAGCCTCGAT GGCATTCCCT TCCGTGGTGG CCCCACCCTG 1320 

ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG CGTGGCTTCG GGAGCGCCAC CAGGACAGGC 13 BO 

CAGGACCGGC CACGTAGAGT GGTGGTTTTG CTCACTGAGT CACACTCCGA GGATGAGGTT 1440 

GCGGGCCCAG CGCGTCACGC AAGGGCGCGA GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG 1500 

GCCGTGCGGG CAGAGCTGGA GGAGATCACA GGCAGCCCAA AGCATGTGAT GGTCTACTCG 1560 

GATCCTCAGG ATCTGTTCAA CCAAATCCCT GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG 1620 

CGGCCAGGGT GCCGGACACA AGCCCTGGAC CTCGTCTTCA TGTTGGACAC CTCTGCCTCA 1680 

GTAGGGCCCG AGAATTTTGC TCAGATGCAG AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT 1740 

GAGGTGAACC CTGACGTGAC ACAGGTCGGC CTGGTGGTGT ATGGCAGCCA GGTGCAGACT 1800 

GCCTTCGGGC TGGACACCAA ACCCACCCGG GCTGCGATGC TGCGGGCCAT TAGCCAGGCC 1860 

CCCTACCTAG GTGGGGTGGG CTCAGCCGGC ACCGCCCTGC TGCACATCTA TGACAAAGTG 1920 

ATGACCGTCC AGAGGGGTGC CCGGCCTGGT GTCCCCAAAG CTGTGGTGGT GCTCACAGGC 1980 

GGGAGAGGCG CAGAGGATGC AGCCGTTCCT GCCCAGAAGC TGAGGAACAA TGGCATCTCT 2040 

GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC 2100 

CGGGATTCCC TGATCCACGT GGCAGCTTAC GCCGACCTGC GGTACCACCA GGACGTGCTC 2160 

ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG CCAGTCAACC TCTGCAAACC CAGCCCGTGC 2220 

ATGAATGAGG GCAGCTGCGT CCTGCAGAAT GGGAGCTACC GCTGCAAGTG TCGGGATGGC 2280 

TGGGAGGGCC CCCACTGCGA GAACCGTGAG TGGAGCTCTT GCTCTGTATG TGTGAGCCAG 2340 

GGATGGATTC TTGAGACGCC CCTGAGGCAC ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT 2400 

ACCCCTCCCA GCAACTACAG AGAAGGCCTG GGCACTGAAA TGGTGCCTAC CTTCTGGAAT 2460 
GTCTGTGCCC CAGGTCCTTA G 

Seq ID NO: 441 Protein sequence 
Protein Accession #: XP_061091.1 

1 11 21 31 41 51 

1 | 1 I 1 1 

MPNTSGTTRI EIWLLQEPPG HRALVAALLP VSPSPELALA PGYPPVPAAD DRFTLPMIGG 60 

QMHGEKVDLW ' SLGVLCYEFL VGKPPFEANE VHVSKETIGK ISAASKMMWC SAAVD I MFL L 120 

DGSNSVGKGS FERSKHFAIT VCDGLDISPE RVRVGAFQFS STPHIiEPPLD SFSTQQEVKA 180 

RIKRMVFKGG RTETEIALKY LLHRGLPGGH NASVPQILII VTDGKSQGDV ALPSKQLKER 240 

GVTVFAVGVR FPRWEELHAIi ASEPRGQHVL LAEQVEDATN GLFSTLSSSA ICSSATPAGS 300 

PELVFMERLM GISLIGPCDS QPCQNGGTCV PEGLDGYQCL CPLAFGGEAN CALKLSLECR 360 

VDLLFLLDSS AGTTLDGFLR AKVFVKRFVR AVLSEDSRAR VGVATYSREL LVAVPVGEYQ 420 

DVPDLVWSLD GIPFRGGPTL TGSALRQAAE RGFGSATRTG QDRPRRVWL LTESHSEDEV 480 

AGPARHARAR ELLLLGVGSE AVRAELBEIT GSPKHVMVYS DPQDLFNQIP ELQGKLCSRQ 540 

RPGCRTQALD fcVFMLDTSAS VGPENFAQMQ SFVRSCALQF EVNPDVTQVG LWYGSQVQT 600 

AFGLDTKPTR AAMLRAISQA PYLGGVGSAG TALLHIYDKV MTVQRGARPG VPKAWVLTG 660 

GRGAEDAAVP AQKLRNNGIS VLWGVGPVL SEGLRRLAGP RDSLIHVAAY ADLRYHQDVL 720 
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IEWLCGBAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCRDG WEGPHCENRE WSSCSVCVSQ 
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCAPGP 

Seq ID NO i 442 DMA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..2424 



PCT/US02/12476 



780 
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i 
I 

ATGCCCCCTT 
TCTCTCCCTC 
AGCAAAATGA 
AGCGTCGGGA 
CTGGACATCA 
CTGGAATTCC 
ATGGTTTTCA 
GGGTTGCCTG 
AAGTCCCAGG 
TTTGCTGTGG 
AGAGGGCAGC 
ACCCTCAGCA 
CCCTGTGAGC 
AGAGGATCGC 
AGAGTGTTCC 
TCGCAGCCCT 
CTCTGCCCGC 
AGGGT CGACC 
CGGGCCAAAG 
CGAGTGGGTG 
CAGGATGTGC 
CTGACGGGCA 
GGCCAGGACC 
GTTGCGGGCC 
GAGGCCGTGC 
TCGGATCCTC 
CAGCGGCCAG 
TCAGTAGGGC 
TTTGAGGTGA 
ACTGCCTTCG 
GCCCCCTACC 
GTGATGACCG 
GGCGGGAGAG 
TCTGTCTTGG 
CCCCGGGATT 
CTCATTGAGT 
TGCATGAATG 
GGCTGGGAGG 
CAGGGATGGA 
CGTACCCCTC 
AATGTCTGTG 



11 
I 

TCCTGTTGCT 
TCCAGGAAGT 

AAGGGAGCTT 
GCCCCGAGAG 
CCT TGGATTC 
AAGGAGGGCG 
GAGGCAGAAA 
GGGATGTGGC 
GGGTCAGGTT 
ACGTGCTGTT 
GCTCGGCCAT 
ACAGGACGCT 
GGCGGACCCT 
TAACCCACCC 
GCCAGAATGG 
TGGCCTTTGG 
TCCTCTTCCT 
T CTTCGTGAA 
TGGCCACATA 
CTGACCTGGT 
GTGCCTTGCG 
GGCCACGTAG 
CAGCGCGTCA 
GGGCAGAGCT 
AGGATCTGTT 
GGTGCCGGAC 
CCGAGAATTT 
ACCCTGACGT 
GGCTGGACAC 
TAGGTGGGGT 
TCCAGAGGGG 
GCGCAGAGGA 
TCGTGGGCGT 
CCCTGATCCA 
GGCTGTGTGG 
AGGGCAGCTG 
GCCCCCACTG 
TTCTTGAGAC 
CCAGCAACTA 
CCCCAGGTCC 



21 
I 

GGAGGCCGTC 
CCATGTAAGC 
GGCTGCAGTG 
TGAAAGGTCC 
GGTCAGAGTG 
ATTTTCAACC 
CACGGAGACG 
TGCTTCTGTG 
ACTGCCATCC 
TCCCAGGTGG 
GGCTGAGCAG 
CTGCTCCAGC 
GGAGATGGTC 
TGCGGTGCTG 
TGCCACCTGC 
AGGCACATGT 
AGGGGAGGCT 
GCTGGACAGC 
GCGGTTTGTG 
CAGCAGGGAG 
CTGGAGCCTC 
GCAGGCGGCA 
AGTGGTGGTT 
CGCAAGGGCG 
GGAGGAGATC 
CAACCAAATC 
ACAAGCCCTG 
TGCTCAGATG 
GACACAGGTC 
CAAACCCACC 
GGGCTCAGCC 
TGCCCGGCCT 
TGCAGCCGTT 
GGGGCCTGTC 
CGTGGCAGCT 
AGAAGCCAAG 
CGTCCTGCAG 
CGAGAACCGT 
GCCCCTGAGG 
CAGAGAAGGC 
TTAG 



31 
I 

TGTGTTTTCC 
AAAGAAACCA 
GACATCATGT 
AAGCACTTTG 
GGAGCATTCC 
CAACAGGAAG 
GAACTTGCTC 
CCCCAGATCC 
AAGCAGCTGA 
GAGGAGCTGC 
GTGGAGGATG 
GCCACGCCAG 
CGGGAGTTCG 
GCTGCACACT 
TACAGGACCA 
GTTCCAGAAG 
AACTGTGCCC 
TCTGCGGGCA 
CGGGCCGTGC 
CTGCTGGTGG 
GATGGCATTC 
GAGCGTGGCT 
TTGCTCACTG 
CGAGAGCTGC 
ACAGGCAGCC 
CCTGAGCTGC 
GACCTCGTCT 
CAGAGCTTTG 
GGCCTGGTGG 
CGGGCTGCGA 
GGCACCGCCC 
GGTGTCCCCA 
CCTGCCCAGA 
CTAAGTGAGG 
TACGCCGACC 
CAGCCAGTCA 
AATGGGAGCT 
GAGTGGAGCT 
CACATGGCTC 
CTGGGCACTG 



Seq ID NO: 443 Protein sequence 
Protein Accession ft: Eos sequence 



1 
I 

MPPFLLLEAV 
SVGKGSFERS 
MVFKGGRTET 
PAVGVRPPRW 
PCEHRTLiEMV 
SQPCQNGGTC 
RAKVFVXRFV 
LTGSALRQAA 
EAVRAELEEI 
SVGPENFAQM 
APYLGGVGSA 
SVLWGVGPV 
CMNEGSCVLQ 
RTPPSNYREG 



11 

I 

cvflfsrvpp 
khfai tvcdg 
elalkyli.hr 

EELHALASEP 
REFAGNAPCW 
VPEGLDGYQC 
RAVLSEDSRA 
ERGFGSATRT 
TGSPKHVMVY 
QSFVRSCALQ 
GTALLHIYDK 
LSEGLRRLAG 
NGSYRCKCRD 
LGTEMVPTFW 



21 

I 

SLPLQEVKVS 
LDISPERVRV 
GLPGGRNASV 
RGQHVLLAEQ 
RGSRRTLAVL 
LCPLAFGGEA 
RVGVATYSRE 
GQDRPRRVW 
SDPQDLFNQI 
FEVNPDVTQV 
VMTVQRGARP 
PRDSLIHVAA 
GWEGPHCENR 
NVCAPGP 



31 
I 

KETIGKISAA 
GAFQFSSTPH 
PQILIIVTDG 
VEDATNGLPS 
AAHCPPYSWK 
NCALKLSLEC 
LLVAVPVGEY 
LLTESHSEDE 
PELQGKLCSR 
GLWYGSQVQ 
GVPKAWVLT 
YADLRYHQDV 
EWSSCSVCVS 



41 
I 

TGTTTTCCAG 
TCGGGAAGAT 
TTCTGTTAGA 
CCATCACAGT 
AGTTCAGTTC 
TGAAGGCAAG 
TGAAATACCT 
TCATCATCGT 
AGGAAAGGGG 
ATGCACTGGC 
CCACCAACGG 
ACTGCAGGGT 
CTGGCAATGC 
GTCCCTTCTA 
CCTGCCCAGG 
GACTGGACGG 
TGAAGCTGAG 
CCACTCTGGA 
TGAGCGAGGA 
CGGTGCCTGT 
CCTTCCGTGG 
TCGGGAGCGC 
AGTCACACTC 
TCCTGCTGGG 
CAAAGCATGT 
AGGGGAAGCT 
TCATGTTGGA 
TGAGAAGCTG 
TGTATGGCAG 
TGCTGCGGGC 
TGCTGCACAT 
AAGCTGTGGT 
AGCTGAGGAA 
GTCTGCGGAG 
TGCGGTACCA 
ACCTCTGCAA 
ACCGCTGCAA 
CTTGCTCTGT 
CCGTGCAGGA 
AAATGGTGCC 



41 
I 

SKMMWCSAAV 
LEFPLDSFST 
KSQGDVALP3 
TLSSSAICSS 
RVFLTHPATC 
RVDLLFLLDS 
QDVPDLVWSL 
VAGPARHARA 
QRPGCRTQAL 
TAFGLDTKPT 
GGRGAEDAAV 
LIEWLCGEAK 
QGWILETPLR 



Seq ID NO: 444 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 89.. 2356 



1 

I 

GCCCCCTGGC 
GTCGCCGCTC 
TGTTTTCCTG 
AGAAACCATC 
CATCATGTTT 
GCACTTTGCC 
AGCATTCCAG 
ACAGGAAGTG 
ACTTGCTCTG 
CCAGATCCTC 



11 

I 

CCGAGCCGCG 
TCCTTCCGTT 
TTTTCCAGAG 
GGGAAGATTT 
CTGTTAGATG 
ATCACAGTCT 
TTCAGTTCCA 
AAGGCAAGAA 
AAATACCTTC 
ATCATCGTCA 



21 
I 

CCCGGGTCTG 
ATATCAACAT 
TGCCCCCATC 
CAGCTGCCAG 
GGTCTAACAG 
GTGACGGTCT 
CTCCTCATCT 
TCAAGAGGAT 
TGCACAGAGG 
CTGATGGGAA 



31 
I 

TGAGTAGAGC 
GCCCCCTTTC 
TCTCCCTCTC 
CAAAATGATG 
CGTCGGGAAA 
GGACATCAGC 
GGAATTCCCC 
GGTTTTCAAA 
GTTGCCTGGA 
GTCCCAGGGG 



41 
I 

CGCCCGGGCA 
CTGTTGCTGG 
CAGGAAGTCC 
TGGTGCTCGG 
GGGAGCTTTG 
CCCGAGAGGG 
TTGGATTCAT 
GGAGGGCGCA 
GGCAGAAATG 
GATGTGGCAC 



51 
I 

AGTGCCCCCA 
TTCAGCTGCC 
TGGGTCTAAC 
CTGTGACGGT 
CACTCCTCAT 
AATCAAGAGG 
TCTGCACAGA 
CACTGATGGG 
TGTCACTGTG 
CAGCGAGCCT 
CCTCTTCAGC 
CGAGGCTCAC 
CCCATGCTGG 
CAGCTGGAAG 
CCCCTGTGAC 
CTACCAGTGC 
CCTGGAATGC 
CGGCTTCCTG 
CTCTCGGGCC 
GGGGGAGTAC 
TGGCCCCACC 
CACCAGGACA 
CGAGGATGAG 
TGTAGGCAGT 
GATGGTCTAC 
GTGCAGCCGG 
CACCTCTGCC 
TGCCCTCCAG 
CCAGGTGCAG 
CATTAGCCAG 
CTATGACAAA 
GGTGCTCACA 
CAATGGCATC 
GCTTGCAGGT 
CCAGGACGTG 
ACCCAGCCCG 
GTGTCGGGAT 
ATGTGTGAGC 
GGGCAGCAGC 
TACCTTCTGG 



51 
I 

DIMFLLDGSN 
QQEVKARIKR 
KQLKERGVTV 
ATPDCRVEAH 
YRTTCPGPCD 
SAGTTLDGFL 
DGIPFRGGPT 
RELLLLGVGS 
D LVFMLDT S A 
RAAMLRAISQ 
PAQKLRNNGI 
QPVNLCKPSP 
HMAPVQEGSS 



51 
I 

CCGAGCGCTG 
AAGCCGTCTG 
ATGTAAGCAA 
CTGCAGTGGA 
AAAGGTCCAA 
TCAGAGTGGG 
TTTCAACCCA 
CGGAGACGGA 
CTTCTGTGCC 
TGCCATCCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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GCAGCTGAAG GAAAGGGGTG TCACTGTGTT TGCTGTGGGG 
GGAGCTGCAT GCACTGGCCA GCGAGCCTAG AGGGCAGCAC 
GGAGGATGCC ACCAACGGCC TCTTCAGCAC CCTCAGCAGC 
CACGCCAGAC TGCAGGGTCG AGGCTCACCC CTGTGAGCAC 
GGAGTTCGCT GGCAATGCCC CATGCTGGAG AGGATCGCGG 
TGCACACTGT CCCTTCTACA GCTGGAAGAG AGTGTTCCTA 
CAGGACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC 
TCCAGAAGGA CTGGACGGCT ACCAGTGCCT CTGCCCGCTG 
CTGTGCCCTG AAGCTGAGCC TGGAATGCAG GGTCGACCTC 
TGCGGGCACC ACT CTGGACG GCTTCCTGCG GGCCAAAGTC 
GGCCGTGCTG AGCGAGGACT CTCGGGCCCG AGTGGGTGTG 
GCTGGTGGCG GTGCCTGTGG GGGAGTACCA GGATGTGCCT 
TGGCATTCCC TTCCGTGGTG GCCCCACCCT GACGGGCAGT 
GCGTGGCTTC GGGAGCGCCA CCAGGACAGG CCAGGACCGG 
GCTCACTGAG TCACACTCCG AGGATGAGGT TGCGGGCCCA 
AGAGCTGCTC CTGCTGGGTG TAGGCAGTGA GGCCGTGCGG 
AGGCAGCCCA AAGCATGTGA TGGTCTACTC GGATCCTCAG 
TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GCGGCCAGGG 
CCTCGTCTTC ATGTTGGACA CCTCTGCCTC AGTAGGGCCC 
GAGCTTTGTG AGAAGCTGTG CCCTCCAGTT TGAGGTGAAC 
CCTGGTGGTG TATGGCAGCC AGGTGCAGAC TGCCTTCGGG 
GGCTGCGATG CTGCGGGCCA TTAGCCAGGC CCCCTACCTA 
CACCGCCCTG CTGCACATCT ATGACAAAGT GATGACCGTC 
TGTCCCCAAA GCTGTGGTGG TGCTCACAGG CGGGAGAGGC 
TGCCCAGAAG CTGAGGAACA ATGGCATCTC TGTCTTGGTC 
AAGTGAGGGT CTGCGGAGGC TTGCAGGTCC CCGGGATTCC 
CGCCGACCTG CGGTACCACC AGGACGTGCT CATTGAGTGG 
GCCAGTCAAC CTCTGCAAAC CCAGCCCGTG CATGAATGAG 
TGGGAGCTAC CGCTGCAAGT GTCGGGATGG CTGGGAGGGC 
CTTGAGACGC CCCTGAGGCA CATGGCT CCC GTGCAGGAGG 
AGCAACTACA GAGAAGGCCT GGGCACTGAA ATGGTGCCTA 
CCAGGTCCTT AGAATGTCTG CTTCCCGCCG TGGCCAGGAC 
GAGGATGTCC CAACTGCAGC CATGCTGCTT AGAGACAAGA 
AAACGATGTT GTTGAAAAGT TTTGATGTGT AAGTAAATAC 
GCCTTGTTGA GGCTATGTCA TCTGCCACCT TTCCCTTGAG 
GACTTAAATT TAGCGGCCTG ACGTTCCTTT GCACACAATC 
TGACACAGTA ATGCCCAGCA GAGGCCTTTA CTAGAGCATC 

Seq ID NO: 445 Protein sequence 
Protein Accession #: Eos sequence 



PCT/US02/12476 



GTCAGGTTTC 
GTGCTGTTGG 
TCGGCCATCT 
AGGACGCTGG 
CGGACCCTTG 
ACCCACCCTG 
CAGAATGGAG 
GCCTTTGGAG 
CTCTTCCTGC 
TTCGTGAAGC 
GCCACATACA 
GACCTGGTCT 
GCCTTGCGGC 
CCACGTAGAG 
GCGCGTCACG 
GCAGAGCTGG 
GATCTGTTCA 
TGCCGGACAC 
GAGAATTTTG 
CCTGACGTGA 
CTGGACACCA 
GGTGGGGTGG 
CAGAGGGGTG 
GCAGAGGATG 
GTGGGCGTGG 
CTGATCCACG 
CTGTGTGGAG 
GGCAGCTGCG 
CCCCACTGCG 
GCAGCAGCCG 
CCTTCTGGAA 
CACTATTCTC 
AAGCAGCTGA 
CCACTTTCTG 
GATAAACAAG 
AATGCTCGCC 
CTTTGGACGG 



CCAGGTGGGA 
CTGAGCAGGT 
GCTCCAGCGC 
AGATGGTCCG 
CGGTGCTGGC 
CCACCTGCTA 
GCACATGTGT 
GGGAGGCTAA 
TGGACAGCTC 
GGTTTGTGCG 
GCAGGGAGCT 
GGAGCCTCGA 
AGGCGGCAGA 
TGGTGGTTTT 
CAAGGGCGCG 
AGGAGATCAC 
ACCAAATCCC 
AAGCCCTGGA 
CTCAGATGCA 
CACAGGTCGG 
AACCCACCCG 
GCTCAGCCGG 
CCCGGCCTGG 
CAGCCGTTCC 
GGCCTGTCCT 
TGGCAGCTTA 
AAGCCAAGCA 
TCCTGCAGAA 
AGAACCGATT 
TACCCCTCCC 
TGTCTGTGCC 
ACTGAGGGAG 
TGTCACCCAC 
TACCTGCTGT 
GGGTCCTGAA 
AGAATGTTGT 



MPPFLLLEAV 
SVGKGSFERS 
MVPKGGRTET 
FAVGVRFPRW 
PCEHRTIjEMV 
SQPCQNGGTC 
RAKVPVKRPV 
LTGSALRQAA 
EAVRAELEEI 
SVGPENFAQM 
APYX.GGVGSA 
SVLWGVGPV 
CMNEGSCVLQ 



11 
I 

CVPLPSRVPP 
KHPAITVCDG 
ELALKYLLHR 
EELHALASEP 
REPAGNAPCW 
VPEGLDGYQC 
RAVLSEDSRA 
ERGFGSATRT 
TGSPKHVMVY 
QSFVRSCAIiQ 
GTAIiltH I YDK 
L SEGLRRLAG 
NGSYRCKCRD 



21 
I 

SLPLQEVHVS 
IiDISPERVRV 
GLPGGRNASV 
RGQHVLLAEQ 
RGSRRTLAVL 
LCPLAFGGEA 
RVGVATYSRE 
GQDRPRRWV 
SDPQDLFNQI 
FEVNPDVTQV 
VMTVQRGARP 
PRDSLIHVAA 
GWEGPHCENR 



31 
I 

KETIGKISAA 
GAFQFSSTPH 
PQILIIVTDG 
VEDATNGLFS 
AAHCPFYSWK 
NCALKLSLEC 
LLVAVPVGEY 
LLTESHSEDE 
PELQGK&CSR 
GLWYGSQVQ 
GVPKAVWliT 
YADLRYHQDV 
FLRRP 



41 
I 

SKMMWCSAAV 
LEFPLDSFST 
KSQGDVAXiPS 
TLSSSAICSS 
RVPLTHPATC 
RVDLLFLLDS 
QDVPDLVWSL 
VAGPARHARA 
QRPGCRTQAL 
TAFGLDTKPT 
GGRGAEDAAV 
LIEWLCGEAK 



Seq ID NO: 446 DNA sequence 

Nucleic Acid Accession #: NMJJ31942. 

Coding sequence: 145.. 1260 



1 

I 

CCCGAGCCCC 
TGCTCCTCCT 
CCGATCTGGG 
GTAAAGAAGA 
TCCTCTGATG 
TCAGTTCGGG 
GCGATGAAGT 
CAGCCCTCAG 
AATTTTTTGG 
ATGTCTGAAT 
GACTCACAAT 
CCTGAACGGA 
GCTCTACCCA 
ACCGTGGATG 
GTGACCCTTC 
GTCTGCAGCA 
TGCCGTCAGA 
CGAGGCCAGT 
CTGCTGGATC 
CGGCAGCGAG 
TTTGGGAATG 
TATCTGGAAA 
TTTTTTCACT 
AATCAAGTTA 



11 

I 

GCCCCTCCGG 
GCTGTGGGAC 
CACCCGCCAC 
ACTTAAAGAA 
ACAGTTGTGA 
AAGGCTGTAG 
TTCCAGCGCG 
AGAATTCTGT 
AGAAAAGGGC 
TAGAAAGCTT 
CAAGGAGACC 
GAGCTCGTCC 
TGGAGGAGGA 
GCTACATGAA 
CGCATATAAT 
ATTCTCGAGA 
AGACTATTGA 
TCTGTGGCCC 
CGAACTGGCA 
ATGGACGGTG 
TGCATGCCTA 
ATTTGCTGCC 
GAAACCTGAG 
ATCTTAGCAG 



21 

[ 

GCCCGGGTCG 
CGCTGACCGC 
CAGCATGGAC 
ATTCAGATAT 
CAGCTTTGCT 
GACCCGCAGC 
GAGTACCAGG 
GACTGATTCC 
TTTAAATATA 
CCCTGGCTCG 
GCGAAGGCGT 
TCTTACCAGG 
GGAGGAAGAG 
TGAAGATGAC 
TCGCCCAGTG 
GAAGATATAT 
TACCAAAACA 
CTGCCTTCGA 
TTGCCCGCCT 
TGCGACTGGG 
CTTGAAAAGC 
TGCCTTCTAC 
TTAAAAATCT 
ACATGTGTTT 



31 

I 

GCGCGCCCAG 
GCGGCTGCTC 
GCTCGCCGCG 
GTGAAGTTGA 
TCTGATAATT 
CAGTGCAGGC 
GGAGCAACCA 
AACTCCGATT 
AAGCAAAACA 
TTCCGTGGAA 
ACATTCCCGG 
TCAAGGTCCC 
GATAAGTACA 
CTGCCCAGAA 
GAAGAAATTA 
AACCGTTCAC 
AACTGCAGAA 
AACCGTTATG 
TGTCGAGGAA 
GTCCTTGTGT 
CTGAAACAGG 
TTCTCAAATC 
TGATGATCAG 
CTGGAGCATC 



41 

I 

CCTGCCAGCC 
CGCTCTCCCC 
TGCCGCAGAA 
TTTCCATGGA 
TTGCAAACAC 
ACTCTGGACC 
ACAAAAAAGC 
CAGAAGATGA 
AAGCAATGCT 
GACATCCCCT 
GTGTTGCTTC 
GGATCCTCGG 
TGTTGGTGAG 
GCCGTCGCTC 
CAGAGGAGGA 
TGGGCTCTAC 
ACCCAGACTG 
GTGAAGAGGT 
TCTGCAACTG 
ATTTAGCCAA 
AATTTGAAAT 
TTTCTTGTAA 
CCTGTTTCAT 
ACAGAAGGTA 



51 

I 

GCGCTGCTGC 
GCTCCAAGCG 
AGATCTCAGA 
AACCTCGTCA 
GAGGCTGCAG 
TCTCAGGGTG 
AGAGTCCCGC 
AAGTGGAATG 
TGCAAAACTC 
CCCAGGCTCC 
CAGGAGAAAC 
GTCCCTTGAC 
AAAGAGGAAG 
CAGATCATCC 
GTTGGAGAAC 
TTGTCATCAA 
CTGGGGCGTT 
CAGGGATGCT 
CAGTTTCTGC 
ATATCATGGC 
GCAAGCATAA 
AAGTTTCCAA 
AAGAAACTCC 
TATTGCTAGT 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



51 
I 

DIMFLLDGSN 
QQEVKARIKR 
KQLKERGVTV 
ATPDCRVEAH 
YRTTCPGPCD 
SAGTTLDGFL 
DGIPFRGGPT 
REIiLLLGVGS 
DLVFMU3TSA 
RAAMLRAISQ 
PAQKLRNNGI 
QPVNLCKPSP 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



352 



WO 02/086443 

TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC IS 00 

TCTATTTCCA ATGCTCCTCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 1560 

TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 1S20 

GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 1680 

CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740 

TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 1860 

CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920 

AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1980 

ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 2040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 2160 

CAAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGG AGA T 2220 

GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 2280 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340 

AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 2400 

GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460 
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 



Seq ID NO j 447 Protein sequence 
Protein Accession #: NP_114148.1 

1 11 21 31 41 51 

I I I 1 I I 

MDARRVPQKD LRVKKNLKKP RYVKLISMET 9S93DDSCDS FASDNFANTR LQ6VREGCRT 60 
RSQCRHSGPL RVAMKFPARS TRGATNKKAE SRQPSENSVT DSNSDSEDES GMNFLEKRAL 120 
NIKQNKAMIjA KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 180 
TRSRSRILGS UDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240 
PVEEITEEEL ENVCSNSREK I YNR SLGSTC HQCRQKTIDT KTNCRNPDCW GVRGQFCGPC 300 
LRNRYGEEVR DAliLDPNWHC PPCRGICNCS FCRQRDGRCA TGVLVYIAKY HGFGNVHAYL 360 
KSLKQEFEMQ A 



Seq ID NO t 448 DNA sequence 
Nucleic Acid Accession #: NM_019894 
Coding sequence: 1..1314 

1 11 21 31 41 51 

| I I I 'I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 

Seq ID NO: 449 Protein sequence 
Protein Accession #» NP_063 947.1 

1 11 21 31 41 51 

I I ] I I I 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL fcSLASIIIW VLIKVILDKY 60 

YFLCGQPLHF I PRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120 

GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDWEITEN SQELRMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TFSGTVRPIC LPFFDEELTP ATPl»WIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420 



Seq ID NO: 450 DNA sequence 

Nucleic Acid Accession #: XMJ)51860.2 

Coding sequence » 52 . . 3042 

1 11 21 31 41 51 

1 I I 1 I I 

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60 
GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120 
GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 180 
CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 240 
AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 300 
TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 360 



353 



WO 02/086443 

GTGGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420 
GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC - 480 

TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540 

AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600 

5 CAGCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660 

GGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCAT CAT AC ATTCTCTCGC 720 

TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780 

TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 84 0 

CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 

10 AAGATGATCA CAGGAGACTC CTACCCAGGQ TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 

GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCAT CAACTG TGCCGCTGCA 1020 

GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAAOGGG CCCCTCCGTG 1080 

GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140 

GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200 

15 TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260 

GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320 

AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 13 80 

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGACGGAA 1500 

20 ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 

AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1680 

CGCCTGAATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG 1740 

GACGTTCCGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTQ 1800 

25 GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG ACGGCTCCGT GTCCGAGTAC 1860 

CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920 

GTTCCCGACT GGAGAGGGGC CATTTGCAGT GGGTGCTATG CACAGATGTA CATTCAAGCC 1980 

TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040 

TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100 

30 CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCCGAACT CGCCATCTGG 2160 

CTCATCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 

ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 

GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 

TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 

35 GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460 

CCAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520 

GCTGTCGTAG ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

CATTTCTTGG AGGTOAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640 

TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT CGGAGGATGG CATCCAGGTG 2700 

40 GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACT CCATT 2760 

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820 

GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880 

AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2 940 

GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3 000 

45 CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCCGGT 3 060 

GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 

GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 

GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 

r| _ CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 

50 TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3 360 

GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3 420 

GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3 480 

GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3 540 

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 

55 GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 

AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 

GAGAGGTGAG AACTAATGCC TAG CTTG AGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3 840 

GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900 

60 CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3 960 

TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020 

CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 

- _ ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 

65 TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260 

AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 

TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 

AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 

TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500 

70 GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560 

CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620 

CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 

TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 

_ _ GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 

75 GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860 

ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 

ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 

CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 

ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 

oO CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 

CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 

AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTCTT 5280 

CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 

oc AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 

85 GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 

ACCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520 

TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580 



PCT/US02/12476 
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ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640 
GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTGTA GGCATCACCA TGAACAAAGA S70O 
TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 
AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 



Seq ID NO: 451 Protein sequence 
Protein Acceooion #t XP_051860.2 

1 11 21 31 41 51 

1 I I I I I 

MDGVNLSTEV VYKKGQDYRF ACYDRGRACR SYRVRFLCGK PVRPKLTVTI DTNVNSTILN 60 

LEDNVQSWKP GDTLVXASTD YSMYQAEEFQ VLPCRSCAPN QVKVAGKPMY LHIGEEIDGV 120 

DMRAEVGLLS RNIIVMGEME DKCYPYRNHI CNPFDPDTFG GHIKFALGFK AAHliEGT ELK 180 

HMGQQLVGQY PIHFHLAGDV DERGGYDPPT YIRDLSIHHT FSRCVTVHGS NGLLIKDWG 240 

YNSLGHCFFT EDGPEERNTF DHCLGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 300 

DCNAVSTFWM ANPNNNLINC AAAGSEETGF WF I FHHVPTG PSVGMYSPGY SEHIPLGKFY 3 60 

NNRAHSNYRA GMIIDNGVKT TEASAKDKRP FLSIISARYS PHQDADPLKP REPAIIRHFI 420 

AYKNQDHGAW LRGGDVWLDS CRFADNGIGL TLASGGTFPY DDGSKQBIKN SLFVGESGNV 480 

GTEMMDNRIW GPGGLDHSGR TLPIGQNFPI RGIQLYDGPI NIQNCTFRKF VALEGRHT9A 540 

LAFRLNNAWQ SCPHNNVTGI AFEDVPITSR VFFGEPGPWF NQLDMDGDKT SVFHDVDGSV 600 

SEYPGSYZiTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSNLR MKIIKNDFPS 660 

HPLYLEGALT R9THYQQYQP WTLQKGYTI HWDQTAPAEI* AIWLINFNKG DWIRVGLCYP 720 

RGTTFSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 780 

NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAWDVPM PKKI*FGSQI*K 840 

TKDHFLEVKM ESSKQHFFHL WNDFAYIBVD GKKYPSSEDG IQVWIDGNQ GRWSHTSFR 900 

NSILQGIPWQ 1»FNYVAT I PD NSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLKEQMAFV 960 
GFKGSFRPIW VTLDTEDHKA KIFQWPIPV VKKKKL . ... 

Seq ID MO i 452 DNA sequence 

Nucleic Acid Accession ftt Eos sequence 

Coding sequence: 261.. 2861 

1 11 21 31 41 51 

I 1 I 1 1 1 

GAGCTAGCGC TCAAG CAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGOGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGG CATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCAT CCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 1740 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGA CCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 

AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2640 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2820 

CGGGGTCGCC CTTTGCTCAC GTCTCTCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 

GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 

TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 3060 

CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120 
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WO 02/086443 

TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC 
TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG 
GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC 
AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC 
AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT 
AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG 
TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG 
CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT 
TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA 
TAAGAGGGAG AACT CTATCT GTGGTTTATA ATCTTGCACG 
GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA 
AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG 
TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA 
GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG 
GGAAGGACTT CTTCCAGGGA GATTAGTGGT GATGGAGAGQ 
TC CT TCT T GT CCACGGTTTT GTTGAGTTTT CACTCTTCTA 
GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT 
TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAGTT 
AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC 
TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG 
TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG 
TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT 
TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT 
ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT 
GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG 
TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG 
AAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 453 Protein sequence 
Protein Accession #s Eos sequence 



1 
I 

MGAAGRQDFL 
GKTU.LTSSA 
TIILYGRADE 
ERSWGHRGVI 
SRNLDDMARK 
KLFQTEHGEY 
IQATTMDGVN 
STILNLEDNV 
EIDGVDMRAE 
GTELKHMGQQ 
KDWGYNSLG 
PKPRQDCNAV 
liGKPYNNRAH 
IRHFIAYKNQ 
FSPPCRCLLK 



11 
I 

PKAMLTI SWLi 
TVYSIHISEG 
GIQPDPYYGL 
VHVIDPKSGT 
AMTKLGSKHF 
FNVSLSSEWV 
LSTEWYKKG 
QSWKPGDTLV 
VGLLSRNIIV 
LVGQYPIHPH 
HCFPTEDGPE 
STFWMANPNN 
SNYRAGMIID 
DHGAWLRGGD 
LVTTGSPFAH 



21 
I 

TLTCPPGATS 
GKliVIKDHDE 
KYIGVGKGGA 
VIHSDRFDTY 
LHLGFRHPWS 
QDVEWTEWFD 
QDYRPACYDR 
IASTDYSMYQ 
MGEMEDKCYP 
LAGDVDERGG 
ERNTPDHCLG 
NLINCAAAGS 
NGVKTTEASA 
VWLDSCHFRG 
VSLAHS 



31 

I 

TVAAGCPDQS 
PIVLRTRHII* 
LELHGQKKLS 
RSKKESERLV 
FLTVKGNPSS 
HDKVSQTKGG 
GRACRSYRVR 
AEEFQVLPCR 
YRNHICNPFD 
YDPPTYIRDL 
LLVKSGTLLP 
EETGFWPIPH 
KDKRPFIiSII 
EAQEGFLLTG 



CACACAGCTA 
CCTCAACAAC 
AAGTCCCCTC 
TCCTGCTCCC 
CACTTTGTTC 
TGCTACCTGG 
CTCGCCATGT 
CCACCCCAGT 
GAGAGCCCAA 
AGGCACCAGA 
CCACAAACTC 
AGACTCGGTC 
CCAAACATCT 
CTTCAAAGGC 
AGAGTTAAAA 
ATGCAAGGGT 
GTTGAATGTC 
GTACATATGT 
AAGAGCCAAT 
TCCTCCTTGT 
CTCCTCTGAA 
AGATGTATAG 
ATTTTCTATT 
TGTCTTAAAT 
CCAAGGAAAT 
ATGTCAAAAA 



41 

I 

PELQFWNPGH 
IDNGGEIiHAG 
WTFLNKTLHP 
QYLNAVPDGR 
SVEDHIEYHG 
EKISDLWKAH 
FLCGKPVRPK 
SCAPNQVKVA 
FDTFGGHIKF 
SIHHTFSRCV 
SDRDSKMCKM 
HVPTGPSVGM 
SARYSPHQDA 
MKAGGIIiLGG 



AGGGAGGGCC 
CGGCCCCAGA 
GAAGGAAAGG 
AGCGCACACA 
ACTACCTGTC 
CTCTCCTGTC 
TTCTGGTGAG 
CCCTTTCAGC 
AGAGCTCCTG 
GTCTCCCTGG 
TTTCCTTCAA 
CAAGAGTCCA 
TTCAGCTGCT 
CAGAGTCACA 
TGACCTCATG 
CTCACACTGT 
TTTGGCTCAG 
TTCACAGTAC 
ATCTAGGCAT 
TATTTCTGTT 
ATGCTTGTCT 
AGTGTTTGTA 
TATTTATTAT 
GTCATGATTG 
TGGCTCTGGT 
AAAAAAAAAA 



51 
I 

DQDHHVHIGQ 
SALCPFQGNF 
GGMAEGGYFF 
ILSVAVNDEG 
HRGSAAARVF 
PGKICNRPID 
LTVTIDTNVN 
GKPMYLHIGB 
ALGFKAAHLE 
TVHGSNGLLI 
ITEDSYPGYI 
YSPGYSEHIP 
DPLKPREPAI 
DEAASGMAQG 



Seq ID NO: 454 DNA sequence 

Nucleic Acid Accession tt: NMJU3282.2 

Coding sequences 8 5.. 2 4 66 



CGACTCCTTA 
GTCCCTCCCC 
ACCCACACGG 
CAGG AG CTGT 
GAGGACGGCC 
GTCCGCCAGA 
ACCGACTCCG 
GCGGCCGCCG 
GGGCTGTACA 
GAGGCGCAGG 
ACGTCCAGGC 
GAGAACGGCG 
AAGTGGCAGG 
AAGGAGCGGG 
CGGGAACTCT 
TTCGTGGACG 
CCCATGAGAC 
TGCCGGGTCT 
TGCGATGAGT 
CCCAGCGAGG 
GCGGGAGAGC 
TCACAGCGGG 
GTCCCGTCCA 
CGAGTCCAGG 
AGCAACGACG 
GGGAATTTTT 
GCGGAACAGT 
TTTGCTCCCA 
GTCAGGGTGG 
AACCGCTACG 
TTTCTCGTGT 
GAGGGGAAGG 



11 

I 

GAGCATGGCA 
TCAGCGCCGA 
TGGACTCGCT 
TCCACGTGGA 
ATACCCTCTT 
GCCTCGTGCT 
GCTGCTGCCT 
AGACTGACAG 
AGGTCAATGA 
TGGTCAGGGT 
CGGCGCTGGA 
TGGTCCAGAT 
ACCTGGAGGT 
GCTTCTGGTA 
ACGCCAACGT 
AAGTCTTCAA 
GGAAGAGCGG 
GCGCCTGCCA 
GCGACATGGC 
ACGAGTGGTA 
GGCTGAGAGA 
ACTGGGGCAA 
ACCACTACGG 
TCAGCGAGTC 
GAGCGTACTC 
TCACATACAC 
CTTGTGATCA 
TCAATGACCA 
TGCGCAATGT 
ATGGCATCTA 
GGCGCTACCT 
ACCGGATCAA 



21 

I 

TGGCTCAGAG 
CACCATGTGG 
GTCCAGGCTG 
GCCAGGCCTG 
CGACTACGAG 
CCCCCACAGC 
GGGCCAGAGT 
CAGGCCAGCC 
GTACGTCGAT 
GACGCGGAAG 
GGAGGACGTC 
GAACTCCAGG 
GGGCCAGGTG 
CGACGCGGAG 
GGTGCTGGGG 
GATTGAGCGG 
GCCGTCCTGC 
CCTGTGCGGG 
CTTCCACATC 
CTGCCCTGAG 
GAGCAAGAAG 
GGGCATGGCC 
ACCCATCCCG 
GGGTGTCCAT 
CCTAGTCCTG 
GGGTAGTGGT 
GAAACTCACC 
AGAAGGGGCC 
CAAGGGTGGC 
CAAGGTTGTG 
TCTGCGGAGG 
GAAGCTGGGG 



31 
I 

GTGCTGGTAA 
ATCCAGGTTC 
ACCAAGGTGG 
CAGAGGCTGT 
GTCCGCCTGA 
ACCAAGGAGC 
GAGTCAGACA 
GATGAGGACA 
GCTCGGGACA 
GCCCCCTCCC 
ATTTACCACG 
GACGTCCGAG 
GTCATGCTCA 
ATCTCCAGGA 
GATGATTCTC 
CCGGGTGAAG 
AAGCACTGCA 
GGCCGGCAGG 
TACTGCCTGG 
TGCCGGAATG 
AAGGCGAAGA 
TGTGTGGGCC 
GGGATCCCCG 
CGGCCCCACG 
GCGGGGGGCT 
GGTCGAGATC 
AACACCAACA 
GAGGCCAAGG 
AAGAATAGCA 
AAATACTGGC 
GACGATGATG 
CTGACCATGC 



41 
I 

AACTGATGGG 
GGACCATGGA 
AGGAGCTGAG 
TCTACAGGGG 
ATGACACCAT 
GGGACTCCGA 
AGTCCTCCAC 
TGTGGGATGA 
CGAACATGGG 
GGGACGAGCC 
TGAAATACGA 
CGCGCGCCCG 
ACTACAACCC 
AGCGCGAGAC 
TGAACGACTG 
GGAGCCCCAT 
AGGACGACGT 
ACCCCGACAA 
ACCCGCCCCT 
ATGCCAGCGA 
TGGCCTCGGC 
GCACCAAGGA 
TGGGCACCAT 
TGGCTGGCAT 
ATGAGGATGA 
TTTCCGGCAA 
GGGCGCTGGC 
ACTGGCGGTC 
AGTACGCCCC 
CCGAGAAGGG 
AGCCTGGCCC 
AGTATCCAGA 



3190 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
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51 
I 

GGTTTTTGCT 
CGGGAGGCAG 
GCGGAAGATC 
CAAACAGATG 
CCAGCTCCTG 
GCTCTCCGAC 
CCACGGCGAG 
GACGGAATTG 
GGCGTGGTTT 
CTGCAGCTCC 
CGACTACCCG 
CACCATCATC 
CGACAACCCC 
CAGGACGGCG 
TCGGATCATC 
GGTTGACAAC 
GAACAGACTC 
GCAGCTCATG 
CAGCAGTGTT 
GGTGGTAGTG 
CACATCGTCC 
ATGTACCATC 
GTGGCGGTTC 
ACACGGCCGG 
CGTGGACCAT 
CAAGAGGACC 
TCTCAACTGC 
GGGGAAGCCG 
CGCTGAGGGC 
GAAGTCCGGG 
TTGGACGAAG 
AGGCTACCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA 
CAGGAGGGGG GCTTQGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG 
GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT CCAAGAAAAC 
CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG 
CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG 
TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA 
CGGCCCATCA CGAC CGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA 
CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG 
CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA 
CGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG 
CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT 
CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT 
GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT 
CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT 
ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG 
CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC 
AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC AGTGACATGG 
TGGCCCGTGG CAGCCCGTGG CATGGCGTGG CTCAGCTGTC TGTTGAAGTT 
AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC 
TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG 
CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCAOGTGGG GCCAGGCGTG 
TGTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC 
GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT 
GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA 
GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG 
* CTT ACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT 
TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA AOGTTAGGGT 
TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT 
TTGCAGCCTA TACCTCAATA AAACAGGGAT AT TTT AAATC ACATACCTGC 
GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA 
AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 

seq ID NO: 455 Protein sequence 
Protein Accession &: NP_037414.2 



GGAGGAGCAG 
GAAGTCGGCA 
CAAGGTGGAG 
CAACGCCAAG 
CCCGTTCCAG 
GCTGGTGTTC 
CAGATCCTTT 
CTATGCCATG 
CGGCAATGGC 
AGGGCTCGTT 
TCCTCCGTTC 
ACATTTTCAG 
TCTTTGAAAA 
GGAATCAGAA 
TTGCTGCCAC 
CCAAGGTCAG 
TTCCCCGTGG 
GTTGCAAGGA 
CCCACCAGAC 
GAGTTGGGGC 
TGACTGACGC 
AAGTGCCTTT 
TATTGAAAAT 
TGGGTGCTTG 
CAAGTGAGAA 
TCTAAATGAA 
TTGGTTGTTT 
TTACCAAAGT 
AGACAAACTG 
TTAATGTATT 
TGTCCAGATT 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
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MWIQVRTMDG 
YEVRLNDTIQ 
PADEDMWDET 
D V I YHVKYDD 
AEI8RKRETR 
SCKHCKDDVN 
PECRNDASEV 
IPGIPVGTMW 
SGGRDLSGNK 
GGKNSKYAPA 
LGLTMQYPEG 
SPRRTSKKTK 
TFQCICCQEL 



11 
I 

RQTHTVDSItS 
LLVRQSLVLP 
ELGLYKVNEY 
YPENGWQMN 
TARBLYANW 
RliCRVCACHli 
VLAGERLRES 
RPRVQVSESG 
RTAEQSCDQK 
EGNRYDGIYK 
YLEALANRER 
VEPYSLTAQQ 
VFRPITTVCQ 



21 
I 

RLTKVEELRR 
HSTKERDSEL 
VDARDTNMGA 
SRDVRARART 
LGDDSLNDCR 
CGGRQDPDKQ 
KKKAKMASAT 
VHRPHVAGIH 
LTNTNRALAL 
WKYWPEKGK 



SSLIREDKSN 
KNVCKDCLDR 



31 

! 

KIQELFHVEP 
SDTDSGCCLG 
WFEAQWRVT 
IIKWQDLEVG 
IIFVDEVFKI 
LMCDECDMAF 
SSSQRDWGKG 
GRSNDGAYSb 
NCFAPINDQE 
SGFLVWRYliL 
EQQEGGFASP 
AKLWNEVLAS 
SFRAQVFSCP 



41 

I 

GLQRLFYRGK 
QSESDKSSTH 
RKAPSRDEPC 
QWMLNYNPD 
ERPGEGSPMV 
HIYCLDPPLS 
MACVGRTKEC 
VLAGGYEDDV 
GAEAKDWRSG 
RRDDDEPGPW 
RTGKGKWKRK 
LKDRPASGSP 
ACRYDLGRSY 



Seq ID NO: 456 DNA sequence 

Nucleic Acid Accession #t NM_00l200.l 

Coding sequence: 325.. 1514 



GGGGACTTCT 
TGCCCCAGCG 
TGCCCGACAC 
GAGAAGGAGG 
AGAGTTTTTC 
CTGCGGTCTC 
TTCCCCAGGT 
TCGCGGCGGC 
TCGAGTTGCG 
CCGTGGTGCC 
CCGCCCCAGA 
ACCATGAAGA 
TCTTTAATTT 
TCCGAGAACA 
TTTATGAAAT 
ACACCAGGTT 
TGATGCGGTG 
TGGAGGAGAA 
ATGAACACAG 
GGCATCCTCT 
AGTCCAGCTG 
GGATTGTGGC 
TGGCTGATCA 
ACTCTAAGAT 
ACCTTGACGA 
GTGGGTGTCG 



11 
I 

TGAACTTGCA 
GAGCCTGCTT 
TGAGACGCTG 



CATGTGGACG 
CTAAAGGTCG 
CCTCCTGGGC 
GTCGTCGGGC 
GCTGCTCAGC 
CCCCTACATG 
CCACCGGTTG 
ATCTTTGGAA 
AAGTTCTATC 
GATGCAAGAT 
CATAAAACCT 
GGTGAATCAG 
GACTGCACAG 
ACAAGGTGTC 
CTGGTCACAG 
CCACAAAAGA 
TAAGAGACAC 
TCCCCCGGGG 
TCTGAACTCC 
TCCTAAGGCA 
GAATGAAAAG 
CTAGTACAGC 



21 
I 

GGGAGAATAA 
CGCCATCTCC 
TTCCCAGOGT 
AAGGAACGGA 
CTCTTTCAAT 
ACCATGGTGG 
GGCGCGGCTG 
CGCCCCTCAT 
ATGTTCGGCC 
CTAGACCTGT 
GAGAGGGCAG 
GAACTACCAG 
CCCACGGAGG 
GCTTTAGGAA 
GCAACAGCCA 
AATGCAAGCA 
GGACACGCCA 
TCCAAGAGAC 
ATAAGGCCAT 
GAAAAACGTC 
CCTTTGTACG 
TATCACGCCT 
ACTAATCATG 
TGCTGTGTCC 
GTTGTATTAA 
AAAATTAAAT 



31 

I 

CTTGCGCACC 
GAGCCCCACC 
GAAAAGAGAG 
CATTCGGTCC 
GGACGTGTCC 
CCGGGACCCG 
GCCTCGTTCC 
CCCAGCCCTC 
TGAAACAGAG 
ATCGCAGGCA 
CCAGCCGAGC 
AAACGAGTGG 
AGTTTATCAC 
ACAATAG CAG 
ACTCGAAATT 
GGTGGGAAAG 
ACCATGGATT 
ATGTTAGGAT 
TGCTAGTAAC 
AAGCCAAACA 
TGGACTTCAG 
TTTACTGCCA 
CCATTGTTCA 
CGACAGAACT 
AGAACTATCA 
ACATAAATAT 



41 

I 

CCACTTTGCG 
GCCCCTCCAC 
ACTGCGCGGC 
TTGCGCCAGG 
CCGCGTGCTT 
CTGTCTTCTA 
GGAGCTGGGC 
TGACGAGGTC 
ACCCACCCCC 
CTCAGGTCAG 
CAACACTGTG 
GAAAACAACC 
CTCAGCAGAG 
TTTCCATCAC 
CCCCGTGACC 
TTTTGATGTC 
CGTGGTGGAA 
AAGCAGGTCT 
TTTTGGCCAT 
CAAACAGCGG 
TGACGTGGGG 
CGGAGAATGC 
GACGTTGGTC 
CAGTGCTATC 
GGACATGGTT 
ATATATA 
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QMEDGHTLFD 
GEAAAETDSR 
98TSRPALEE 
NPKERGFWYD 
DNPMRRKSGP 
SVPSEDEWYC 
TIVPSNHYGP 
DKGNFFTYTG 
KPVRWRNVK 
TKEGKDRIKK 
SAGGGPSRAG 
FQLFLSKVEE 
AMQVNQPLQT 
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TCCTCGGCCT 
CGGCACCCGG 
TCCTTTGACC 
CTTAGACGGA 
GCGTTGCTGC 
CGCAGGAAGT 
CTGAGCGAGT 
AGCAGGGACG 
CCGGGCTCAC 
CGCAGCTTCC 
CGGAGATTCT 
CTTCAGGTTT 
CGAATTAATA 
AGACTTTTGG 
ACCCCCGCTG 
GTGGCCCACT 
TTGCACCAAG 
GATGGAAAAG 
AAACGCCTTA 
TGGAATGACT 
CCTTTTCCTC 
AACTCTGTTA 
TCGATGCTGT 
GTGGAGGGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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11111 
MVAGTRCLLA LLLPQVLLGG AAGLVPELGR RKFAAASSGR PSSQPSDEVL 
PGLXQRPTPS RDAWPPYML DLYRRHSGQP GSPAPDHRLE RAASRANTVR 
LPETSGKTTR RPPFNLSSIP TEEFITSAEL QVFREQMQDA LGNNSSFHHR 
TANSKPPVTR LLDT 

Seq ID NO: 458 DNA sequence 
Nucleic Acid Accession #: NM_001999.2 
Coding sequence: 1..8736 

1 11 21 31 41 51 

I I I i I I 

ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG CTGTGTGGTG 60 

CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 

CAGCCGCCGC CGCAACAGGT TCGGTCCGCT ACAGCAGGCT CTGAAGGCGG GTTTCTAGCG 180 

CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 240 

GACGTGCTCC GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 300 

TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 360 

GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 420 

TGTGGATCAA AATCAATTCA GCAGTGCAGT GTGAGATGCA TGAATGGTGG GACCTGTGCA 480 

GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT 540 

GAAAATGCAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG 600 

TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC 660 

AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTGTGCC 720 

ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 780 

CGACGGGGTT TCATCCCCAA CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 840 

GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 900 

AGATGCCCTG CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960 

TGCAGCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT CCAACACCGT GGGAAGCTAT 1020 

TTTTGTGTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCGATCAG 1080 

AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG 1140 

AGAATGACGA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200 

CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 

CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 

TTTGCCCCAA GTGGCAATGG CAATGGCTAT GGCCCAGGAG GGACAGGCTT CATCCCCATC 13 80 

CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 

GGACCTATCA TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1500 

GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560 

AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 162 0 

AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 

GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG 1740 

AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 

TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 

ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 

ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 

TGCCAGACCC CAGGAATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040 

TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2100 

ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 

GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 

CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 

ATCACTGTGG ATGGAAGAGA TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340 

GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 

GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 

GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 

GTGTTCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 

GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATGTTC GCCCGGCAGC 2640 

AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 

ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760 

GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2B20 

CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 

TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940 

TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3 000 

CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 

ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 

AAACCTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 

GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 32 40 

GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300 

AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAGGATT 3 360 

TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 

TGCTTCGAAG GCTATGAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTQACGGA 3480 

TGTGAACGTA ACCCTCTCCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 

CAGTGTGACT GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3600 

AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660 

ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 

GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3780 

GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3840 

GCAGACATTG ATGAATGTGA AAACAATCCT GATATCTGTG ATGGCGGCCA GTGTACCAAC 3900 

ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960 

ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 

GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 

ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 

GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200 

AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260 

AATGCTCAGT GTGTAAATAC CCCGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT 4320 

GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 4380 



51 
I 

SEFELRLLSM 60 
SFHHEESLEE 120 
INIYEIIKPA 180 
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AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT 4440 

CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4500 

GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 

GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 

TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 

TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 

AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4800 

AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGQ GAAACCCCTG TGAGACATGC 4860 

CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCCG GAGGTGAAGG CTTCAGACCT 4920 

AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 

CAGGGTGGAA ACTGCATCAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 

TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 

GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCC CACCT 5160 

GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 

CGAAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 5280 

TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT 5340 

CCAGGAACAG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5400 

CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 

GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CG CTGTGAAT GCCCTACAGG ATTCAGTTAC S520 

AATGACCTGC TGTTGGTTTG TGAAGATATA GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 

CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT 5640 

TTCAAACTTT CACCCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTCCTAAC 5700 

GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 

GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 

CCATGTGGAA ATGGAACTTG TAAAAACACC. GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880 

GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT 5940 

GGTCAGGTGT GCAGAAATGG ACGTTGTTTT AATGAAATTG GTTCTTTCAA GTGTCTATGT 6000 

AACGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC 6060 

GCCCTTCCCG GCTCTTGCTC TCCTGGTACC TGTCAGAATT TGGAGGGATC CTTCAGATGC 6120 

ATCTGTCCCC CAGGGTATGA AGTAAAAAGC GAGAACTGCA TTGATATAAA TGAATGTGAT 6180 

GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC 6240 

CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGACGGA GATGCTTTGA TACTCGCCAG 6300 

AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC TTTCAACACC 6360 

ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG GCTGGGGGGA CCCCTGTGAG 6420 

CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG CCATGGAACT 6480 

GTCCCTAGTC TTCATGATAC ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT 6540 

TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 6600 

TACAACCTTG ACTACACTGG AGTACGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660 

CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 6720 

GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 6780 

CTGCTGTGTG CTTTACGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 

GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 

TTACA CGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC CTTCATGTGC 6960 

ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 

TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 

AGATGTGAGT GTAATGAAGG ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 

CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200 

CGCAATCTCG TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAG 7260 

TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 

GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380 

AATGGTCAGT GCATCAATAC CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC 7440 

ACAGACATCA GTGGAACCTC TTGTATAGAQ CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500 

TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAG GGGGT AT 7560 

GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 7620 

AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 7680 

TTCACACAGC ATCACACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 

TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 

GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 

CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7920 

GGCTACATCC AGCACTACCA GTGGAATCAG TGTGTCGATG AGAATGAATG CTCCAATCCC 7980 

AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 

TCGGGGTTCT CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 

TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACACGGAGG GGGGCTACCT CTGTGGCTGC 8160 

CCCCCTGGGT ATTACAG AG T GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 

GGG CAGTACC TGTCACTGGA TACAGAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 8280 

TGCTACGAGT GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 

CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 

CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 8460 

AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 

GACAGCGTCT TCCGCATCCA CCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 

AAGCTCATGC CCGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG 8640 

GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8700 

GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT TCACAGACTT GGGCCCAGGC 8760 

TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8820 

AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 

CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 

GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 

TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT tTTTTTTTTT 9060 

CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAAATATA 9120 

GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 

AAGGGAAACT CACTTGTTTT TCTTTAGATT TATAAAT TTG AGCTATTTTT TTTAGAGGTG 9240 

CTTTTTAAAA ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9300 

CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360 

TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGTAGATC GAAGCCAAAG 9420 

CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA 9480 

CAACCACTGT AGCAAAATAC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 

GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 
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CTTACCCAGQ GTGCGCTGCG TCCTCATGGT 
TTGAAAGGGA ACACCTGGCA TTCTGTGGTG 
ATTATGTTCA AGTTATTTCA GGATTGCCAT 
GGAATATATG TTGTTGTTGT TGTTTTAAAC 
TGTAGTTATA CACCATATGC CTCATTTTAT 
ACAATGAATT GATGTTTAGT TTGCTTTAGT 
TATTAAGAGC ACGTATCCAT TATTCTTCTC 
CCAAACCTCA TATGTGAAAT GGCCAAAGCA 
CTGTGCTGAC CAAAGATTAG TAACCAGTTA 
TTAATAACTA AAAAAAAACT CGTGCC 



ACTGTAGGCA GCTGAAGAAC CGCCGTTCCC 9660 
TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720 
ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 
CCATTTTTTT TTTAGAATTT TCATTAATAC 9840 
CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900 
CATTTAAAAA GATATTGTAC CAGGATGTGC 9960 
AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020 
CATGCAGGCT CCTGGTTGTT CCTCTCAAAC 10080 
TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140 



Seq ID NO j 459 Protein sequence 
Protein Accession #: NP_001990.1 



1 ll 21 31 41 51 

I I I I I I 

MGRRRRLCLQ LYFLWLGCW LWAQGTAGQP QPPPPKPPRP QPPPQQVRSA TAGSEGGFLA 60 

PEYREEGAAV ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WKTLPGGNQC IVPICRNSCG 120 

DGPCSRPNMC TCSSGQI8ST CGSKSIQQCS VRCMNGGTCA DDHCQCQKGY IGTYCGQPVC 180 

ENGCQNGGRC IAQPCACVYG PTGPQCERDY RTGPCFTQVN NQMCQGQLTG IVCTKTLCCA 240 

TTGRAWGHPC EMCPAQPQPC RRGFIPNIRT GACQDVDECQ AIPGICQGGN CINTVGSFEC 300 

RCPAGHKQSB TTQKCEDIDE CSIIPGICET GECSNTVGSY FCVCPRGYVT STDGSRCIDQ 360 

RTGMCFSGLV NGRCAQELPG RMTKMQCCCE PGRCWGIGTI PEACPVRGSE BYRRLCMDGL 420 

PMGGIPGSAG SRPGGTGGNG FAPSGNGNGY GPGGTGPIPI PGGNGPSPGV GGAGVGAGGQ 480 

GPIITGLTIL NQTIDICKKH ANLCLNGRCI PTVSSYRCEC NMGYKQDANG DCIDVDECTS 540 

NPCTNGDCVN TPGSYYCKCH AGFQRTPTKQ AC1DIDECIQ NGVLCKNGRC VNSDGSFQCI 600 

CNAGPELTTD GKNCVDHD EC TTTNMCLNGM CINEDGSPKC ICKPGFVLAP NGRYCTDVDE 660 

CQTPGICMNG HCINSEGSFR CDCPPGLAVG MDGRVCVDTH MRSTCYGGIK KGVCVRPFPG 720 

AVTKSECCCA NPDYGFGEPC QPCPAKNSAE FHGLCSSGVG ITVDGRDINE CALDPDICAN 780 

GICENLRGSY RCNCNSGYEP DASGRNCIDI DECLVNRLLC DNGLCRNTPG SYSCTCPPGY 840 

VFRTETETCE DINECESNPC VNGACRNNLG SFNCECSPGS KLSSTGIjICI DSLKGTCWLN 900 

IQDSRCEVNI NGATLKSECC ATLGAAWGSP CERCELDTAC PRGLARIKGV TCEDVNECEV 960 

PPGVCPNGRC VNSKGSFHCE CPEGLTLDGT GRVCLDIRME QCYLKWDEDE CIHPVPGKFR 1020 

MDACCCAVGA AWGTECBECP KPGTKEYETli CPRGAGPANR GDVLiTGRPFY KDINECKAFP 1080 

GMCTYGKCRN TIGSFKCRCN SGFALDMEER NCTDIDECRI SPDLCGSGIC VNTPGSFECE 1140 

CPEGYESGPM MMKNCMD I DG CERNPLLCRG GTCVNTEGSF QCDCPLGHEL SPSREDCVDI 1200 

NECSIjSDNLC RNGKCVNMIG TYQCSCNPGY QATPDRQGCT DIDECMIMNG GCDTQCTN9E 1260 

GSYECSCSEG YALMPDGRSC ADIDECENNP DICDGGQCTN IPGEYRCLCY DGFMASMDMK 1320 

TCIDVNECDb NSNICMFGEC ENTKGSFICH CQLGYSVKKG TTGCTDVDEC EIGAHNCDMH 1380 

ASCLNIPGSP KCSCREGWIG NGIKCIDLDE CSNGTHQCS1 NAQCVHTPGS YRCACSEGPT 1440 

GDG FTCSDVD ECAENINLCE NGQCLNVPGA YRCECEMGPT PASDSRSCQD IDECSFQNIC 1500 

VSGTCNNLPG MFHCICDDGY ELDRTGGNCT DIDECADPIN CVNGLCVNTP GRYECNCPPD 1560 

FQLNPTGVGC VDNRVGNCYL KFGPRGDGSL SCNTEIGVGV SRSSCCCSLG KAWGNPCETC 1620 

PPVNSTEYYT LCPGGEGPRP NPITIILEDI DECQELPGLC QGGNCINTFG SFQCECPQGY 1680 

YLSEDTRICE DIDBCFAHPG VCGPGTCYNT LGNYTCICPP EYMQVNGGHN CMDMRKSFCY 1740 

RSYNGTTCEN ELPFNVTKRM CCCTYNVGKA GNKPCEPCPT PGTADFKTIC GNIPGFTFDI 1800 

HTGKAVDIDE CKEIPGICAN GVCINQIGSF RCECPTGFSY NDLLLVCEDI DECSNGDNLC 1B60 

QRNADCINSP GSYRCECAAG FKLSPNGACV DRNECLEIPN VCSHGLCVDL QGSYQCICHN 1920 

GFKASODQTM CMDVDECERH PCGNGTCKNT VGSYNCLCYP GFELTHNNDC LDIDECSSFF 1980 

GQVCRNGRCF NEIGSFKCLC NEGYELTPDG KNCIDTNECV ALPGSCSPGT CQNLEGSFRC 2040 

ICPPGYEVKS ENCIDINECD EDPNICLFGS CTNTPGGFQC LCPPGFVLSD NGRRCPDTRQ 2100 

SFCFTNPENG KCSVPKAFNT TKAKCCCSKM PGEGWGDPCE LCPKDDEVAF QDLCPYGHGT 2160 

VPSLHDTRED VNECLESPGI C3NGQCIMTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN 2220 

PCGNGTCTNV IGSPECNCNE GFEPGPMMNC ED1NECAQNP LLCALRCMNT FGSYECTCPI 22 80 

GYALREDQKM CKDLDECAEG LHDCESRGMM CKNLIGTFMC ICPPGMARRP DGEGCVDENE 2340 

CRTKPGICEN GRCVNIIGSY RCECNEGFQS SSSGTECLDM RQGLCFAEVL QTICQMASSS 2400 

RNIjVTKSECC CDGGRGWGHQ CELCPLPGTA QYKKICPHGP GYTTDGRDID ECKVMPNLCT 2460 

NGQCINTMGS FRCFCKVGYT TDISGTSCID LDECSQSPKP CKYICKNTEG SYQCSCPRGY 2520 

VLQEDGKTCK DLDECQTKQH NCQFLCVNTL GGFTCKCPPG FTQHHTACID NNECGSQPLL 2580 

CGGKGICQNT PGSPSCECQR GFSLDATGLN CEDVDECDGN HRCQHGCQNI LGGYRCGCPQ 2640 

GYIQHYQWNQ CVDENECSNP NACGSASCYN TLGSYKCACP SGPSPDQFSS ACHDVNECSS 2700 

SKNPCNYGCS NTEGGYLCGC PPGYYRVGQG HCVSGMGFNX GQYLSLDTEV DEENALSPEA 2760 

CYECKINGYP KKDSRQKRSI HEPDPTAVEQ ISLESVDMDS PVNMKFNI»SH LGSKEHILEL 2820 

RPAIQPLNNH IRYVISQGND DSVFRIHQRN GLSYUTTAKK KLMPGTYTLE ITSIPLYKKK 2880 
ELKKLEESNE DDYLLGELGE ALRMRLQIQL Y 

Seq ID NO: 460 DNA sequence 

Nucleic Acid Accession ft: NM_0133 72.1 

Coding sequence: 63.. 617 

1 11 21 31 41 51 

I I I I I I 

GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTG CCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 

ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 
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CTCACATCTA AAGGGGCGGG GOCGTGGTCT GGTTCTGACT TTGTQTTTTT GTGCCCTCCT 960 

GGGGACCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200 

TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG IS 00 

ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 1920 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 

TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2S20 

CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAAT CTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3 840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3 900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAAT CTGT A 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 

Seq ID NO: 461 Protein sequence 
Protein Accession #i NP_03 7504.1 

1 11 21 31 41 51 

I I I I I I 

MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 60 
GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIIMRFCYG 120 
QCNSFYIPRH IRKEEGSFQS CSFCKPKKPT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS 180 
IDLD 

Seq ID no i 462 dna sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..2733 

1 11 21 31 41 51 

) I I I I I 

ATGAAAGTTG GAGTGCTGTG GCTCATTTCT TTCTTCACCT TCACTGACGG CCACGGTGGC 60 
TTCCTGGGGA AAAATGATGG CATCAAAACA AAAAAAGAAC TCATTGTGAA TAAGAAAAAA 120 
CATCTAGGCC CAGTCGAAGA ATATCAGCTG CTGCTTCAGG TGACCTATAG AGATTCCAAG 180 
GAGAAAAGAG ATTTGAGAAA TTTTCTGAAG CTCTTGAAGC CTCCATTATT ATGGTCACAT 240 
GGGCTAATTA GAATTATCAG AGCAAAGGCT ACCACAGACT GCAACAGCCT GAATGGAGTC 300 
CTGCAGTGTA CCTGTGAAGA CAGCTACACC TGGTTTCCTC CCTCATGCCT TGATCCCCAG 360 
AACTGCTACC TTCACACGGC TGGAGCACTC CCAAGCTGTG AATGTCATCT CAACAACCTC 420 
AGCCAGAGTG TCAATTTCTG TGAGAGAACA AAGATTTGGG GCACTTTCAA AATTAATGAA 480 
AGGTTTACAA ATGACCTTTT GAATTCATCT TCTGCTATAT ACTCCAAATA TGCAAATGGA 540 
ATTGAAATTC AACTTAAAAA AGCATATGAA AGAATTCAAG GTTTTGAGTC GGTTCAGGTC 600 
ACCCAATTTC GAAATGGAAG CATCGTTGCT GGGTATGAAG TTGTTGGCTC CAGCAGTGCA 660 
TCTGAACTGC TGTCAGCCAT TGAACATGTT GCCGAGAAGG CTAAGACAGC CCTTCACAAG 720 
CTGTTTCCAT TAGAAGACGG CTCTTTCAGA GTGTTCGGAA AAGCCCAGTG TAATGACATT 780 
GTCTTTGGAT TTGGGTCCAA GGATGATGAA TATACCCTGC CCTGCAGCAG TGGCTACAGG 840 
GGAAACATCA CAGCCAAGTG TGAGTCCTCT GGGTGGCAGG TCATCAGGGA GACTTGTGTG 900 
CTCTCTCTGC TTGAAGAACT GAACAAGAAT TTCAGTATGA TTGTAGGCAA TGCCACTGAG 960 

GCAGCTGTGT CATCCTTCGT GCAAAATCTT T CTGT CAT CA TTCGGCAAAA CCCATCAACC 1020 
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ACAGTGGGGA ATCTGGCTTC GGTGGTGTCG ATTCTGAGCA ATATTTCATC TCTGTCACTG 1060 

GCCAGCCATT TCAGGGTGTC CAATTCAACA ATGGAGGATG TCAT CAGTAT AGCTGACAAT 1140 

ATCCTTAATT CAGCCTCAGT AACCAACTGG ACAGTCTTAC TGCGGGAAGA AAAGTATGCC 1200 

AGCTCACGGT TACTAGAGAC ATTAGAAAAC ATCAGCACTC TGGTGCCTCC GACAGCTCTT 1260 

CCTCTGAATT TTTCTCGGAA ATTCATTGAC TGGAAAGGGA TTCCAGTGAA CAAAAGCCAA 1320 

CTCAAAAGGG GTTACAGCTA TCAGATTAAA ATGTGTCCCC AAAATACATC TATTCCCATC 1380 

AGAGGCCGTG TGTTAATTGG GTCAGACCAA TTCCAGAGAT CCCTTCCAGA AACTATTATC 1440 

AGCATGGCCT CGTTGACTCT GGGGAACATT CTACCCGTTT CCAAAAATGG AAATGCTCAG 1500 

GTCAATGGAC CTGTGATATC CACGGTTATT CAAAACTATT CCATAAATGA AGTTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCATT GTGTGTTTTG GGATTTCAGT 162 0 

CATTTGCAGT GGAACGATGC AGGCTGCCAC CTAGTGAATG AAACTCAAGA CATCGTGACG 1680 

TGCCAATGTA CTCACTTGAC CTCCTTCTCC ATATTGATGT CACCTTTTGT CCCCTCTACA 1740 

ATCTTCCCCG TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT TGG AAGT CTC 1800 

ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA TTAAAAAAAG CCAAACCTCT 1860 

CACACACGTC GTATTTGCAT GGTOAACATA GCCCTGTCCC TCTTGATTGC TGATGTCTGG 1920 

TTTATTGTTG GTGCCACAGT GGACACCACG GTGAACCCTT CTGGAGTCTG CACAGCTGCT 1980 

GTGTTCTTTA CACACTTCTT CTACCTCTCT TTGTTCTTCT GGATGCTCAT GCTTGGCATC 2040 

CTGCTGGCTT ACCGGATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT GATGATGGCT 2100 

GTTGGATTTT GCCTGGG TTA TGGGTGCCCT CTCATTATAT CTGTCATTAC CATTGCTGTC 2160 

ACGCAACCTA GCAATACCTA CAAAAGGAAA GATGTGTGTT GGCTTAACTG GTCCAATGGA 2220 

AGCAAACCAC TCCTGGCTTT TGTTGTCCCT GCACTGGCTA TTGTGGCTGT GAACTTCGTT 2280 

GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 2340 

GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400 

GGGCTCACCT GGGGCTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT GGCTTGGCAT 2460 

GTTATTTTTG CTTTACTCAA TGCATTCCAG GGATTTTTTA TCTTATGCTT TGGAATACTC 2520 

TTGGACAGTA AGCTGCGACA ACTTCTGTTC AACAAGTTGT CTGCCTTAAG TTCTTGGAAG 2580 

CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 2700 
ATCATGCTAA CTCAGTTTGT CTCAAATGAA TAA 

Seq ID NOi 463 Protein sequence 
Protein Accession #s Eos sequence 

1 11 21 31 41 51 

I I I I ! I 

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 60 

EKRDLRNPLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WPPPSCLDPQ 120 

NCYLHTAGAL PSCECHLNNL SQSVNPCERT KIWGTFKINE RFTNDLLNSS SAIYSKYANG 180 

IBIQIiKKAYE RIQGPESVQV TQFRNGSIVA GYEWGSSSA SELLSAIEHV AEKAKTALHK 240 

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300 

LSLItEELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNliASWS ILSNISSLSL 360 

ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPBTII 480 

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH IiVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPWKWITY VGLGISIGSL 600 

ILCIjIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660 

VP FTHFFYLS LFFWMLMLGI LLAYRI ILVP HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCWLNWSNG SKPLLAFWP ALAIVAVNFV WLLVLTKLW RPTVGERLSR 780 

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFAIiLNAFQ GFFILCFGIL 840 

LDSKltRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA FSHTGDSSDN 900 
IMLTQFVSNE 

Seq ID NOt 464 DNA sequence 

Nucleic Acid Accession #: AB035089.1 

Coding sequence i 9845.. 10219 

1 11 21 31 41 51 

1111-11 

GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120 

CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240 

AGGAGGAAGG GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 3 00 

GAGACTATTT CCCTCTCTGC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360 

AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 

GTAGACAGAA TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 

AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACA TAATA AGAAACAATA 540 

AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 

AAGGAAGTGA GGTCCAGGAA AATGTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 

GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 

GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 

CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 

TCAGGATATG TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 

TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 

CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 

TTCAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 

CCATAGATTG GTCCCCTGTA ACCCCGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 

CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TCCTTTTCTT 1200 

TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGCG 1260 

CTGGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT AT CTCAG ATA 1320 

TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATCAACG CCCCATTAGT 1380 

CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 

ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 

GTGAGAAATT TCCAGATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 

CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGAG 1620 

AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGTG TTGGGGAGAA 1680 

TGTGGAGAAA TCAGAACTCT ATTCACGGT C GGTTGGAATG CACACTTGTG CAGAATTCTA 1740 
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TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 

TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC 1860 

ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 

ATCAACTTAT GATTAAAGAA AATCTGGTCT ATT CAT AG AA TGGAATATTA TTCGACCACA 1980 

AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2040 

TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100 

ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 

AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 2220 

GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 2280 

TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340 

TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 2400 

ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 

AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 2520 

CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 2580 

GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640 

AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 2700 

GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 2760 

AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 2820 

ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 2880 

TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940 

GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 

AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 3060 

CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 3120 

ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 3180 

TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 3240 

GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 

CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 3360 

AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 3420 

CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 3480 

TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 

TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 3600 

TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 

GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 

GGAACACAGG TAAQAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 

AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 

TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 3900 

CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3 960 

GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 4020 

AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080 

GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 

TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 

CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 

TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 

TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 4380 

CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 4440 

CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 

CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560 

TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 

AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 4680 

TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 

CACAGAGAAC ACCACAGAAA AAGCTGCAAC AT AT CATGTG AGTCACAGAG CACTCTGATT 4800 

CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 4860 

AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 

TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980 

CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 5040 

AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 

CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 

GCCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 

TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280 

ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340 

CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 

GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 

TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 

TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5580 

AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 

CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 

AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 

AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 5820 

AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880 

GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT AT CAGGG AAT AACATCAAAG 5940 

GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTT AG CA TGGGTTTAGT 6000 

ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 

AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 

ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG ACGTATCAAT TTTTACAGGT 6180 

AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 

AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 6300 

GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 63 60 

ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420 

TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 6480 

CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 

GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 

CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 

GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 6720 

CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780 

TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATC CAT GGACAGGAGA TACTGCATCT 6840 

ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 

TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 



363 
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CATCAAGAAA TTTTACCAQA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 7020' 
AAGTCGAAAO AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 
GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320 
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CAT AT CCTTT ACTTTAAATC 7380 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7S0O 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 
ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920 
TTCTCTATCA CAAGAAGTGA TGTCT CCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520 
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 8760 
CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120 
CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 93 60 
ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 
CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780 
TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 
TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 
GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020 
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 
GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCT CTTCAA 10560 
ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAAAACAC AT CAT CAATA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 
GGGG CAT AAA ATTATCATAT CCACAT CTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGTTTCT 11160 
GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CAT AG ATT CT 11520 
TCCC 

Seq ID NO: 465 Protein sequence 
Protein Acceasion #: BAB21525.1 



1 11 21 31 41 51 

I I I I I I 

MNSLSEANTK FMPOLFQQFR KSKENNIFYS PISITSALGM VLLGAKDNTA QQISKVLHFD 60 

QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE LKIANKLFGE KTYQFLQEYL 120 

DAIKKFYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLFPDGTIGN DTTLVLVNAI 180 

YFKGQWENKF KKENTKEEKF WPNKNTYKSV QMMRQYNSFK FALLEDVQAK VLEIPYKGKD 240 
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LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRPKME ESYDLKDTLR 
TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VEAAAATAW WELSSPSTN 
EEFCCNHPFL FFIRQNKTNS ILFYGRFSSP 

Seq ZD NO: 466 DNA sequence 

Nucleic Acid Accession #: NM_001910.1 

Coding sequence: SO.. 1240 



PCT/US02/12476 



300 
360 



GGAGAGAAGA 
CCTTCTTTTG 
GCCCCTCAGG 
GTTCTGGAAA 
GAGTGCCAAG 
TGGCTCCCCA 
CCCCTCTGTG 
GTCCAGCACA 
GTCCGGGATC 
GTTTGGAGAA 
TCTGGGCCTG 
GATGGCTCAG 
AGGTGGTGCG 
CCTGAATTGG 
GGTGGGAGGC 
TTCCCTCATC 
CCCCGTGGAT 
CTTCACCATT 
CGTGGATGGA 
TGGGCCCCTC 
TGGGAATAAC 
CCTGCCTGTC 
GTTATTTTCC 
ACATGAGAAT 
ACTCCCACCA 
TGATTATGAA 
GCAGAGATCA 
CACACGGCCA 
GTACCTGGAT 
TAACATCCTT 
CTGTTTTTGT 
TTTGAAATGT 
TTGAAATCCC 
AGTTAGATCT 
CTTGTTGCAT 
CTGCTGGGTG 



11 

I 

AAGGAGGGGG 
CTGCTGGTGC 
AGGCATCCGT 
TCCCATAATT 
GAACCCCTCA 
CCACAGAACT 
TACTGCACTA 
TACAGCCAGC 
ATTGGAGCCG 
AGTGTCACAG 
GGATACCCCT 
AACCTGGTGG 
GGGAGCGAGC 
GTCCCAGTCA 
ACTGTTATGT 
ACTGGCCCTT 
GGAGAATATG 
AACGGAGTCC 
ATGCAGTTCT 
TGGATCCTGG 
CGTGTGGGAC 
TGACAGACCT 
AGAGAATGTA 
ACACACACAC 
CCGTCATGAT 
AATCAAAAAT 
TGGTATAATA 
GGCCTGTTTA 
CATTCTGAAG 
AAATATACAA 
AGTTGGATTG 
CTGTAAGTCT 
GAGGTGTCAT 
GGAGGTCTGA 
CCTGTCAGGA 
GGTTTGGAGT 



21 
I 

CAAGGGAGAA 
TCCTGGAGCT 
CCCTCAAGAA 
TGGACATGAT 
TCAACTACTT 
TCACTGTCAT 
GCCCAGCCTG 
CAGGTCAATC 
ACCAAGTCTC 
AG C CAGG CCA 
CCTTGGCTGT 
ACTTGCCGAT 
TGATTTTTGG 
CCAAGCAAGC 
TCTGCTCCGA 
CCGACAAGAT 
CTGTGGAGTG 
CCTATACCCT 
GCAGCAGTGG 
GGGATGTCTT 
TGGCCCCAGC 
TGAATATGTT 
GCTGTTTCCA 
ACACACATAT 
GGAGGAATTA 
TTTCACATTT 
AATCCCTTTG 
TCTACACTGC 
CAAATTCCGA 
TCGGAATTCA 
TTTGTATTAQ 
CTTTCCATCT 
TTGACATGGT 
TTTTGTGGCA 
GGCAGATAAT 
TCTTGGCTTT 



31 

I 

GCTGCTGGTC 
GGGAGAGGCC 
GAAGCTGCGG 
CCAGTTCACC 
GGATATGGAA 
CTTCGACACT 
CAAGACGCAC 
TTTCTCCATT 
TGTGGAAGGA 
GACCTTTGTG 
GGGAGGAGTG 
GTTTTCTGTC 
AGGCTACGAC 
TTACTGGCAG 
GGGCTGCCAG 
TAAGCAGCTG 
TGCCAACCTT 
CAGCCCAACT 
CTTTCAAGGA 
CATTCGACAG 
AGTCCCCTAA 
AGGCTGGGGC 
GGGTTGCAAC 
ACACACACAC 
CGTTATACAT 
GATTATGAAA 
CAACTCCACT 
TGCCCACTCC 
GCATTACATC 
AGCATCTCCC 
GATTCAAGCA 
ACAGAGTTTA 
TCTCTGAACT 
AAAATACTTC 

AATCATT CAT 



41 
I 

GGACTCACAA 
CAAGGATCCC 
GCACGGAGCC 
GAGTCCTGCT 
TACTTCGGCA 
GGCTCCTCCA 
AGCAGGTTCC 
CAGTATGGAA 
CTAACCGTGG 
GATGCAGAGT 
ACTCCAGTAT 
TACATGAGCA 
CACTCCCATT 
ATTGCACTGG 
GCCATTGTGG 
CAAAACGCCA 
AACGTCATGC 
GCCTACACCC 
CTTGACATCC 
TTTTACT CAG 
GGAGGGGCCT 
ATTCTTTACA 
TTGAATTAAG 
ACACTTCACA 
TCATATTTTG 
ATCTCCAAAC 
CAGCCCTGAC 
TCTCTCCAGC 
ATTTTGTCCA 
ATTGTCCCAC 
AGGCCCATAT 
GCACATTTGA 
TATCTTTCCT 
CTAGGTGGTG 
CTCTATTGGT 
TACAAAGTTC 



Seq ID NO: 467 Protein sequence 
Protein Accession #: NP_001901.1 



MKTLLLLLLV 
SMDQSAKEPL 
QPSQSSTYSQ 
FDGILGLGYP 
FSGSLNWVPV 
IGAAPVDGEY 
HPPAGPLWIL 



11 
I 

LLELGEAQGS 
INYLDMEYFG 
PGQSFSIQYG 
SIAVGGVTPV 
TRQAYWQIAL 
AVECANLNVM 
GDVFIRQFYS 



21 

I 

LHRVPLRRHF 
TISIGSPPQN 
TGSIiSGIIGA 
FDNMMAQNIiV 
DNIQVGGTVM 
PDVTFTINGV 
VFDRGNNRVG 



31 
I 

SLKKKLRARS 
FTVTFDTGSS 
DQVSVEGLTV 
DLPMFSVYMS 
FCSEGCQAIV 
PYTLSPTAYT 
LAPAVP 



41 

I 

QLSEFWKSHN 
NLWVPSVYCT 
VGQQFGESVT 



Seq ID NO: 468 DNA sequence 

Nucleic Acid Accession #: NM_018058.1 

Coding sequence: 319.. 1575 



1 
I 

TACGCGCTGC 
GACGGCCGGG 
TACACCGACA 
GTCAACGTGG 
AGAAAGGGCT 
CCTGATGCCC 
CTCAGAGATG 
GTGGGCCCCA 
AACTTCCTTT 
GTGGACGACC 
AAAGTGGACA 
ACCCATGGGA 
GTCCGCACGG 
AACATTGCCT 
GGAGACCCCC 
ACAGGGGGTG 
GGAGAGTCCA 
TGGCTGCGAG 
CTCTACACCA 
TGTGAGATGG 
GTGACGTGGC 



11 
I 

GGGACCGGCA 
AGGAGATCTA 
AGTTGTTCAA 
CCCGTGGTGT 
CTGGACGCTA 
TCATTGAAAT 
TGGCTGCTGA 
TCCTCAGCAG 
TCCACAACCG 
CCCACCAGCA 
TCGTCTATGG 
AGGTCCGCTT 
TCATCACCGC 
ACOGCAGCTC 
TCATCGAGGA 
TGGTGACCGA 
TGGCTCAGCC 
TGGTGCCACG 
AGAAGAGTGG 
AGCCCGTGGC 
CAGATGGCAA 



21 
I 

GGGGAACGCC 
CTTCCTCAAC 
GTTCCGCAAT 
GGCCAGCCTC 
CTCTATCTAC 
GGACCCTGAG 
GGCTGGGGTC 
CAGTGCCTCG 
GGGCGATGGC 
TGGGCGAGGT 
CAACTGGAAT 
CCGGGACATC 
CGACTTTGAC 
CTCAGCCAAC 
GCTCAATCCC 
CTTCGACGGA 
GCTGTCCGTC 
CACCCGGGTT 
GGCCCACCTG 
ACACTTTGGC 
GATGGTGAGC 



31 
I 

ATCGGGGTCA 
ACCAATAATG 
AACCGGTGGG 
TTTGCCGGAC 
ATTGCCAATT 
GCCAGTGACC 
AGCAAATATA 
GATATCTTCT 
ACCTTTGTGG 
GTCGCCCTGG 
GGCCCCCACC 
GCCTCACCCA 
AATGACCAGG 
CGCCTCTTCC 
GGCGACGCCT 
GACGGGATGC 
TTCCGGGGCA 
GGGGCCTTTG 
AGGATCATCG 
CTGGGGAAGG 
CGGAACGTGG 



DTGTSLITGP 
LLDFVDGMQF 



41 
I 

CAGCCTGCGA 
CCTTCTCGGG 
AAGACATCCT 
GCTCTGTGGC 
ACGCCTACGG 
TCTCCCGGGG 
CAGGGGGCCG 
GCGACAATGA 
ACGCTGCGGC 
CTGACTTCAA 
GCCTCTATCT 
AGTTCTCCAT 
AGCTGGAGAT 
GCGTCATCCG 
TGGAGCCTGA 
TGGACCTCAT 
ATCAGGGCTT 
CCAGGGGAGC 
ACGGGGGCTC 
ATGAAGCCAG 
CCAGCGGGGA 



51 
I 

TGAAAACGCT 
TTCACAGGGT 
AGCTCTCTGA 
CAATGGACCA 
CTATCTCCAT 
ACCTCTGGGT 
AGCCTTCCCA 
CCGGGAGCTT 
TTGGCCAGCA 
TTGATGGAAT 
TTGACAACAT 
GTAACCCAGA 
TCTCTGGGAG 
ATAACATCCA 
ACACAGGGAC 
TTGGGGCAGC 
CGGATGTCAC 
TACTGGACTT 
ACCCTCCAGC 
TCTTTGACCG 

CCTACAAAAA 
ACCAAACAGA 
CATACACACC 
TATTGATTTT 
ATATGCACAA 
AACCCATCCA 
TCCACATGCT 
TAAATATTTC 
AAATGTTTGG 
ATTGCATTTA 
ACGTTGCTGG 
ATAAAATGGT 
CTGGGTACTT 
AATGTTAAGA 
AGCATTTT 



51 
I 

LDMIQFTESC 
SPACKTHSRF 
EPGQTFVDAE 
LIFGGYDHSH 
SDKIKQLQNA 
CSSGFQGLDI 



51 
I 

CATCGACGGG 
GGTGGCCACG 
GAGCGATGAG 
CTGTGTGGAC 
TAATGTGGGC 
CATTCTGGCG 
AGGCGTCAGC 
GAATGGGCCT 
CAGTGCTGGT 
CCGTGATGGC 
GCAAATGAGC 
GCCCTCCCCT 
CTTCTTCAAC 
TAGAGAGCAC 
GGGCCGGGGC 
CTTGTCCCAT 
CAACAACAAC 
TAAGGTCGTG 
AGGCTACCTG 
CAGTGTGGAG 
GATGAACTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC 
ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG 
ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG 
ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC 
CCCCCACCGC TGCTGCTGCC ACTGCCGCTG CTGCTGCCGC 
CACCGGTCCT CGT AGATGG A . GATCTCAATC TGGGGTCGGT 
CCAGCTGCTG AGCAGGGGTG GGACATGAAC CAGCGGATGG 
AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA 
CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG 
CTGTGCTGGG CACATAG CTG TGATCACAGC AGACAGGGTC 
ATTCCAGTGG GTCTAATGAC CATATCTTAG GACACAGATG 
CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA 
AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC 
AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG 
GACACTTGGC ACAAAACCTG GCACATAGTA AAGGCTCAAT 
GGGCTTTGTC AACACGTG 

Seq ID HO: 469 Protein sequence 
Protein Accession #t HP 060528.1 



PCT/US02/12476 



i 

I 

MDPEASDLSR 
RGDGTFVDAA 
FRDIASPKFS 
ELNPGDALEP 
RTRVGAFARG 
KMVSRNVASG 
GAGPTRSAVG 



11 
1 

GILALRDVAA 
ASAGVDDPHQ 
MPSPVRTVIT 
EGRGTGGWT 
AKWLYTKKS 
EMNSVLEILY 
ATSPTRMAQP 



21 
I 

EAGVSKYTGG 
HGRGVA1ADP 
ADFDNDQELE 
DFDGDGMLDL 
GAHLRIIDGG 
PRDEDTLQDP 
AWGLSASHRA 



31 

1 

RGVSVGPILS 
NRDGKVDIVY 
IFPNNIAYRS 
ILSHGESMAQ 
SGYLCEMEPV 
APLETPMNAS 
PAPPPPPLLL 



Seq ID NO: 470 DNA sequence 
Nucleic Acid Accession #: AJ279016 
Coding sequence: 1..1962 



1 
I 

ATGTCCAGGA 
CAGCGGGCTG 
AGTAATCCCA 
TTTGAGATCG 
CAGAAGCGGC 
GACCGGCAGG 
GAGATCTACT 
TTGTTCAAGT 
CGTGGTGTGG 
GGACGCTACT 
ATTGAAATGG 
GCTGCTGAGG 
CTCAGCAGCA 
CACAACCGGG 
CACCAGCATG 
GTCTATGGCA 
GTCCGCTTCC 
ATCACCGCCG 
CGCAGCTCCT 
ATCGAGGAGC 
GTGACCGACT 
GCTCAGCCGC 
GTGCCACGCA 
AAGAGTGGGG 
CCCGTGGCAC 
GATGGCAAGA 
CTCTACCCCC 
TTCTCCCAGC 
GTGTGCCCTC 
AACAAGAAGT 
CTCGGCCAGT 
GCTGCTGCCG 
CTGGGGTCGG 
CCAGCGGATG 
AGGGATGTAA 
TAACAATTAG 
CAGACAGGGT 
GGACACAGAT 
CCTGAGTTCA 
AACTTGTTAG 
TTAGTGTGGA 
AAAGGCTCAA 



11 
I 

TGTTACCGTT 
AACCCATGTT 
CCCAGCTCAA 
TCGTGGCGGG 
TGGTGAACAT 
GGAACGCCAT 
TCCTCAACAC 
TCCGCAATAA 
CCAGCCTCTT 
CTATCTACAT 
ACCCTGAGGC 
CTGGGGTCAG 
GTGCCTCGGA 
GCGATGGCAC 
GGCGAGGTGT 
ACTGGAATGG 
GGGACATCGC 
ACTTTGACAA 
CAGCCAACCG 
TCAATCCCGG 
TCGACGGAGA 
TGTCCGTCTT 
CCCGGTTTGG 
CCCACCTGAG 
ACTTTGGCCT 
TGGTGAGCCG 
GGGATGAGGA 
AGGAAAATGG 
GAGACAAGCC 
GCAGTCGGGG 
CACCGGGCCC 
CTGCTGGAGC 
TGGTTAAGGA 
GAGTCCAGCA 
AGGCCTGGGA 
GGAGACTCGT 
CGCTGCCCTG 
GTGCCCAGGG 
AATCCTGATT 
CCATCCATTA 
GATTAGATTA 
TAAAAACAAG 



21 
I 

CCTGCTGCTG 
CACTGCAGTC 
CTATGGTGTG 
GTACAATGGA 
CGCGGTCGAT 
CGGGGTCACA 
CAATAATGCC 
CCGGTGGGAA 
TGCCGGACGC 
TGCCAATTAC 
CAGTGACCTC 
CAAATATACA 
TATCTTCTGC 
CTTTGTGGAC 
CGCCCTGGCT 
CCCCCACCGC 
CTCACCCAAG 
TGACCAGGAG 
CCTCTTCCGC 
CGACGCCTTG 
CGGGATGCTG 
CCGGGGCAAT 
GGCCTTTGCC 
GAT CAT CG AC 
GGGGAAGGAT 
GAACGTGGCC 
CACACTTCAG 
CCATTGCATG 
CGTATGTGTC 
CTACGAGCCC 
CCGCCCCACC 
TGCCACTGCT 
GAGCTGCGAG 
GGGGAGTGGG 
GCTAGACCCT 
AAGGCCAGGC 
ATGGCGCTTA 
AGGTGGTGTC 
CAGGAACTCA 
TCGCATCTGC 
AATGTATGTA 
TGCCTCTCAC 



31 
I 

CTCTGGTTTC 
ACCAACTCAG 
GCAGTTACTG 
CCCAACCTGG 
GAGCGCAGCT 
GCCTGCGACA 
TTCTCGGGGG 
GACATCCTGA 
TCTGTGGCCT 
GCCTACGGTA 
TCCCGGGGCA 
GGGGGCCGAG 
GACAATGAGA 
GCTGCGGCCA 
GACTTCAACC 
CTCTATCTGC 
TTCTCCATGC 
CTGGAGATCT 
GTCATCCGTA 
GAGCCTGAGG 
GACCTCATCT 
CAGGGCTTCA 
AGGGGAGCTA 
GGGGGCTCAG 
GAAGCCAGCA 
AGCGGGGAGA 
GACCCAGCCC 
GACACCAATG 
AACACCTATG 
AACGAGGATG 
ACCCCCACCG 
GCACCGGTCC 
CCCAGCTGCT 
AAAGTGGGCT 
CCCCAAGCCC 
CCTGTGCTGG 
CATTCCAGTG 
ACTGCACAGG 
CAAAGCT AT G 
AAAATGGGGA 
AGACACTTGG 
TGGGCTTTGT 



AGGACCCAGC 
AGACAAGCCC 
CAGTCGGGGC 
ACCGGGCCCC 
TGCTGGAGCT 
GGTTAAGGAG 
AGTCCAGCAG 
GGCCTGGGAG 
GAGACTCGTA 
GCTGCCCTGA 
TGCCCAGGGA 
ATCCTGATTC 
CATCCATTAT 
ATTAGATTAA 
AAAAACAAGT 



41 

1 

SSASDIPCDN 
GNWNGPHRLY 
SSANRLPRVI 
PLSVFRGNQG 
AHFGLGKDEA 
SSHSCALETS 
PLPIiLLPLLE 



41 
I 

TGCCCATCAC 
TTCTGCCTCC 
ATGTGGACCA 
TTCTGAAGTA 
CACCCTACTA 
TCGACGGGGA 
TGG CCACGTA 
GCGATGAGGT 
GTGTGGACAG 
ATGTGGGCCC 
TTCTGGCGCT 
GCGTCAGCGT 
ATGGGCCTAA 
GTGCTGGTGT 
GTGATGGCAA 
AAATGAGCAC 
CCTCCCCTGT 
TCTTCAACAA 
GAGAGCACGG 
GCCGGGGCAC 
TGTCCCATGG 
ACAACAACTG 
AGGTCGTGCT 
GCTACCTGTG 
GTGTGGAGGT 
TGAACTCAGT 
CACTGGAGTG 
AATGCATCCA 
GAAGCTACAG 
GCACAGCCTG 
CTGCTGCTGC 
TCGTAGATGG 
GAGCAGGGGT 
TGTGCTGCTG 
ATCCATGCAC 
GCACATAGCT 
GGTCTAATGA 
AAGTATGAGG 
TGACCTTACA 
TTAAGAATAG 
CACAAAACCT 
CAACACG 



CCCACTGGAG 
GTATGTGTCA 
TACGAGCCCA 
CGCCCCACCA 
GCCACTGCTG 
AGCTGCGAGC 
GGGAGTGGGA 
CTAGACCCTC 
AGGCCAGGCC 
TGGCGCTTAC 
GGTGGTGTCA 
AGGAACTCAC 
CGCATCTGCA 
ATGTATGTAA 
GCCTCTCACT 



51 
I 

ENGPNFLFHN 
LQMSTHGKVR 
RREHGDPLIE 
FNNNWLRWP 
SSVEVTWPDG 
PYVSTPMEAT 
LPLliHRSS 



51 
I 

TGAGGGGTCC 
TGACTATGAC 
TGATGGGGAC 
TGACCGGGCC 
CGCGCTGCGG 
CGGCCGGGAG 
CACCGACAAG 
CAACGTGGCC 
AAAGGGCTCT 
TGATG CCCTC 
CAGAGATGTG 
GGGCCCCATC 
CTTCCTTTTC 
GGACGACCCC 
AGTGGACATC 
CCATGGGAAG 
CCGCACGGTC 
CATTGCCTAC 
AGACCCCCTC 
AGGGGGTGTG 
AGAGTCCATG 
GCTGCGAGTG 
CTACACCAAG 
TGAGATGGAG 
GACGTGGCCA 
GCTGGAGATC 
TGGCCAAGGA 
GTTCCCATTC 
GTGCCGGACC 
CGTGGGGACT 
CACTGCCGCT 
AGATCTCAAT 
GGGACATGAA 
CCTAGACAGT 
ATTACTTAGC 
GTGATCACAG 
CCATATCTTA 
ACTTTAGTGT 
CCAGTCACTT 
AATCTTGGGG 
GGCACATAGT 



1320 
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Seq id NOx 471 Protein sequence 
Protein Accession #: CAC08451 



1 11 21 31 41 51 

I I I J I I 

MSRMLPFLLL LWFLPITEGS QRAEPMFTAV TNSVLPPDYD SNPTQLNYGV AVTDVDHDGD 
FEIWAGYNG PNLVLKYDRA QKRLVNIAVD ERSSPYYALR DRQGNAIGVT ACDIDGDGRE 
E I Y FLNTNNA FSGVATYTDK LFKFRNNRWE DILSDEVNVA RGVASLFAGR SVACVDRKGS 



60 
120 
180 
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GRYSIYIANY AYQNVGPDAL IEMDPEASDL SRGILALRDV AAEAGVSKYT GGRGVSVGPI 240 

LSSSASDIFC DNBNGPNFLF HNRGDGTFVD AAASAGVDDP HQHGRGVALA DFNRDGKVDI 300 

VYGNWNGPHR LYLQMSTHGK VRFRDIASPK FSMPSPVRTV ITADFDNDQE LEIFFNNIAY 360 

RSSSANRLFR VIRREHGDPL IEELNPGDAL EPEGRGTGGV VTDFDGDGML DLILSHGESM 420 

AQPLSVFRGN QGFNNNWLRV VPRTRFGAFA RGAKWLYTK KSGAHLRIID GGSGYLCEME 480 

PVAHFGLGKD EASSVEVTWP DGKMVSRNVA SGEMNSVLEI LYPRDEDTLQ DPAPLECGQG 540 

FSQQENGHCM DTNECIQFPP VCPRDKPVCV NTYGSYRCRT NKKCSRGYEP NEDGTACVGT 600 
LGQSPGPRPT TPTAAAATAA AAAAAGAATA APVLVDGDLN LGSWKESCE PSC 

Seq XD NO: 472 DMA sequence 
Nucleic Acid Accession #> FGENESHH 
Coding sequence? 1..4794 

1 11 21 31 41 51 

I I I I 

ATGGCGTGTC CGGGAGGACT CCCAGCCCGT TGCTCTGGTT GGATGGGACT GGGTGGGCCC 60 

AGCGGCTCCT CCCCAGCATC CCCTCCCCAT TCCTCCTCCA GGTACAATGG ACCCAACCTG 120 

GTTCTGAAGT ATGACCGGGC CCAGAAGCGG CTGGTGAACA TCGCGGTCGA TGAGCGCAGC 180 

TCACCCTACT ACGCGCTGCG GGACCGGCAG GGGAACGCCA TCGGGGTCAC AGCCTGCGAC 240 

ATCGACGGGG ACGGCCGGGA GGAGATCTAC TTCCTCAACA CCAATAATGC CTTCTCGGGC 300 

CACAGCAGCT CAGCGCAGGT CCCTTCTGGG CTCCACAGAA ACAGGCCTGT GCTGAAGCCT 360 

CCACCTACAA CCCCTGCAGG CCTCCTGGGT CTGCCTCCAC TCAGCGGAAG GGACTTTTCC 420 

TCCTCCCTGG GTCAGGCTTC TCCGGACAGC AGGCAGGGAG AGAGGGTGCC GGTTCCCTGC 480 

TGTCGGGGTG GACTGAGACC TACCCATGAA CCAGAACCAT TTCTTCTGAG ACCCAAATCA 540 

GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA ATAACCGGTG GGAAGACATC 600 

CTGAGCGATG AGGTCAACGT GGCCCGTGGT GTGGCCAGCC TCTTTGCOGG ACGCTCTGTG 660 

GCCTGTGTGG ACAGAAAGGG CTCTGGACGC TACTCTATCT ACATTGCCAA TTACGCCTAC 720 

GGTAATGTGG GCCCTGATGC CCTCATTGAA ATGGACCCTG AGGCCAGTGA CCTCTCCCGG 780' 

GGCATTCTGG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 840 

TTCTCCCACA CTGCCTCTCC AAGCATTGGT GAGATATCTG GCAGAACCGA GGAGCGGGAA 900 

GGAGGAGACC CAGAGGAGGC AGATGAGGAG CACAGTGGGG ATGGAAGCAC GAGCCAACTG 960 

TGCCGGCTGG GCTGGAAGGA CGGGCAGTTC AAGGAAGAAG CAGCAGCTTT GGTGGAGGAA 1020 

CAGAGGGAGG CTGGGGCAGC TGGCGTGCCC AGAGGACGTG TTCGAACAGC TCTGCAGACT 1080 

TCCAAAAGCC ATTTGGCTGA CAAGAACCTA TTTGGCCCAC CATGTTACTA TTCTGTCTGC 1140 

GCGCCTTCTC CAGCCCACCC TTTCCCTGCC CGCCAAGCCC CCCAACACTA CCCTGTAGCC 1200 

CCCCTTGTCA CTCAGCTAAT GACACATGGA CGT CTGG CTG GAAAACTAGC CCGGAGTGTC 1260 

CCCCACCCCC GAGCCCCAGG AATGGACCCC AAATGTAAGG GCCGCCATGC TGAGCCCGGC 1320 

CTGATGGCTG AGGCTTTGGG CGCGTGGCCA QCGCTCAGCA CCACTGTGGT GCCAGGGGGC 1380 

CTGAGAAGCT GGGAGGAAAG CAGGCAGAAG GGGCAGGCCA TGTCCAGATG TGCACTCAGG 1440 

GAGCTGGGAG GTCCCTGGAG CCAAGCCACA CAGCACCTGC CTGCTAGAGA GCTGTATGAC 1500 

CTGGGAGAAC CTCCCATTTT ACAAAGAACA GACGGAGATC CAGGGAGGAG AAGGGACTCG 1560 

CCCAAGGTCA CACAGGAGTG CCATCTAGTG GCCACCATGC CAGCTCTCGG GGGACTCGAG 1620 

GGCCCCGGGA GGGTGGCCAA GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA 1680 

CTCTCCCATC CCCTGGTCCC CAACTTCCCC AGCTGCTTGA GGCCTCTTGA AGCCGGGACA 1740 

GTGCCGGGAG CTGCCCTGCC TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCAAGGCC 1800 

CTGGCGTGGA ACCAGATGGA AAAAGAGGAG GGGAAGATTC ATGGAGACCA TGAGCCCAGA 1860 

TTTAGGCTCA GGAAAGCACG GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 1920 

CTGCAGTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCTCC AGGTGGGCCT GGGGCTTGCT 1980 

TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GAGGCGTCAG CGTGGGCCCC 204 0 

ATCCTCAGCA GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGCC TAACTTCCTT 2100 

TTCCACAACC GGGGCGATGG CACCTTTGTG QACGCTGCGG CCAGTGCTGA ACGTCGTTTA 2160 

GCCTTCATCG TTCACCTCAA ATATCACCTC TGCAGAGATT TTCCTCACTC CCTGTGCCAC 2220 

CT AG CAG AAA CTGGTCCTTC CTCCTCCTGC TGCCCGTGGC ATGCACGTCT TCTTCAGGCT 2280 

CCACATTGCC ATCATGGTTT GTCTATGAGC TTTACAAGGA CCGGGTCACG GTTCTATTCA 2340 

TTCTTGACGC AAGGCTTGGC CTCCAGTGCC CACCGGAGGA CACTCAGCCT CCAGGGTTCT 2400 

CAGGGGGCCC CACCCTGCCT TCTGGCAAGA GCTCCCTGTG TCCTGGGGTC TCTGATCCCC 2460 

ACTGCCTATT ACATTGTCCT GTGGTCTGCC ATCCCAGAGA GCCTGATGAC CCA CAG CT AT 2520 

TTGTCCTCTG AAAGAGTCAA CGTGGGTGTG GACGACCCCC ACCAGCATGG GCGAGGTGTC 2580 

GCCCTGGCTG ACTTCAACCG TGATGGCAAA GTGGACATCG TCTATGGCAA CTGGAATGGC 2640 

CCCCACCGCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCCG GGACATCGCC 2700 

TCACCCAAGT TCTCCATGCC CTCCCCTGTC CGCACGGTCA TCACCGCCGA CTTTGACAAT 2760 

GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCAACCGC 2820 

CTCTTCCGAT GCTCCATCCT GGCTCGTGGC TCTTCATCCT TGACAGCTGG TGGGAGGAAC 2880 

GGTCAGGGAG AAGGTTTAAG AATCAGAAGG GGAGGGTTCC CAGGGCCAGG GGGTCAGGCC 2940 

AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 3000 

AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTGCAGGG 3060 

AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCC AAGCGCCACA AGATACAAAG 3120 

CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GOGCTA.CGGG 3180 

GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 3240 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC AATCACTACC AGGAAAAGGG 3300 

GCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 3360 

AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAG 3420 

AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 3480 

CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 354 0 

ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 3 600 

AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 3660 

GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA 3720 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 3780 

GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG CTCCAATCAC TACCAGGAAA 3840 

AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 3 900 

AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 3 960 

GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 4020 

TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TCAGGGCTTC 4080 

AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTGC CAGGGGAGCT 4140 

AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 4200 

GGCTACCTGT GTGAGATGGA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 4260 

AGTGTGGAGG TGACGTGGCC AGATGGCAAG ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 4320 



367 
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ATQAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG 
CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG 
GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC 
GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG 
GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA 
CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC 
CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC 

Seq ID NO: 473 Protein sequence 
Protein Accession #: FGENESH predicted 



1 
I 

MACPGGI#PAR 
SPYYAIiRDRQ 
PPTTPAGLLG 
GVATYTDKLF 
GNVGPDALIE 
GGDPEEADEE 
SKSHLADKNL 
PHPRAPGMDP 
ELGGPWSQAT 
GPGRVAKREI 
LAWNQMEKEE 
SATHCGSMSF 
AFIVHLKYHL 
FLTQGLASSA 
LSSERVNVGV 
SPKFSMPSPV 
GQGEGLRIRR 
KGKGNVAQSV 
RGP I TTRKRG 
RKGLRAPITT 
NHYQEKGLQG 
AMGSNHYQEK 
GRGTGGWTD 
KWLYTKKSG 
MNSVLEILYP 
GSYRCRTNKK 
PGCRLLLKRA 



11 
I 

CSGWMGLGGP 
GNAIGVTACD 
LPPLSGRDFS 
KFRNNRWEDI 
MDPEASDLSR 
HSGDGSTSQL 
FGPPCYYSVC 
KCKGRHAEPG 
QHLPARELYD 
GRETGAVGRP 
GKIHGDHEPR 
LGGRGVSVGP 
CRDFPHSLCH 
HRRTbSLQGS 
DDPHQHGRGV 
RTVITADFDN 
GGFPGPGGQA 
PRTQAPQDTK 
YGVQSLPGKG 
RKRGYGVQSL 
PI TTRKRG YR 
GLRAPITTRK 
FDGDGMUD L I 
AHLRIIDGGS 
RDEDTLQDPA 
CSRGYEPNED 
QLQAAPSTLL 



21 
I 

SGSSPASPPH 
IDGDGREEIY 
SSLGQASPDS 
LSDEVNVARG 
GILALRDVAA 
CRLGWKDGQF 
APSPAHPFPA 
LMAEALGAWP 
LGEPPILQRT 
LSHPLVPNFP 
FRIiRKAREAE 
ILSSSASDIF 
LAETGPSSSC 
0GAPPCLLAR 
ALADFNRDGK 
DQELEIFFNN 
KVNTGPLMKK 
PHYHKKGLQG 
ATGSNHYQEK 
PGKGATGSNH 
VQSLPQKOAT 
RGYGVQSLPQ 
LSHGESMAQP 
GYLCEMEPVA 
PItECGQGFSQ 
GTACVGTELG 
QKAPGIPEAQ 



31 
I 

SSSRYNGPNL 
FLNTNNAFSG 
RQGERVPVPC 
VASLFAGRSV 
EAGVSKYTEG 
KEEAAALVEE 
RQAPQHYPVA 
ALSTTWPGG 
DGDPGRRRDS 
SCLRPLEAGT 
FPPGSSEEPL 
CDNENGPNFL 
CPWHARLLQA 
APCVLGSLIP 
VDIVYGNWNG 
IAYRSSSANR 
QKGRKDEDWA 
PITTRKRGYG 
GLOGPITTRK 
YQEKGLRGPI 



KGATGSNVIR 
LSVFRGNQGF 
HFGLGKDEAS 
QENGHCMDTN 
SRHTMTWKPR 
VYEQDQE 



Seq ID NO: 474 DNA sequence 

Nucleic Acid Accession #: NM_00366l.l 

Coding sequence « 1..1152 



ATGAGTGCAC 
CAAAACGTTC 
GCTGCTGGCA 
AAGGAAAAAG 
GGATTCGTGG 
GACAACCTTG 
TACAGAAACT 
AGAAGGCTCC 
AATGTGGTGT 
CTGGCACCCT 
ATCACAGCCG 
ACACAAGCCC 
GAGTTTTTGG 
ACACGAGGCA 
GTACCGCATG 
GAACAGGTGG 
ACGGATGTGG 
TCAAAGCACT 
CAGGAGCTGG 
CAAGAACTGT 



11 
I 

TTTTCCTTGG 
CAAGTGGGAC 
CCATGGACCC 
TGAGCACACA 
CTGCTGCTGA 
CAAGACAAAT 
GGTTTCTGAA 
GTGCCCTTGC 
CTGGCTCTCT 
TCACAGAGGG 
CTTTGACCGG 
AAGCCCACGA 
GTGAGAACAT 
TTGGGAAGGA 
CCTCAGCCTC 
AGAGGGTTAA 
CCCCTGTAAG 
TACATGAGGG 
AG GAGAAGCT 
GA 



21 

I 

TGTGGGAGTG 
AGATACTGGA 
AGAGAGCAGT 
GAATCTGCTA 
ACTGCCCAGG 
GATCATGAAA 
AGAGTTTCCT 
AGATGGGGTT 
CAGCATTTCC 
AGGCAGCCTT 
GATTACCAGC 
CCTGGTCATC 
AT CCAACTTT 
CATCCGTGCC 
ACGCCCCCGG 
TGAACCCAGC 
CTTCTTT CTT 
GGCAAAGTCA 
AAACATTCTC 



31 
I 

AGGGCAGAGG 
GATCCTCAAA 
ATCTTTATTG 
CTCCTGCTGA 
AATGAGGCAG 
GACAAAAACT 
CGGTTGAAAA 
CAGAAGGTCC 
TCTGGCATCC 
GTACTCTTGG 
AGTACCATGG 
AAAAGCCTTG 
CTTTCCTTAG 
CTCAGACGAG 
GTGACTGAGC 
ATCCTGGAAA 
GTGCTGGATG 
GAGACAGCTQ 
AACAATAATT 



Seq ID NO: 475 Protein sequence 
Protein Accession #: NP 003652.1 



1 
I 

MSALFLGVGV 
KEKVSTQNLL 
YRNWFLKEFP 
LAPFTEGGSL 
EFLGENISNF 
EQVERVNEPS 
QELEEKLNIL 



11 

I 

RAEEAGARVQ 
IrLLTDNEAWN 
RLKSELEDN I 
VLLEPGMELG 
LSTjAGNTYQLi 
ILEMSRGVKL 
NNNYKILQAD 



21 
I 

QNVPSGTDTG 
GFVAAAELPR 
RRXiRAIiADGV 
ITAALTGITS 
TRGIGKDIRA 
TDVAPVSFFI* 
QEli 



31 
I 

DPQSKPLGDW 
NEADELRKAL 
QKVHKGTTIA 
STMDYGKKWW 
LRRARANLQS 
VLDWYLVYE 



ACACACTTCA 
GCCATTGCAT 
CCGTATGTGT 
GCTACGAGCC 
CAATGACGTG 
CCGTCTGGTC 
CTGCTCCCAG 
AAGATCAGGA 



41 
I 

VLKYDRAQKR 
HSSSAQVPSG 
CRGGLRPTHE 
ACVDRKGSGR 
FSHTASPSIG 
QREAGAAGVP 
PLVTQLMTHG 
LRSWEESRQK 
PKVTQECHLV 
VPGAALPGNP 
LQFPSGLRGS 
FHNRGDGTFV 
PHCHHGLSMS 
TAYYIVLWSA 
PHRIjYLQMST 
LFRCSILARG 
RGCGNAGQSL 
VQ S LPGKGAT 
RGYGLQ9LPG 
TTRKRGYGLQ 
RGP I TTRKRG 
REHGDPLIEE 
NNNWLRWPR 
SVEVTWPDGK 
ECIQFPFVCP 
PKKELQLSQG 



41 
I 

AAGCTGGAGC 
GTAAG CCCCT 
AGGATGCCAT 
CTGATAATGA 
ATGAGCTCCG 
GGCACGATAA 
GTGAGCTTGA 
ACAAAGGCAC 
TGACCCTCGT 
AACCTGGGAT 
ACTACGGAAA 
ACAAATTGAA 
CTGGCAATAC 
CCAGAGCCAA 
CAATCTCAGC 
TGAGCAGAGG 
TAGTCTACCT 
AGGAGCTGAA 
ATAAGATTCT 



41 
I 

AAGTMDPESS 
DNLARQMIMK 
NWSGSLSIS 
TQAQAHDLVI 
VPHASASRPR 
SKHLHEGAKS 



GGACCCAGCC 
GGACACCAAT 
CAACACCTAT 
CAACGAGGAT 
GAAACCAAGG 
CTTTTTCCTG 
CACCCTTCTC 
ATAA 



51 
I 

LVNIAVDERS 
LHRNRPVLKP 
PEPFLLRPKS 
YSIYIANYAY 
EISGRTEERE 
RGRVRTALQT 
RLAGKLARSV 
GQAMSRCALR 
ATMPALGGLE 
GNWVLDMAKA 
PVLQVGLGIA 
DAAASAERRL 
FTRTGSRFYS 
-I PESLMTHSY 
KGKVRFRD1A 
SSSLTAGGRN 
AKEPASAIAG 
GSNHYQEKGL 
KGATGSNHYH 
SLPGKGATGS 
YGLQSLPGKE 
LNPGDALEPE 
TRFGAFARGA 
MVSRNVASGE 
RDKPVCVNTY 
ICTPVWSFFL 



51 
I 

GAGGGTGCAA 
CGGTGACTGG 
TAAGTATTTC 
GGCCTGGAAC 
TAAAGCTCTG 
AGGCCAGCAG 
GGATAACATA 
CACCATCGCC 
CGGCATGGGT 
GGAGTTGGGA 
GAAGTGGTGG 
GGAGGTGAGG 
TTACXy^ACTC 
TCTTCAGTCA 
TGAAAGCGGT 
AGTCAAGCTC 
CGTGTACGAA 
GAAGGTGGCT 
GCAGGCGGAC 



SI 
I 

IFIEDAIKYF 
DKKWHDKGQQ 
SGILTLVGMG 
KSLDKLKEVR 
VTEPISAESG 
ETAEELKKVA 
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60 
120 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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960 
1020 
1080 
1140 



60 
120 
180 
240 
300 
360 



Seq ID NO: 476 DNA sequence 

Nucleic Acid Accession #: NM_014452.1 

Coding sequence: 1..1968 



11 



21 



31 



41 



368 



WO 02/086443 

I I I I 1 I 

ATGGGGACCT CTCCGAGCAG CAGCACCGCC CTCGCCTCCT GCAGCCGCAT CGCCCGCCGA 60 

GCCACAGCCA CGATGATCGC GGGCTCCCTT CTCCTGCTTG GATTCCTTAG CACCACCACA 120 

GCTCAGCCAG AACAGAAGGC CTCGAATCTC ATTGGCACAT ACCGCCATGT TGACCGTGCC 180 

ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA CCTATGTCTC TGAGCATTGT ' 240 

ACCAACACAA GCCTGCGCGT CTGCAGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 300 

AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT GATTGAGAAA «l! 360 

TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC CACCTGGCAT GTTCCAGTCT 420 

AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT GGGGTGTGCG GAAGAAAGGG 480 

ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG GTACCTTCTC AGATGTGCCT S40 

TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GGTGGTGATC 600 

AAGCCGGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC TCCCGTCCTT CTCCAGCTCC 660 

ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG AGCACATGGA AACCCATGAA 720 

GTCCCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA CAGAATCCAA CTCTTCTGCC 780 

TCTGTTAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA CAGTCCCTGA CAACACAAGC 840 

TCAGCAAGGG GGAAGGAAGA CGTGAACAAG ACCCTCCCAA ACCTTCAGGT AGTCAACCAC 900 

CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC CGTCCATGGA GGCCACTGGG 960 

GGCGAGAAGT CCAGCACGCC CATCAAGGGC CCCAAGAGGG GACATCCTAG ACAGAACCTA 1020 

CACAAGCATT TTGACATCAA TGAGCATTTG CCCTGGATGA TTGTGCTTTT CCTGCTGCTG 1080 

GTGCTTGTGG TGATTGTGGT GTGCAGTATC CGGAAAAGCT CGAGGACTCT GAAAAAGGGG 1140 

CCCCGGCAGG ATCCCAGTGC CATTGTGGAA AAGGCAGGGC TGAAGAAATC CATGACTCCA 1200 

ACCCAGAACC GGGAGAAATG GAT CTACTAC TGCAATGGCC ATGGTATCGA TAT CCTGAAG 1260 

CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG AAAGATATCT ATCAGTTTCT TTGCAATGCC 1320 

AGTGAGAGGG AGGTTGCTGC TTTCTCCAAT GGGTACACAG CCGACCACGA GCGGGCCTAC 1380 

GCAGCTCTGC AGCACTGGAC CATCCGGGGC CCCGAGGCCA GCCTCGCCCA GCTAATTAGC 1440 

GCCCTGCGCC AGCACCGGAG AAACGATQTT GTGGAGAAGA TTCGTGGGCT GATGGAAGAC 1500 

ACCACCCAGC TGGAAACTGA CAAACTAGCT CTCCCGATGA GCCCCAGCCC GCTTAGCCCG 1560 

AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTCTCCTGAC GGTGGAGCCT 1620 

TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 1680 

GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 1740 

AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 1800 

GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG TGATTGAAGA GATTCCCCAG 1860 

GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 1920 
CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG 

Seq ZD NOt 477 Protein sequence 
Protein Accession tf: np_0S5267.1 



1 

I 

MGTSPSSSTA 
TGQVLTCDKC 
LPCAAI/TDRE 
SSVMKCKAYT 
VPSSTYVPKG 
QQGPHHRHIL 
VLWIWCSI 
LVAAQVGSQW 
ALRQHRRNDV 
SPQDKNKGFF 
DDMLHFLNPE 



11 
I 

IASCSRIARR 
PAGTYVSEHC 
CTCPPGMPQS 
DCLSQNLWI 
MNSTESNSSA 
KLLPSMEATG 
RK5SRTLKKG 
KDIYQFLCNA 
VEKIRGLMED 
VDESEPLLRC 
ELRVIEEIPQ 



21 
I 

ATATMIAG9L 
TNTSLRVCSS 
NATCAPHTVC 
KPGTKETDNV 
8VRPKVLSSI 
GEKSSTPIKG 
PRQDPSAIVE 
SEREVAAFSN 
TTQLETDKLA 
DSTSSGSSAL 
AEDKLDRLPE 



31 
I 

LLLGPLSTTT 
CPVGTPTRHE 
PVGWGVRKKG 
CGTLPSFSSS 
QEGTVPDNTS 
PKRGHPRQNL 
KAGLKKSMTP 
GYTADHERAY 
IiPMSPSPLSP 
SRNGSFITKE 
3 IGVKSQEAS 



41 
I 

AQPEQKASNL 
NGIEKCHDCS 
TETEDVRCKQ 
TSPSPGTAIP 
SARGKEDVNK 
HKHFDINEHL 
TQNREKWIYY 
AALQHWTIRG 
SPIPSPNAKL 
KKDTVLRQVR 
QTLLDSVYSH 



51 
I 

IGTYRHVDRA 
QPCPWPMIEK 
CARGTFSDVP 
PRPEHMETHE 
TLPNLQWNH 
PWMIVLFIiLL 
CNGHGIDILK 
PEASLAQLIS 
ENSALLTVEP 
LDPCDLQPIF 
LPDLL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO z 478 DKA sequence 
Nucleic Acid Accession #i XM_044533 
Coding sequence: 238.. 2751 



1 
I 

GCTCTGCCCA 
AGGGGCTGAG 
GGAGGCGGGG 
AGTCCGGCCG 
CTGCGCACCG 
CGGCCACCGC 
TGGGCGCTCA 
TTCGAAGCTG 
CTGTACGTGG 
GGCGGGGAGT 
TTCAAGGGCA 
AGCGGCAGTC 
AACATGGAGA 
AAGGGCCGTT 
CTCTACACTG 
AGCCTTQGCC 
GCCTCAGCCT 
TTTTTCTTCA 
ATTGCCCGCA 
TCCTTCCTCA 
CTGCAGGATG 
GGGGTCTTCA 
ACAATGAAGG 
CAGCAGTGGT 
AACAGTGCCC 
TTCCTCAAGG 
CCCCAGGCTC 
GATGTCCTCT 
CGGGTGCACA 



11 
I 

AGCCGAGGCT 
TTTGCCAGGG 
GCCCCCGGGG 
AGCCACCTGA 
CGATGGGCCT 
TGCTGCTGCT 
GCCCCCGGAT 
AACACATCTC 
GTGCTCGAGA 
ACCAGGAGCT 
AGGACCCACA 
ACCTGTTCAC 
ACTTCACCCT 
GTCCCTTCGA 
GAACAGTCAG 
CCACCAAGAC 
ACATTCCTGA 
GCGAGACTGG 
TCTGCAAGGG 
AGGCCCAGCT 
TCTTCACGCT 
CTTCCCAGTG 
ATGTGCAGAG 
ACACCGTGAC 
GGGAAAGGAA 
ACCACTTCCT 
GCTACCAGCG 
TCCTGGGCAC 
TCATTGAGGA 



21 

I. 

GCGGGGCCGG 
CCCACTTGAC 
CGACTCGGGG 
GCCCGAGCCG 
GAGGAGCTGG 
CCTGCTGCTG 
CAGCCTGCCT 
CAACTACACA 
GGCCCTCTTT 
GCTTTGGGGT 
GCGCGACTGT 
CTGTGGCACA 
GGCAAGGGAC 
CCCGAATTTC 
CAGCTTCCAA 
CGAGAGCTCC 
GAGCCTGGGC 
CCAGGAATTT 
CGATGAGGGT 
GCTGTGCTCA 
GAGCCCCAGC 
GCACAGGGGA 
AGTCTTCAGC 
CCACCCGGTG 
GATCAACTCA 
GATGGACGGG 
CGTGGCTGTA 
TGGTGACGGC 
GCTGCAGATC 



31 
I 

CGCCGGCGGG 
CCTGTTTCCC 
GCGGACCGOG 
CGGGACACCG 
CTCGCCGCCC 
CTGCTCCTGC 
CTGGGCTCTG 
GCCCTTCTGC 
GCACTCAGTA 
GCAGACGCAG 
CAAAACTACA 
GCAGCCTTCA 
GAGAAGGGGA 
AAGTCCACTG 
GGGAATGACC 
CTCAACTGGC 
AGCTTGCAAG 
GAGTTCTTTG 
GGAGAGCGGG 
CGGCCCGACG 
CCCCAGGACT 
ACTACAGAAG 
GGCCTCTACA 
CCCACACCCC 
TCCCTGCAGC 
CAGGTCCGAA 
CACCGCGTCC 
CGGCTCCACA 
TTCTCATCGG 



41 
I 

AGGACTGCGG 
ACCTCCCGCC 
GGGCGGAGCT 
TCGCTCCTGC 
CATGGGGCGC 
TGCAGCCGCC 
AAGAGCGGCC 
TGAGCAGGGA 
GCAACCTCAG 
AGAAGAAACA 
TCAAGATCCT 
GCCCCATGTG 
ATGTCCTCCT 
CCCTGGTGGT 
CGGCCATCTC 
TGCAAGACCC 
GCGATGATGA 
AGAACACCAT 
TGCTACAGCA 
ATGGCTTCCC 
GGCGTGACAC 
GCTCTGCCGT 
AGGAGGTGAA 
GGCCTGGAGC 
TCCCAGACCG 
GCCGCATGCT 
CTGGCCTGCA 
AGGCAGTGAG 
GACAGCCCGT 



51 
I 

TGCCCCGCGG 
CCCCAGGTCC 
GCCGCCCGTG 
TCTCCGAATG 
GCTGCCGCCT 
GCCTCCGACC 
ATTCCTCAGA 
TGGCAGGACC 
CTTCCTGCCA 
GCAGTGCAGC 

TACCTACATC 
GGAAGATGGC 
TGATGGCGAG 
GCGGAGCCAA 
AGCTTTTGTG 
CAAGATCTAC 
TGTGTCCCGC 
GCGCTGGACC 
CTTCAACGTG 
CCTTTTCTAT 
CTGTGTCTTC 
CCGTGAGACA 
GTGCATCACC 
CGTGCTGAAC 
GCTGCTGCAG 
CCACACCTAC 
CGTGGGCCCC 
GCAGAATCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



369 



WO 02/086443 

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC I860 

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

TGGCTACGCA AOGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2S20 

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAAOGGCC TAGGGCCCCC TAGCACCCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820 

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2 880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2 940 

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000 

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 3240 

CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3300 

CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCGT GTGCCCCTTC 3540 

CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID NO: 479 Protein sequence 
Protein Accession #: XP_044533.3 

1 11 21 31 41 51 

1 I I • I I I 

MLRTAMGIjRS WLAAPWGAIiF PRPPLLLLLIi IiLLLIiQPPPP TWALSPRISL PLGSEERPFL 60 

RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 12 0 

SPKGKDPQRD CQNYIKILLP LSGSHLPTCG TAAPSPMCTY INMENFTIAR DEKGNVLLED 180 

GKGRCPFDPN FKSTALWDG EIiYTGTVSSF QGNDPAISRS QSLRPTKTES SLNWLQDPAF 240 

VASAYIPESL GSLQGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQbLC SRPDDGFPFN VLQDVPTIiSP SPQDWRDTLP YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVIi 420 

NFLKDHFIiMD GQVRSRMLLI* QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 540 

PYCAWSGSSC KHVSLYQPQIi ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTIiA CFLLSNIiATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VI 1 9TSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLPVLAVL LPVLPLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 
PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 

Seq ID NO: 480 DNA sequence 

Nucleic Acid Accession #: NM_004217.1 

Coding sequence: 58.. 1092 

1 11 21 31 41 51 

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 60 

GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 120 

AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

AGTGGGACAC CCGACATCTT AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 300 

CCTCTGGGCA AAGGCAAGTT TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360 

AT CGTGG CGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 420 

CTGCGCAGAG AGATCGAAAT CCAGGCCCAC CTGCACCATC CCAACATCCT GCGTCTCTAC 480 

AACTAT TTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 540 

CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGCGAACAGC CACGATCATG 600 

GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 

AAG CCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG 720 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 

CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GG CCCAGGTC 1020 

TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CTGAAGCTTT TTGT 

Seq ID NO: 481 Protein sequence 
Protein Accession #: NP_004208 

1 11 21 31 41 51 



370 



5 

10 
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20 
25 
30 
35 
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45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

I I I I 

MAQKENSYPW PYGRQTAPSG LSTLPQRVLR KEPVTPSALV 
SSGTPDILTR HFTIDDFEIG RPLGKGKFGN VYLAREKKSH 
QLRREIEIQA HLHHPNILRI* YNYFYDRRRI YLILBYAPRG 
MEELADALMY CHGKKVIHRD IKPENLLLGL KGELKIADFG 
PPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHNE 
QDLISKLLRH NPSERItPLAQ VSAHPWVRAN SRRVLPPSAL 

Seq ID NO: 482 DMA sequence 
Nucleic Acid Accession #: AK055663 
Coding sequence! 38.. 1423 



PCT/US02/12476 



LMSRSNVQPT 


AAPGQKVMEN 


60 


FIVALKVIiFK 


SQIEKEGVEK 


120 


ELYKELQKSC 


TFDEQRTATI 


180 


WSVHAPSLRR 


KTMCGTLDYL 


240 


TYRRIVKVDL 


KFPASVPTGA 


300 


QSVA 







1 

I 

AGAACGGCTT 
AAAACCACAA 
CCGAAGGTCC 
GCTTATGTGG 
TTTTGATCTT 
TAGCCCTGTC 
AGTCTTGGCA 
ACAGCCCGAG 
CCTGTTCACG 
TACGAGCTGG 
GGGACTTAGC 
AGCATTTGCT 
CACTGCCTCT 
GTACAGTGGG 
ACTCATCAGA 
GACCCTAGGT 
TGAACAAATG 
TGTTCAAATT 
GAATGTCCTA 
TGATTTGAAC 
ATTTAACACT 
TTATGGTTTT 
TGGAGTTCCA 
TAGATATGGA 
TTATAAGGAA 
TGTTTAATCA 
TATGAAACTA 
GCTTTAAATA 
GTTTTGTAGT 
AGATCTGTCA 
TGCCACTGTG 
CTTAGTTTTT 
ATGCAGTGGC 
TGCCTCAGCC 
TGTATTTTTA 
ACCTCATGAT 
CCTGGCCGAT 
GGGAAAGGGA 
AATTGCTAAA 
TTTTTAGCAG 
GATTTTTGTT 



11 
I 

CCGGCGGGAG 
AGATCCTTTT 
TGGAAGATAC 
TGCAGTTCTA 
TTTAGTTTAA 
TATTCATTTG 
CAGTTGGGAG 
ATACACACGG 
ATGCTTTCTA 
CTTCAAGAGC 
AGTATCTTCC 
CTTTGTATTA 
GCTATAG CT A 
AAAGTCTTAC 
GAGGTATCTA 
TTTGGCTCAT 
GTTCTTG CTC 
TTCAAGGATG 
AACTTTTCAG 
CCAGTTACAT 
CCTGGGAAAA 
GGTCTCAATC 
GGAATTGGAG 
ACTAATAATA 
TATTGACTCC 
TTTACTCTAA 
TATTTTTGTA 
GGCTTCCTTT 
TGACTGCAGT 
CATTACTAAG 
CCCGGCCAAT 
GTTTTGTTTT 
ATGATCTCAG 
TCCCGAGTAG 
GTAAAGACGG 
CCACCCACCT 
ATTTTCTTTA 
AAAATGTCTG 

AAATTTTGGA 
AAAGTTTCTC 



21 
I 

CTGTGCAGCT 
TTGGCAAGTT 
TGCTCTTTGG 
CTAATAGTAT 
TGACATGTTT 
GGTTTGAAAG 
CTCTCTTTAT 
GAAGATTATT 
TTCGGAATAA 
ATGTTGCAGA 
TTCCCCGAAT 
CATATATGCT 
TTGCCTTGAT 
TCCAGACAAC 
CCTTAGATGG 
TGGCTGGATC 
ATGTGACCAA 
ACTGGATTAG 
ATCATCACGT 
CAACTCCAGC 
ATGTGAACCC 
ATGGACACAC 
CAACTCAAGG 
GAATTGGACA 
TTGGCTTCCA 
ATGTTAGATA 
AAATGTATTT 
AGAAAATGTG 
GTGATGTGAC 
ATACGATATT 
ACATTATTAT 
GTTTTTTGAG 
CTCACTGCAA 
CTGGGATTAC 
GGGATTTCAC 
TAGCCTCCCA 
ATGAAATTTA 
TTCAAAAAGT 
AGGTTCTCCT 
ATACATTCTA 
TCCTTTAAAA 



Seq ID NO: 483 Protein sequence 
Protein Accession #x BAB70980.1 



1 
I 

MGTIHLFRKP 
TAYTYLTIFD 
ESAERFLEQP 
RSLCGIIPGL 
GTMYPMSVYS 
VRIRRDANEQ 
MPLLKGTDDL 
SSMLNQGLGV 



11 
I 

QRSFFGKLLR 
LFSLMTCLIS 
EIHTGRLLVG 
SSIFLPRMNP 
GKVLLQTTPP 
MVLAHVTNRL 
NPVTSTPAKP 
PGIGATQGLR 



21 
I 

EFRLVAADRR 
YWVTLRKP3P 
TFVALCFNLP 
FVLIDLAGAF 
HVIGQLDKLI 
YTLVSTLTVQ 
SSPPPEFSFN 
TGFTNIPSRY 



31 
I 

CCTTATCATG 
GTTACGGGAA 
TGTAATAAAC 
AGCTTTAACT 
AATAAGTTAC 
ATTAGAAGTC 
ATTAAAAGAA 
AGTTGGTACT 
ACCTTTTGCT 
TCTTAGTCGA 
GAATCCATTT 
CATTGAAATT 
GACATTTGGC 
ACCACCCCAT 
AGTTTTAGAA 
AGTGCATGTA 
CAGGCTGTAC 
GCCTGCCTTA 
AATCCCAATG 
TAAACCTAGT 
AGTTATTCTT 
ACCTTACAGC 
ATTGAGGACT 
ACCAAGACCA 
ATTTATTTAG 
ATAGTAGTCT 
GTGACAGTGA 
TTTCTTTAAA 
CTTACCTTTA 
TCTTTTTTTT 
TAACTTAAGG 
ATGGAGT CTC 
CCTCTGCCTC 
AGGCACCTGC 
CATGTTGGCC 
AAGTGCTGGG 
TAAATATGCT 
AAAGGTCTCT 
GAATTATGTC 
TCTAGCACAA 
ATTTTAGTAC 



31 

I 

SWKILLFGVI 
VYSFGFERLE 
TMLSIRNKPF 
ALCITYMLIE 
REVSTLDGVZi 
IFKDDWIRPA 
TPGKNVNPVI 
GTNNRIGQPR 



41 

I 

GGGACAATTC 
TTTAGACTTG 
TTGATATGTA 
GCCTATACTT 
TGGGTAACAT 
CTGGCTGTAT 
AGTGCAGAAC 
TTTGTGGCTC 
TATGTCTCAG 
AG CTTGTGTG 
GTTTTGATTG 
AATAATTATT 
ACTATGTATC 
GTTATTGGTC 
GTCCGAAATG 
AGAATTCGAC 
ACTCTAGTGT 
TTGTCTGGGC 
CCTCTTTTAA 
AGTCCACCTC 
CTAAACACAC 
AGCATGCTTA 
GGTTTTACAA 
TGATAGACTC 
TAATCCAACT 
TGTTCACATT 
AATCCTCGTA 
TTTGGATTTT 
TAAGAGCCAC 
TCCGAGACGG 
CTGTACTTTA 
ACTCTGTCGC 
CTGAGTTCAA 
CACCACGCCC 
AGGCTGGTCT 
ATTAGGTGTG 
TCTTGAATAA 
TTTATAGCTT 
TTACAAACTA 
TTTGAATTTT 
ATTTGTAAAT 



41 
I 

NLICTGFLLM 
VLAVFASTVL 
AYVSEAASTS 
INNYFAVDTA 
EVRNEHFWTL 
LLSGPVAANV 
LLNTQTRPYG 
P 



Seq ID NO: 484 DNA sequence 
Nucleic Acid Accession #: 
Coding sequence : l . . 900 



FGENESH predicted 



1 
I 

ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
GGCTGCGGCG 
GGACCCCGGG 
CTTCCTAACT 
CCGGTGCGCA 
CTTTGCTACC 
TTTCAAAACA 
GTGCTGCTGG 



11 
I 

GGGAGCTGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGCTGTGCA 
GAGGAGACTG 
CAGGCTCTCC 
TTGAGCTCTG 
CGGATACCGA 
TCACAGAGAA 
TGGGCACCCA 



21 

I 

CGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGGT 
CCGGGGAGCT 
GAGCAGGCCC 
CCGCCCCGCC 
GGACACAGCG 
TGTCTTCCTG 
ATGGCTGCCC 
GGCCGACCTG 



31 
I 

CCGCCCCCGC 
GGCATCAAGT 
TACACCTGCA 
ACGTACGTTC 
GGGGCGGGCG 
CGAGGTGGCG 
CCTGCAGTGC 
GGACAGGAGG 
GCGTGCTTCA 
GAGATCCGCA 
AGGGACGATG 



41 
I 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
AATCGCCCGT 
TCTCGGCGGG 
CTGGTGCGGC 
AAGTCCTGGT 
ATTTTGACCG 
GCGTGGTGCA 
CGCACAACCC 
TCAACGTACT 



51 
I 

ATCTCTTTCG 
TAGCAGCTGA 
CTGGCTTCCT 
ACCTGACCAT 
TGAGGAAACC 
TTGCCTCCAC 
GCTTTTTGGA 
TTTGTTTCAA 
AAGCTGCTAG 
GAATTATTCC 
ATCTTGCTGG 
TTGCCGTAGA 
CCATGAGTGT 
AGTTGGACAA 
AACATTTTTG 
GAGATGCCAA 
CTACTCTAAC 
CTGTTGCAGC 
AGGGTACTGA 
CAGAATTTTC 
AAACAAGGCC 
ATCAAGGACT 
ATATACCAAG 
TAACTTATTT 
TTGCATTGAC 
TCATGAAACC 
AATGTTAAAG 
GGTATCTTTG 
TTGATGGAGT 
AGTCTTGCTC 
TTAAGGCTTC 
CCAGGCTGGA 
ATGATTCTCC 
AG CTAATTTT 
TGAACTCCTG 
AGCCACCGCA 
TACACATTTT 
TTCCAAACTT 
AAAGCAAAAA 
TAATTATCAA 



51 

I 

WCSSTNSIAL 
AQLGALF I LK 
WLQEHVADLS 
SAIAIALMTF 
GFGSLAGSVH 
LNFSDHHVIP 
FGUJHGHTPY 



SI 
I 

GACCCCTCCC 
GGGCGACGGC 
CGCGCGCTAC 
GCGGCCGCGT 
AGGGCGCAGA 
CCAGGACGCT 
GGATGGAGCT 
ACTTCGTTCC 
GCCCAGCTCC 
CCAGGCGCCT 
AATTCAGCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



60 
120 
180 
240 
300 
360 
420 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 
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GACCAGGGGG GCCGGGAGGG CCCCGTGCCC CAACCCCAGG CTCAGGGTCT GGCCGAGAAG 
ATCCGAGCCT GCTGCTACCT TGAGTGCTCA GCCTTGACGC AGAAGAACTT GAAGGAAGTA 
TTTGACTCGG CTATTCTCAG TGCCATTGAG CACAAAGCCC GGCTGGAGAA GAAACTGAAT 
GCCAAAGGTG TGCGCACCCT CTCCCGCTGC CGCTGGAAGA AGTTCTTCTG CTTCGTTTGA 

Seq ID NO: 485 Protein sequence 
Protein Accession #t fgenesh predicted 



PCT/US02/12476 



720 
780 
B40 



MPPRELSEAE 
RPTALDTPSG 
LPNSGSPRPA 
FQNITEKWLP 
IRACCYLECS 



11 
I 

PPPLRAPTPP 
TYVQSPVRPR 
PAVQVLVDGA 
EIRTHNPQAP 
ALTQKNLKEV 



21 



31 



PRRRSAPPEL GIKCVLVGDG 
GCGGAVHRGA GAGVSAGGRR 
PVR I ELWDT A GQEDFDRLRS 
VLLVGTQADL RDDVNVLIQL 
FDSAILSAIE HKARLEKKLN 



Seq ID NO: 486 DNA sequence 

Nucleic Acid Accession <h XM_063832.2 

Coding sequence: 1..711 



ATQCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
ATTGAGCTCT 
CCGGATACCG 
ATCACAGAGA 
GTGGGCACCC 
GGCCGGGAGG 
TGCTGCTACC 
GCTATTCTCA 
GTGCGCACCC 



11 
I 

GGGAGCTGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGACACAGC 
ATGTCTTCCT 
AATGGCTGCC 
AGGCCGACCT 
GCCCCGTGCC 
TTGAGTGCTC 
GTGCCATTGA 
TCTCCCGCTG 



21 
I 

CGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGTG 
GGGACAGGAG 
GGCGTGCTTC 
CGAGATCCGC 
GAGGGACGAT 
CCAACCCCAG 
AGCCTTGACG 
GCACAAAGCC 
CCGCTGGAAG 



31 

I 

CCGCCCCCGC 
GGCATCAAGT 
TACACCTGCA 
CAAGTCCTGG 
GATTTTGACC 
AGCGTGGTGC 
ACGCACAACC 
GTCAACGTAC 
GCTCAGGGTC 
CAGAAGAACT 
CGGCTGGAGA 
AAGTTCTTCT 



41 

1 

AVGKSSL1VS 
GPRGGDWSRP 
LCYPDTDVFL 
DQGGREGPVP 
AKGVRTLSRC 



41 

I 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
TGGATGGAGC 
GACTTCGTTC 
AGCCCAGCTC 
CCCAGGCGCC 
TAATTCAGCT 
TGGCCGAGAA 
TGAAGGAAGT 
AGAAACTGAA 
GCTTCGTTTG 



51 
I 

YTCNGYPARY 
RGGAGAAQDA 
ACPSWQPSS 
QPQAQGLABK 
RWKKFFCFV 



51 
I 

GACCCCTCCC 
GGGCGACGGC 
CGCGCGCTAC 
TCCGGTGCGC 
CCTTTGCTAC 
CTTTCAAAAC 
TGTGCTGCTG 
GGACCAGGGG 
GATCCGAGCC 
ATTTGACTCG 
TGCCAAAGGT 
A 



Seq ID NO: 487 Protein sequence 
Protein Accession ft: XP_063832.1 



11 



21 



31 41 51 

I 1 t I I I 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 
RPTALDTPSV QVLVDGAPVR IELWDTAGQE DFDRLRSLCY PDTDVPLACP SWQPSSFQN 
ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLAEKIRA 
CCYLECSALT QKNLKEVPDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KFFCPV 

Seq ID NO: 4B8 DNA sequence 

Nucleic Acid Accession #: NM_0143 98.1 

Coding sequences 64.. 1314 



1 
I 

GGCACCGATT 
ACCATGCCCC 
CACGATGGCA 
ACTGCAGCAG 
CCTCACCAAA 
ACAGTAAAAA 
ATTACCTACA 
GTTACTGAAG 
CCACCAGCTC 
ACTCAACCCA 
ACAACCGGTC 
AATACCACCC 
CCATCGTCAG 
GCAGAGATGG 
TACTTCAACA 
AACCTTCTGT 
TCATATTATA 
CAAGGAATCA 
GTGAGTGAAC 
CTTCAAGCCT 
TACACAATTG 
GGTGT CTAT A 
CCCGGGGGGA 
TTGGGAAATT 
AATGAAGTGA 
GTTTATTTTA 
GCCACTCAAA 
AGCCTTCAAA 
ATTTTATTTT 
G CTTTTACT A 
ACTCCTTTTC 
CAGGCTGGAG 
TGATTCTCCT 
GCTAATTTTT 
CTCTTGACCT 
AGCCATTGCG 



11 
I 

CGGGGCCTGC 
GGCAGCTCAG 
GTCAAATGAG 
CAACAGTACA 
CTTTAGCAGC 
TTCCAACAAC 
CCCTGGTCAC 
TTACAGTCGG 
ATACAGCTGG 
GTAACCAGAC 
AGAAGCCTGA 
GCACAGCTGC 
TCAAGACTGG 
GGATACAGCT 
TCGACCCCAA 
TGAATTTTCA 
TCAGTGAAGT 
AACATGCGGT 
AGAGCCTCCA 
TTGATTTTGA 
TGCTTCCTGT 
AAATCCGCCT 
ATGAAAATAA 
CCCTCAGAGT 
GTCATGTGTG 
TGAAAGATAT 
GTCAACATTT 
TTATAAACCA 
ACCCTTGATC 
TCTGTGTTTT 
CACTTTAAAT 
TACAGTGGCA 
GCTTCAGCTT 
GTATTTTTAT 
CAGGTGATCC 
CCCGGCCTTA 



21 
I 

CCGGACTTCG 
CGCGGCGGCC 
AGCAAAAGCA 
GGACATAAAA 
AAGATTCATG 
TACCCCAGCA 
AACCCAGGCC 
CCCTAGCTTA 
AACCAGTTCA 
CACCCTTCCA 
TCAACCCACC 
ACCTGCCTCC 
AATTTATCAG 
GATTGTTCAA 
CGCAACGCAA 
GGGCGGATTT 
GGGAGCCTAT 
GGTGATGTTC 
GTTGTCAGCC 
AGATGACCAC 
GATTGGGGCC 
AAGGTGTCAA 
TGGAATTTAG 

ATTTAAGTTC 
AGTGAGCTGT 
GAGATATGTT 
AGGGTCAATT 
TTAACAAAGC 
ATGGTTTCAT 
TTGTTTTTGT 
CGATCTCGGC 
CCCGAGTAGC 
TATAGACGGG 
ACCCACCTCA 
AATGTTTTTT 



31 
I 

CCGCACGCTG 
GCGCTCTTCG 
TTTCCAGAAA 
AAACCTGTCC 
GATGGTCATA 
ACTACAAAAA 
ACACCCAACA 
GCCCCTTATT 
TCAACCGTCA 
GCAACTTTAT 
CATGCCCCAG 
ACGGTTCCTG 
GTTCTAAACG 
GACAAGGAGT 
GCCTCTGGGA 
GTGAATCTCA 
TTGACCGTCT 
CAGACAGCAG 
CACCTGCAGG 
TTTGGAAATG 
ATCGTGGTTG 
TCATCTGGAT 
AGAACTCTTT 
CAAACAATGT 
AGGCAGCACA 
TTATTTT CTA 
GAATTAACAT 
GTAACTAATA 
CTTTGCTTTG 
GTAACATACA 
TTTTTGAGAC 
TTATGGCAAC 
TGGGATTACA 
TTTCACCATG 
GCCTCCCAAA 
TTAATCATCA 



41 
I 

CAGAACCTCG 
CGTCCCTGGC 
CCAGAGATTA 
AGCAACCAGC 
TCACCTTTCA 
ACACTGCAAC 
ACTCACACAC 
CACTGCCACC 
GCCACACAAC 
CGATAGCACT 
GAACAACGGC 
GGCCCACCCT 
GAAGCAGACT 
CGGTTTTTTC 
ACTGTGG CAC 
CATTTACCAA 
CAGATCCAGA 
TCGGGCATTC 
TGAAAACAAC 
TGGATGAGTG 
GTCTCTGCCT 
ACCAGAGAAT 
CATCCCTTCC 
AAACCACCAT 
TCAATTTCTA 
GTTTCCTTTA 
AATATATGTA 
CTACTGTGTG 
TTATCAAATG 
TATTCCTGGT 
GGAGTTTCAC 
CTCCGCCTCC 
GGCACACACT 
TTGGCCAGAC 
GTGCTGGGAT 
AAAAGAACAA 



51 
i 

CCCAGCGCCC 
CGTAATTTTG 
TTCTCAACCT 
TAAGCAAGCA 
AACAGCGGCC 
CACCAGCCCA 
AGCTCCTCCA 
CACCATCACC 
TGGGAACACC 
GCACAAAAGC 
AGCTGCCCAC 
TGCACCTCAG 
CTGTATAAAA 
ACCTCGGAGA 
CCGAAAATCC 
GGATGAAGAA 
GACAGTTTAC 
CTTCAAGTGC 
CGATGTCCAA 
CTCGTCTGAC 
TATGGGTATG 
CTAATTGTTG 
AGGATGGATG 
CTTCTATTCA 
AATACTTTTT 
GAATATTTTA 
AAGTAGAATA 
TGCATTGAAG 
GACTTTCAGT 
GTAG CACTT A 
TCTTGTCACC 
CGGGTTCAAG 
ACCACGCCTG 
TGGTCTTGAA 
TACAGGCATG 
CATATCTCAG 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



372 
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GTTGTCTAAG TGTTTTTATG TAAAACCAAC AAAAAGAACA AATCAGCTTA TATTTTTTAT 2220 

CTTGATGACT CCTGCTCCAG AATTGCTAGA CTAAGAATTA GGTGGCTACA GATGGTAGAA 2280 

CTAAACAATA AGCAAGAGAC AATAATAATG GCCCTTAATT ATTAACAAAG TGCCAGAGTC 2340 

TAGGCTAAGC ACTTTATCTA TATCTCATTT CATTCTCACA ACTTATAAGT GAATGAGTAA 2400 

ACTGAGACTT AAGGGAACTG AATCACTTAA ATGTCACCTG GCTAACTGAT GGCAGAGCCA 2460 

GAGCTTGAAT TCATGTTGGT CTGACATCAA GGTCTTTGGT CTTCTCCCTA CACCAAGTTA 2520 

CCTACAAGAA CAATGACACC ACACTCTGCC TGAAGGCTCA CACCTCATAC CAGCATACGC 2S80 

TCACCTTACA GGGAAATGGG TTTATCCAGG ATCATGAGAC ATTAGGGTAG ATGAAAGGAG 2640 

AGCTTTGCAG ATAACAAAAT AGCCTATCCT TAATAAATCC TCCACTCTCT GGAAGGAGAC 2700 

TGAGGGGCTT TGTAAAACAT TAGTCAGTTG CTCATTTTTA TGGGATTGCT TAGCTGGGCT 2760 

GTAAAGATGA AGGCATCAAA TAAACTCAAA GTATTTTTAA ATTTTTTTGA TAATAGAGAA 2820 

ACTTCGCTAA CCAACTGTTC TTTCTTGAGT GTATAGCCCC ATCTTGTGGT AACTTGCTGC 2880 

TTCTGCACTT CATATCCATA TTTCCTATTG TTCACTTTAT TCTGTAGAGC AGCCTGCCAA 2940 

GAATTTTATT TCTGCTGTTT TTTTTGCTGC TAAAGAAAGG AACTAAGTCA GGATGTTAAC 3000 

AGAAAAGTCC ACATAACCCT AGAATTCTTA GTCAAGGAAT AATTCAAGTC AGCCTAGAGA 3060 

CCATGTTGAC TTTCCTCATG TGTTTCCTTA TGACTCAGTA AGTTGGCAAG GTCCTGACTT 3120 
TAGTCTTAAT AAAACATTGA ATTGTAGTAA AGGTTTTTGC AATAAAAACT TACTTTGG 

Seq ID NOt 489 Protein sequence 
Protein Accession #; NP_05S213.1 

1 11 21 31 41 51 

i i i i i i 

MPRQLSAAAA LFASLAVILH DGSQMRAXAF PETRDYSQPT AAATVQDIKK PVQQPAKQAP 60 

HQTLAARFMD GHITPQTAAT VKIPTTTPAT TKNTATTSPI TYTLVTTQAT PNNSHTAPPV 120 

TEVTVGPSIiA PYSLPPTITP PAHTAGTSSS TVSHTTGNTT QPSNQTTLPA TLSIALHKST 180 

TGQKPDQPTH APGTTAAAHN TTRTAAPAST VPGPTLAPQP SSVKTGIYQV LNGSRLCIKA 240 

BMGIQLIVQD KESVFSPRRY FNIDPNATQA SGNCGTRKSN LLLNFQGGFV NLTFTKDEES 300 

YYISEVGAYL TVSDPETVYQ GIKHAWMPQ TAVGHSFKCV SEQSLQLSAH LOVKTTDVQL 360 
QAFDFEDDHF GNVDECSSDY TIVLPVIGAI WGLCLMGMG VYKIRLRCQS SGYQRI 

Seq ID NOt 4 90 DNA sequence 

Nucleic Acid Accession ftt NM_OOS409.3 

Coding sequence: 94.. 378 

1 11 21 31 41 51 

! I 1 I I 1 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT '360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 

GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAG GAGTCCAGCA AT TAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 

CAT CTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAG CACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTCATTAAA TGTAATT CAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 

TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT CATTGAGATQ TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 12 0O 

TGTACTTTTT GTTTTGATCC GTTTGTATAA A TG AT AG CAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID NO: 491 Protein sequence 
Protein Accession #: np_005400.1 

1 11 21 31 41 51 

I I I I I I 

MSVKGMAIAL AVILCATWQ GFPMFKRGRC LCIGPGVKAV KVADIBKASI MYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSX QARLIIKKVE RKNF 

Seq ID NO: 492 DNA sequence 

Nucleic Acid Accession #x NM_0 00577.1 

Coding s equence : 4 1 . . 5 2 0 

1 11 21 31 41 51 

11)111 

GGCACGAGGG GAAGACCTCC TGTCCTATCA GGCCCTCCCC ATGGCTTTAG AGACGATCTQ 60 

CCGACCCTCT GGGAGAAAAT CCAGCAAGAT GCAAGCCTTC AGAATCTGGG ATGTTAACCA 120 

GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCAAG GACCAAATGT 180 

CAATTTAGAA GAAAAGATAG ATGTGGTACC CATTGAGCCT CATGCTCTGT TCTTGGGAAT 240 

CCATGGAGGG AAGATGTGCC TGTCCTGTGT CAAGTCTGGT GATGAGACCA GACTCCAGCT 300 

GGAGGCAGTT AACATCACTG ACCTGAGCGA GAACAGAAAG CAGGACAAGC GCTTCGCCTT 360 

CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 420 

CCTCTGCACA GCGATGGAAG CTGACCAGCC CGTCAGCCTC ACCAATATGC CTGACGAAGG 480 

CGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTGCCTGT 540 

TCCCATTCTT GCATGGCAAG GACTGCAGGG ACTGCCAGTC CCCCTGCCCC AGGGCTCCCG 600 



373 
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GCTATGGGGG CACTGAGGAC CAGCCATTGA GGGGTGQACC CTCAOAAGGC GTCACAACAA 6G0 
CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 720 
GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCGCC 780 
TCTGCATTCA GGATCAAACC CCGACCACCT GCCCAACCTG CTCTCCTCTT GCCACTGCCT B40 
CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 900 
ACCAAGTGGC TCCCACACCC TGTTTTACAA AAAAGAAAAG ACCAGTCCAT GAGGGAGGTT 960 

TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT TTTTTTTTTT CAGTCCCCGT 1020 

GAAGGAGAGC CCTTCATTTG GAGATTATGT TCTTTCGGGG AGAGGCTGAG GACTTAAAAT 1080 

ATTCCTGCAT TTGTGAAATG ATGGTGAAAG TAAGTGGTAG CTTTTCCCTT CTTTTTCTTC 1140 

TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 1200 

ATTTTTTTTT TCCTTTTAAA ACACTTCCAT AATCTGGACT CCTCTGTCCA GGCACTGCTG 1260 

CCCAGCCTCC AAGCTCCATC TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 1320 

TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT 13 BO 

TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 1440 

ACTTGTATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 1500 

GAGCAGGAAA CATGACTCGT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTGG 1560 

CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGT TCCCTACTTC 1620 

CTGTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 1680 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 493 Protein sequence 
Protein Accession #: NP_000568.1 

1 11 21 31 41 51 

I I I I I I . 

MALETICRPS GRKSSKMQAF RIWDVNQKTP YLRNNQLVAG YLQGPNVNLE EKIDWPIEP 60 
HALFLG I HGG KMCLSCVKSG DETRLQLEAV NITDLSENRK QDKRFAFIRS DSGPTTSFES 120 
AACPGWPLCT AMEADQPVSL TNMPDEGVMV TKFYFQBDE 

Seq ID NO: 494 DNA sequence 

Nucleic Acid Accession #t NM_00208i.l 

Coding sequence) 222.. 1898 

1 11 21 31 41 51 

! I I I ! I 

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 60 
GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT GGACCGCGAG CCGCGCGCGC 120 
CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 180 
AGAGGCGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 240 
GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 3 00 
GCAAGAGCCG GAGCTGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 3 60 
GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 420 
CCTGCTGCAC CAGCGAOATG GAGGAGAACC TGGCCAACCG CAGCCATGCC GAGCTGGAGA 480 
CCGCGCTCOG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 540 
TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 600 
CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 660 
AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 720 
GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 780 
ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 840 
GAGAGCTGCG CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 900 
TGGGCGTGGC CAGCGACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960 

CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC GGCGCCAGGC 1020 

CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 1080 

ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 1140 

CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACACGTG GCTGGCGGAG GCCATCAACG 1200 

CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380 

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 1500 

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC OGAGGTGGAG GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620 

TGCGCAGOGC CTACAACGGC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680 

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740 

GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 180 0 

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 1860 

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920 

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980 

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG . 2040 

GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220 

CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG OGAGCCTGTG CCTTCCTCCC 2280 

CGCCTCCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400 

TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520 

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 2580 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640 

GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTOGC ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GOTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGACCCG 2760 

CCTGCTCCCA TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 2820 

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880 

CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940 

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 3060 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120 
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CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG 
TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT 
TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC 
CGCACAGTGG ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC 
CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG 
CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT 
CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT 
TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC 
CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 



Seq ID NO: 495 Protein sequence 
Protein Accession #: NP__002072.1 



1 
I 

MELRARGWWL, 
ICPQGYTCCT 
TLQATPPGAP 
QLLLPDDYLD 
PLGPECSRAV 
TDKFWGTSGV 
RGKLAPRERP 
RGRYLPEVMG 



11 
I 

LCAAAALVAC 
SEMEENLANR 
GELYTQNARA 
CLGKQAEALR 
MKLVYCAHCL 
ESVIGSVHTW 
PSGTLEKLVS 
DGLANQINNP 



21 
1 

ARGDPASKSR 
SHAELETALR 
FRDLYSELRL 
PFGEAPRELR 
GVPGARPCPD 
IAEAINALQD 
EAKAQLRDVQ 
EVEVDITKPD 



31 
I 

SCGEVRQIYG 
DSSRVLQAML 
YYRGANLHLE 
LRATRAFVAA 
YCRNVLKGCL 
NRDTLTAKVI 
DFWISLPGTIi 
MTIRQQIMQIi 



GTGGCCCTGG 
GGAGGGTCTG 
TTTGCTTTTC 
CATGGCTTGT 
GCAAGTCCAC 
GATGCCGGGC 
CAGACCCCAC 
AGTGGCCAAG 
GTGACGTGTG 



41 
I 

AKGFSLSDVP 
ATQLRSFDDH 
BTLABFWARL 
RSPVQGLGVA 
ANQADLDAEW 
QGCGNPKVNP 
CSEKMALSTA 
KIMTNRLRSA 



GGAGGGGTGG 
GGGGCAGCTG 
ATCACCGTCC 
TCTCTGGAAC 
CCCATAATAA 
GCCAGGACAG 
CCTACGCTCA 
CCTGCTGTGT 
TTCTTTTGAG 



51 

I 

QAEISGEHLR 
FQHLLNDSER 
LERLFKQLHP 
SDWRKVAQV 
RNLLDSMVIiI 
QGPGPEEKRR 
SDDRCWNGMA 
YNGNDVDFQD 



Seq ID NO: 496 DNA sequence 

Nucleic Acid Accession fr: NM_001650.2 

Coding sequences 40.1011 



1 
I 

GGGGCAGGCA 
AGGCGGTGGG 
GGGGTCTGGA 
TTTGTTCTCC 
GTCGACATGG 
TTTGGCCATA 
AGGAAGATCA 
ATTGGAGCAG 
ACCATGGTTC 
TTTCAATTGG 
TCAATAGCTT 
ACTGGTGCCA 
GAAAACCATT 
TATGAGTATG 
AAAGCTGCCC 
GAGACGGATG 
GAGGAGAAGA 
CGCACTGAAA 
GAAACAGATT 
GTCTAAACAA 
TCCAAATCTA 
TCTAGTTACC 
CCTGACAGAA 
AGTCAATTCT 



11 
1 

ATGAGAGCTG 
GTAAGTGTGG 
CTCAAGCTTT 
TCAGCCTGGG 
TTCTCATCTC 
TCAGCGGTGG 
GCATCGCCAA 
GAATCCTCTA 
ATGGAAATCT 
TGTTTACTAT 
TAGCAATTGG 
GCATGAATCC 
GGATATATTG 
TCTTCTGTCC 
AGCAAACAAA 
ACCTGATTCT 
AGGGGAAAGA 
GCAGACAAGA 
TGTTATAAAT 
TAAATATTTC 
AAAAAAGAAA 
TTTCATTAAC 
CTCAAAGACA 
TATTTGAATA 



21 
I 

CACTCTGGCT 
ACCTTTGTGT 
CTGGAAAGCA 
ATCCACCATC 
CCTTTGCTTT 
CCACATCAAC 
GTCTGTCTTC 
TCTGGTCACA 
T AC CG CTGGT 
CTTTGCCAGC 
ATTTTCTGTT 
CGCCCGATCC 
GGTTGGGCCC 
AGATGTTGAA 
AGGAAGCTAC 
AAAACCTGGA 
CCAATCTGGA 
CTCCTTAGAA 
TAGAAATGTG 
ATAATTTACA 
TATTTTTAAG 
AACCAATTTT 
CGTCTATCAG 
TTTATTCTAT 



31 
1 

GGGGAAGGCA 
ACCAGAGAGA 
GTCACAGCGG 
AACTGGGGTG 
GGACTCAGCA 
CCTGCAGTGA 
TACATCGCAG 
CCTCCCAGTG 
CATGGTCTCC 
TGTGATTCCA 
GCAATTGGAC 
TTTGGACCTG 
ATCATAGGAG 
TTCAAACGTC 
ATGGAGGTGG 
GTGGTGCATG 
GAGGTATTGT 
CTGTCCTCAG 
CAGGTTTGTT 
AAGGAGGAAC 
ATGTTCTTAA 
AACCGTGTGT 
CTTATTCCTT 
TAAACTGAGT 



Seq ID NO i 497 Protein sequence 
Protein Accession #: NP_001641.1 

1 11 21 31 

III) 
MSDRPTARRW GKCGPLCTRE NIMVAFKGVW TQAFW KAVTA 
GTEKPLPVDM VLISLCFGLS IATMVQCFGH ISGGHINPAV 
AQCLGAIIGA GILYLVTPPS WGGLGVTMV HGNLTAGHGL 
KRTDVTGS I A IAIGFSVAIG HLFAINYTGA SMNPARSFGP 
AVLAGGLYEY VFCPDVEFKR RFKEAFSKAA QQTKGSYMEV 
VIDVDRGEEK KGKDQSGEVL SSV 

Seq ID NO: 498 DNA sequence 

Nucleic Acid Accession ft: AB020684.1 

Coding sequence : 1 . . 1744 



41 
I 

TGAGTGACAG 
ACATCATGGT 
AATTTCTGGC 
GAACAGAAAA 
TTGCAACCAT 
CTGTGGCCAT 
CCCAGTGCCT 
TGGTGGGAGG 
TGGTTGAGTT 
AACGGACTGA 
ATTTATTTGC 
CAGTTATCAT 
CTGTCCTCGC 
GTTTTAAAGA 
AGGACAACAG 
TGATTGACGT 
CTTCAGTATG 
ATTTCCTTCC 
GTTTCATGTC 
GGAAGAAACC 
GCAAATATAT 
CAAGATTTGG 
CTCTACTGGA 
TTAACAATGG 



41 

I 

EFLAMLIFVL 
TVAMVCTRKI 
LVELIITFQL 
AVIMGNWENH 
EDNRSQVETD 



1 
I 

CCCCCTTGTC 
TTGGTACCGG 
GACGGTTACC 
TGCTTGCTTT 
CATATATGGC 
CTTTTTCAAT 
CTCATATCCA 
ACTTTATAGA 
GCAGTTTGCT 
CGGGTACATT 
ACTTTGTTTT 
TTTGGTAATT 



11 
I 

ATTAATACAT 
ATTTATACCA 
AGAGGAGAAG 
TATGTTGCTG 
ACATATTTAA 
CATGGAGAGT 
TTTCTTGTTC 
GGAAGCTTGA 
CAGTTTGTAC 
GATATATGTA 
GTTTTGATGT 
ATTTGGGGTA 



21 
I 

TAAAAAGATT 
AAATAATGGA 
GACTCAGTCC 
TAATTTTTAT 
GTGGCAGCCG 
GTACCCGTGT 
TTCAGATGTT 
TTGCACTCTG 
TTCTTACTCA 
AATTACGGAA 
TTGGGAACTC 
TTCTGGCAAT 



31 
I 

CAATCTTTAC 
CTTGATTGGT 
TATTGAAAGC 
TTTAAATGGA 
ATTAGGAGGC 
AATGTGGACA 
GCTAGTGACT 
CATTTCCAAT 
GATTGCATCA 
GATCATTTAT 
AATGTTATTA 
GAAACCACAT 



41 
I 

CCTGAGGTAA 
ATTCAAACCA 
TGTGAAGGAT 
CTAATGATGG 
CTGGTTACAG 
CCACCTCTCC 
CATATTCTCA 
GTATTTTTCA 
TTATTTGCAG 
ATACACATGA 
ACTTCTTATT 
TTCCTGAAAA 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
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51 
I 

ACCCACAGCA 
GGCTTTCAAA 
CATGCTTATT 
GCCTTTACCG 
GGTGCAGTGC 
GGTGTGCACC 
GGGGGCCATC 
CCTGGGAGTC 
GATAATCACA 
TGTCACTGGC 
AATCAATTAT 
GGGAAATTGG 
TGGTGGCCTT 
AGCCTTCAGC 
GAGTCAGGTA 
TGACCGGGGA 
ACTAGAAGAT 
ACCCATTAAG 
ATATTACTCA 
TATTGTGAAT 
ACCTATTTTA 
TTAAGTCTTG 
ATATTGGTAT 
C 



51 

I 

LSLGSTINWG 
SIAKSVFYIA 
VFTIFASCDS 
WIYWVGPIIG 
DIiILKPGWH 



51 
I 

TTTTGGCCAG 
AGATATGTTG 
TGGGAGATCC 
CATTATTCTT 
TGTTGTGCTT 
GTGAAAGCTT 
GGGCTACAAA 
TGCTTCCTTG 
TATATGTTGT 
TTTCTCTTGC 
ATGCTTCTTC 
TAAATGTATC 



60 
120 
160 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 

ieo 

240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



375 
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TGAACTTAGT TTATGGGTTA TTCAAGGATG TTTTTGGTTA TTTGGAACTG TCATACTTAA 780 

ATACTTGACA TCTAAAATTT TTGGTATTGC AGATGACGCT CATATTGGCA ACTTACTAAC 840 

ATCAAAATTC TTTAGTTATA AGGATTTTGA TACTTTATTG TATACCTGTG CAGCGGAGTT 900 

TGACTTTATG GAAAAAGAGA CTCCACTGAG ATACACAAAG ACATTATTGC TTCCAGTTGT 960 

TCTTGTAGTG TTTGTTGCTA TTGTTAGAAA GATTATTAGT GATATGTGGG GTGTCTTAGC 1020 

TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATCAT GGAGAGCTGG TTTACCATGC 1080 

ATTGCAATTG TT AG CAT AT A CAGCCCTTGG TATTTTAATT ATGAGACTAA AACTCTTCTT 1140 

GACACCACAC ATGTGTGTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGGATGGCT 1200 

CTTTTGCAAA GTACATCCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 1260 

AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTGTAGGG GAGTTCAGCA ATTTGCCCCA 1320 

AGAAGAACTT ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGCGGGTGC 13 80 

CATGCCCACG ATGGCAAGTG TTAAGCTCTC TGCACTTCGG CCCATTGTGA ATCATCCACA 1440 

TTATGAAGAC GCAGGCTTGA GAGCCAGAAC AAAAATAGTA TACTCAATGT ATAGTCGGAA 1500 

AGCAGCCGAA GAAGTGAAGC GAGAACTGAT AAAGTTAAAA GTGAACTATT ACATTCTAGA 1560 

AGAGT CATGG TGTGTAAGAA GATCCAAGCC TGGTTGCAGT ATGCCTGAAA TTTGGGATGT 1620 

AGAAGATCCT GCCAATGCTG GGAAAACTCC CTTATGTAAC CTCTTGGTGA AGGATTCCAA 1680 

ACCTCACTTC ACCACTGTAT TCCAGAACAG TGTTTACAAA GTCCTAGAAG TTGTAAAAGA 1740 

ATGACTGCTA CATGACCTGC TGCCTACGGA GAACTACATC TGTAATGGTT TTAATGTTTT 1800 

GCTAAGTCAT GTGTTGTTCA TATCCCAAAA ACTTTTATAG GTAACTGTTT TCAAATAGAA 1860 

AACGTTTTAT TTGGTCAATT TGAATGTCAT TCTAATTATA AAAATGACTT ACACCTTTAT 1920 

CAATTGGTTA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTTG 1980 

TACTTGACTT TAATATTTGT GCTAAAGTGA GCAAAGCTAC CTGTATAAAG AAAACACAGT 2040 

GGGTTGTGAC AAGGATGACA TGAAAATACA GGACAATTCT GACAATGTAG GGGCTGATTT 2100 

TATAGTGTAA GAACTATTAA TGCCCCTTGC TTCTTTTTTC TGCCTCTTGC TCTTGTCTTT 2160 

TGGACATTTC AGTGATTGTA AGTTCTTCGG TCATGTCAGC CCCTGTCATC AACTTGAGTT 2220 

ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTATCTGTTT GTTTTACTTC 2280 

CTTGTGCGCT TTTTGT.TCTC TGTTCTCTTG TTAATGAAGC TTTTCCTGCC CATTATTAAT 2340 

CCAAACTCTT GGACCTTGTG GTTAGGAAAT TCCCTTAACT TCCAGCCATA TGGCATTATC 2400 

GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT CTTCCCCATA TTTTCTGTCA 2460 

AATAAGTACT GTTT ACT CAT TTAGTTGCTT ATCAAGTACT TATTCTTGGT TTTAAAAAAA 2520 

ATTAATGGTA ACTGTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT ATTTTAATAC 2580 

CTTTAAACCA CTTTAAAGTT TTTTCATGTT TAATTATAGT TTTAAGAAAA ACTATTTTGA 2640 

ACAACCCCAA ATATAGTGCA TCTAGAAACT AATGTATATT TGATTAGACA TCATTTATAG 2700 

TGGAACAGTA GACTGTAGTA CATGGTAATT TTTCTTTTAC TATTAAGATA CAATAAAACA 2760 

TGACTAATTT TGCTGTCAAA AATGTAAAGA ATAATGATAA ATGGAGTTTT TTATATTTTA 2820 

CTTTTAAGAT TGCCTGTCTT TAATAAGACA AAGCCTTAAG CCTTATGTTA TAATTTTGGT 2880 

TCTAAAAACC ATCATTTCAG TATAAGGAAT AAGTATATTT CGTCCTCCTC TTTAGTTTTT 2940 

TTCTTCCTAT TTATTTTTAT TTTGAAAAAT TTCTACACCT TCTTTGAATT CCTTGTATGA 3000 

ATTTTTGTTT CTTAGAAGTT AATTTGTGTG AAATGAGATT CTTCAAAACG ATGAAACCTC 3 060 

ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA AATTCTAAGC AAAGCGTGAC TATGGCTGAC 3120 

AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT AACCTCATTG 3180 

TGGATTGTCC TTCAGACCTT AGTCCTCAGG CATGG TTTCT GGTGCCCACT CCTGGAAGCC 3240 

GCTGTTCCCT TTCTACCTTC TTACCAGAGC CCAAGGGCAG GCCTGGTCCC GGGGAAGCAG 3300 

CAGCTTGCTG ACATAAGTCA GCTGCAAAGG CTGAGGAGTG TGCCCTCAGA GAAGCACCGC 3360 

CCCCCAGTCT TGTGCCAGCG CCTAGAGCCG CAGCTCCCAG GGATGCTCCT TCCCTGGAGG 3420 

CAGCCCAGGA GAGGGACTCT GGCAGCGTTC TTCAGATTTG TGGCCACTGT TTCTCATTTG 3480 

CTGGTTGACT GTTTTTATTT CTTAGG CTTT TGCTAGTTTT AGAAAATAGG GAAGCAG CCC 3540 

TTGATTTGTG GATTAAAAGC AACATTTGAG CGATGATGCA CAACAGTCCA GGAAAATGGG 3600 

CGGTGGACAC TTGAGGCTGA GGATGGGAGT TGACATGAGC AGGGAGAGGG AGGTGCGCGC 3660 

TGCTTATCTG TGATTGTTGC TCACCTGAGT GTGGCTGATT GTGTACATCC AGCAGTTACA 3720 

ATTTTTAAAA ATTATACTTT TACATTTATT TTATATTTTT CTCACCCCCA GTAATTTCCT 3780 

TCCAAAGAAG TTCACATGTA ATAAGTAGAA ATTCTGTATA GGAAAAAAGC ATTAAAAATA 3 840 

CTATTATAAC TGCTTCATTT GCTGGGAACC ATTAAAAGTA ATATAAATTA GCTTTTTCCA 3900 

GAAGGATCCT TTTGTAGCAG TGTTTATGAA TGTAACCCCC AGCAAAATAT GGCTATATAT 3960 

TAGGGGAGCC AGTTTGGAGC AGAGGCCTGA AGGTCCCTGC TATGCAGCCG TGGCCACAGC 4020 

TCGCAGCCCA AGCACTGTGG AGCATCCACA CCTTTGATGG CAATGCAGAT TGGTAGCAGG 4080 

TTCCATAGGC GTACAAAACA GTATTAAAGC TCAGTGTTTT GCATATTGTT AGCATTTACA 4140 

AATATTTTTG CTTTAGTATG AGGAAAGTAA GGATGGGCAA AGAAGCGATC AAAATAGCTA 4200 

TTGCTACAAC ATTTTCGAAA ACAAAGTTGG GGCTGTATTT CTTTAAAAAG ATAAGCCTCT 4260 

AAAAATGCTT GGCAAAAAAA ATATAGTGTT AAAATAGGCC AGTGATATTA ATGAGAAAAT 4320 

GAAAGTATGT ATCAGGAATA AAGTGATATT GCATAGGAGT ATTGTATTTT TATGAATTTT 4380 
ATGCCAGTTG TTTACATGTA CTATATATGT TAAATTAAAA AAAATCATGA GAAATG 

Seq ID NO* 499 Protein sequence 
Protein Accession ft: BAA74900.1 

1 11 21 31 41 51 

I I I I I I 

PLVINTLKRF NLYPEVIliAS WYRIYTKIMD LIGIQTKICW TVTRGEGLSP IESCEGLGDP 60 

ACPYVAVIFI LNGLMMALFF IYGTYLSGSR LGGLVTVLCF PFNHGECTRV MWTPPLRESF 120 

SYPFLVLQML LVTHI LRATK LYRGSLIALC ISNVFFMLPW QFAQFVLLTQ IASLFAVYW 180 

GYIDICKLRK IIYIHMISLA LCFVLMFGNS MIiLTSYYASS LVIIWGILAM KPHFLKINVS 240 

ELSLWVIQGC FWLFGTVILK YLTSKIFGIA DDAHIGNLLT SKFFSYKDFD TLLYTCAAEF 300 

DFMEKETPLR YTKTLLLPW LWFVAIVRK IISDMWGVLA KQQTHVRKHQ FDHGELVYHA 360 

LQLLAYTALG ILIMRLKLFL TPHMCVMASL ICSRQLFGWL FCKVHPGAIV FAILAAMSIQ 420 

GSANLQTQWN IVGEFSNLPQ EELIEWIKYS TKPDAVFAGA MPTMASVKLS ALRPIVNHPH 480 

YEDAGLRART KIVYSMYSRK AAEEVKRELI KLKVNYYILE ESWCVRRSKP GCSMPEIWDV 540 
EDPANAGKTP LCNLLVKDSK PHFTTVFQNS VYKVLEWKE 

Seq ID NO: 500 DNA sequence 

Nucleic Acid Accession ft: NM_001276.1 

Coding sequence: 127.. 1278 

1 11 21 31 41 51 

I II I i 1 

AGTGGAGTGG GACAGGTATA TAAAGGAAGT ACAGGGCCTG GGGAAGAGGC CCTGT CT AGG 60 

TAGCTGGCAC CAGGAGCCGT GGGCAAGGGA AGAGGCCACA CCCTGCCCTG CTCTGCTGCA 120 
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GCCAGAATGG GTGTGAAGGC GTCTCAAACA GG CTTTGTGG 
TGCTCTGCAT ACAAACTGGT CTGCTACTAC ACCAGCTGGT 
GGGAGCTGCT TCCCAGATGC CCTTGACCGC TTCCTCTGTA 
GCCAATATAA GCAACGATCA CATCGACACC TGGGAGTGGA 
ATGCTCAACA CACTCAAGAA CAGGAACCCC AACCTGAAGA 
TGGAACTTTQ GGTCTCAAAG ATTTTCCAAG ATAGCCTCCA 
TTCATCAAGT CAGTACCGCC ATTCCTGCGC ACCCATGGCT 
TGGCTCTACC CTGGACGGAG AGACAAACAG CATTTTACCA 
GCCGAATTTA TAAAGGAAGC CCAGCCAGGG AAAAAGCAGC 
TCTGCGGGGA AGGTCACCAT TGACAGCAGC TATGACATTG 
GATTTCATTA GCATCATGAC CTACGATTTT CATGGAGCCT 
CACAGTCCCC TGTTCCGAGG TCAGGAGGAT GCAAGTCCTG 
TATGCTGTGG GGTACATGTT GAGGCTGGGG GCTCCTGCCA 
CCCACCTTCG GGAQGAGCTT CACTCTGGCT TCTTCTGAGA 
TCAGGACCGG GAATTCCAGG CCGGTTCACC AAGGAGGCAG 
ATCTGTGACT TCCTCCGCGG AGCCACAGTC - CATAGAACCC 
GCCACCAAGG GCAACCAGTG GGTAGGATAC GACGACCAGG 
CAGTACCTGA AGGATAGGCA GCTGGCAGGC GCCATGGTAT 
TTCCAGGGCT CCTTCTGCGQ CCAGGATCTG CGCTTCCCTC 
GCACTCGCTG CAACGTAGCC CTCTGTTCTG CACACAGCAC 
CCCCCTCTGG CTCCAGCTGG CCGGGAGCCT GATCACCTGC 
GCCTCAGTCT CCCTCCCTTG GGGCCTATGC AGAGGTCCAC 
GCCCTGGTGG GCAGAGAGGT AGGGATGGGG CTGTGGGGAT 
GACTCGGGAT TAGTACACAC TTGTTGATGA TTAATGGAAA 
TGGCAAGGGA ATTTCTTCAA CTCCCTGCCC CCTAGCCCTC 
TGGCAAGCTC TATCACCAAG GAGCCAAACA TCCTACAAGA 
TACCCCCTGC AAAGCCAGCT TGAAACCTTC ACTTAGGAAC 
ACTTCCCCTT CCTAATTCCA CAGCTGCTCA ATAAAGTACA 
CGCTTTGCTT TGGTCTATCT TTGAGCGCCC ACTAGACCCA 
TCTTCTGGGT TCCTTCCTCT GAGCCTTGGG ACCCCTGAGC 
ATGTT 

Seq 10 NO: 501 Protein sequence 
Protein Accession «i NP_001267.l 



PCT/US02/12476 



1 

I 

MGVKASQTGP 
X SNDH I DTWB 
KSVPPPLRTH 
GKVTIDSSYD 
VGYMLRLGAP 
DPLRGATVHR 
GSFCGQDLRF 



11 
I 

WLVLLQCC3 
WNDVTLYGML, 
GFDGLDLAWL 
IAKISQHLDF 
ASKLVMGI PT 
TLGQQVPYAT 
PLTNAIKDAL 



21 



AYKLVCYYTS 
NTLKNRNPNIi 



I9IMTYDPHG 
FGRSFTLASS 
KGNQWVGYDD 
AAT 



31 
I 

WSQYREGDGS 
KTLIiSVGGWN 
TTLIKEMKAE 
AWRGTTGHHS 
ETGVGAPXSG 
QESVKSKVQY 



Seq ID NO t 502 DNA sequence 
Nucleic Acid Accession #: Nl 
Coding sequence: 181. .669 



006474.1 



1 
I 

GCTGCCTAGG 
TCCGGCCCCC 
TTCCCCCAGC 
ATGTGGAAGG 
GAAGGAGCCA 
GTTGCCATGC 
AAGTCTGGCT 
GAGGATCTGC 
GCCTCAAACG 
GTTGAGAAAG 
GCCATCGGTT 
-TCGCCCTAAA 
TTCTGACTCT 
CGGGCCCATT 
TCACCAGATT 



11 
I 

GTCTGGAAAG 
CCACCGTCGC 
TCAGAATCTT 
TGTCAGCTCT 
GCACAGGCCA 
CAGGTGCCGA 
TGACAACTCT 
CAACTTCAGA 
TGGCCACCAG 
ATGGTTTGTC 
TCATTGGTGG 
GAGCTGAAGG 
GTGGCCCTGT 
CAGATTCCAC 
TGGTTCTTAA 



21 
I 

CTCGGGCACC 
GCTCCTCCAG 
GCTGCTCGGC 
GCTCTTCGTT 
GCCAGAAGAT 
AGATGATGTG 
GGTGGCAACA 
AAGCACAGTC 
TCACTCCACG 
AACAGTGACC 
AATCATCGTT 
GTTACGCCCT 
CCCTGAGCTC 
GGTGACTTTC 
ACTTT 



31 

I 

CTCCCTCTCC 
GCTGGGCCTG 
CCCCAGGAGA 
TTGGGAAGCG 
GACACTGAGA 
GTGACTCCAG 
AGTGTCAACA 
CACGCGCAAG 
GAGAAAGTGG 
CTGGTTGGAA 
GTGGTTATGC 
GCTTGCCAAC 
GTGGGGAGAA 
CGTTTGCCAA 



TCCTGGTGCT 
CCCAGTACCG 
CCCACATCAT 
ATGATGTGAC 
CTCTCTTGTC 
ACACCCAGAG 
TTGATGGGCT 
CCCTAATCAA 
TCCTGCTCAG 
CCAAGATATC 
GGCGTGGGAC 
ACAGATTCAG 
GTAAGCTGGT 
CTGGTGTTGG 
GGACCCTTGC 
TCGGCCAGCA 
AAAGCGTCAA 
GGGCCCTGGA 
TCACCAATGC 
GGGGGCCAAG 
CCTGCTGAGT 
AACACACAGA 
AGTGAGGCAT 
TGTTTACAGA 
CTTATCAAAG 
CACAGTGACC 
GTAATCGTGT 
AGAGTTTAAC 
CTGGACTCAC 
TTGCAGAGAT 



41 

I 

CFPDALDRFL 
FGSQRPSKIA 
FIKEAQPGKK 
PLFRGQEDAS 
PGIPGRFTKE 
LKDRQLAGAM 



41 

I 

GGGGCTCCTG 
TGGCCGCGGT 
GCAACAACTC 
OGTCGCTCTG 
CTACAGGTTT 
GAACCAGCGA 
GTGTAACAGG 
AACAAAGTCC 
ATGGAGACAC 
TCATAGTTGG 
GAAAAATGTC 
GTGCTTTAAA 
GATGACCCTG 
ATTAACCGAG 



GCTCCAGTGC 
GGAAGGCGAT 
CTACAGCTTT 
GCTCTACGGC 
TGTCGGAGGA 
TCGCCGGACT 
GGACCTTGCC 
GGAAATGAAG 
CGCAGCACTG 
CCAACACCTG 
CACAGGCCAT 
CAACACTGAC 
GATGGGCATC 
AGCCCCAATC 
CTACTATGAG 
GGTCCCCTAT 
AAGCAAGGTG 
CCTGGATGAC 
CATCAAGGAT 
GATGCCCCGT 
CCCAGGCTGA 
TTTGAGCTCA 
CGCAATGTAA 
TCCCCAAGCC 
GACACCATTT 
ATACTAATTA 
CCCCTATCCT 
AGTGTGTTGG 
CTCCCCCATC 
GAAGGCCGCC 



51 
I 

CTHIIYSPAN 
SNTQSRRTPI 
QLLLSAALSA 
PDRPSNTDYA 
AGTLAYYEIC 
VWALDLDDFQ 



51 
I 

CTCCCACCCC 
GCTTTTAATT 
AACGGGAACG 
GGTCCTGGCA 
GGAAGGCGGC 
AGACCGCTAT 
CATTCGCATC 
AAGCGCCACA 
ACAGACAACA 
GGTCTTACTA 
GGGAAGGTAC 
AAAAGACCGT 
GGAACATTTG 
GAAAGACCTT 



Seq ID NOt 503 Protein sequence 
Protein Accession #: NP_006465.1 

1 11 21 31 41 51 

I I I I I I 

MWKVSALLFV LGSASLWVLA EGASTGQPED DTETTGLEGG VAMPGAEDDV VTPGTSEDRY 
KSGLTTLVAT SVNSVTGIRI EDLPTSESTV HAQEQSPSAT ASNVATSHST EKVDGDTQTT 
VEKDGLSTVT LVGIIVGVLI* AIGFIGGIIV WMRKMSGRY SP 

Seq ID NO i 504 DNA sequence . 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 62.. 895 



1 
I 

CACTGCTCTG 
TATGATCATC 
CAAGGATGGA 
AGAAGCACGG 
TGAAGGCGGC 



11 

i 

AGAATTTGTG 
TTAATTTACT 
ATT TTT CAT A 
TCTGGCAAAT 
CATCTCGCAA 



21 31 

I 1 
AGCAGCCCCT AACAGGCTGT 
TATTTCTCTT GCTATGGGAA 
ACTCCATATG GCTTGAACGA 
ACAAGCTCAC CTACGCAGAA 
CTTACAAGCA GCTAGAGGCA 



41 51 

I I 
TACTTCACTA CAACTGACGA 
GACACTCAAG GATGGGGATT 
GCAGCCGGTG TGTACCACAG 
GCTAAGGCGG TGTGTGAATT 
GCCAGAAAAA TTGGATTTCA 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 



60 
120 
180 
240 
300 



377 
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10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



TGTCTGTGCT 
GCCCAACTGT 
TGAAAGATGG 
AGATCCAAAG 
CTGCTACTGQ 
TGACCTTGAA 
TGATGTCCAT 
TACAGGAAAT 
CCAAATCAAA 
TACTACTTCT 
AAAAAAAGGA 
CTCACTGTTA 
TAGGGAAAAT 
ACTGCATAGA 
TTTGTGGTAT 
TCTATGTACA 
TCATTGATTA 
TGTTTTATGC 
ATTGTTGCAA 



GCTGGATGGA 
GGATTTGGAA 
GATGCCTATT 
CAAATTTTTA 
CACATTAGAC 
GATGACCCAG 
GGCTTTGTGG 
GTCATGACCT 
TATGTTGCAA 
ACTGGAAATA 
TGATCAAAAC 
TTATTAACAT 
TGGAAAATAT 
AATAACAAGC 
ATGTATATAT 
GTTTTGTATT 
TTCTACAAAA 
ATTATTTAAG 
TAAATATCCT 



TGGCTAAGGG 
AAACTGG CAT 
GCTACAACCC 
AATCTCCAGG 
TCAAGTATGG 
GTTGCTTGGC 
GAAGATACTG 
TGAAGTTTCT 
TGGATCCTGT 
AAAACTTTTT 
ACACAGTGTT 
TTATTTATTA 
AGGAAACTTT 
GTTAACATTT 
GTACCTATAT 
ATACTTTTTA 
ACATGATTTT 
CCTGTCTCTA 
TGAACACACA 



CAGAGTTGGA 
TATTGATTAT 
ACACGCAAAG 
CTTCCCAAAT 
TCAGCGTATT 
TGATTATGTT 
TGGAGATGAG 
AAGTGATGCT 
ATCCAAATCC 
AGCTGGAAGA 
TATGTTGGAA 
TTTTTCTAAA 
AAACGAGAAA 
TCATATTTTT 
GTATTTGCAT 
AATCTTGAAC 
AAACAGCTGT 
TTGTTGGAAT 
AAAAAAAAAA 



TACCCCATTG 
GGAATCCGTC 
GAGTGTGGTG 
GAGTACGAAG 
CACCTGAGTT 
GAAATATATG 
CTTCCAGATG 
TCAGTGACAG 
AGTCAAGGAA 
TTTAGCCACT 
TCTTTTGGAA 
TGTGAAAGCA 
ATGAAACCTC 
TTCTTTCAGT 
TTGAAATTTT 
TTTATAAACA 
AAAATATTCT 
TTCAGGTCAT 
AA 



TGAAGCCAGG 
TCAATAGGAG 
GCGTCTTTAC 
ATAACCAAAT 
TTTTAGATTT 
ACAGTTACGA 
ACATCATCAG 
CTGGAGGTTT 
AAAATACAAG 
TATAAAAAAA 
CTCCTTTGAT 
ATACATAATT 
TCATAATCCC 
CATTTTTCTA 
GGAATCCTGC 
TTTTCTGAAA 
ATGATATGAA 
TTTCATAAAT 



Seq ID NO: 505 Protein sequence 
Protein Accession ft: Eos Bequence 

1 11 21 31 41 51 

1)1111 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEP 

EGGHLATYKQ LEAARKIGPH VCAAGWMAKG RVGYPIVKPG PNCGFQKTGI IDYGIRLNRS 

ERWDAYCYNP HAKECGGVFT DPKQIPKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 

DLEDDPGCLA DYVEIYDSYD DVHGPVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGP 

Q I KYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRPSHL 

Seq ID NO: 50 6 DNA sequence 

Nucleic Acid Accession ft: NM_007115.1 

Coding sequences 69.. 902 



GAATTCGCAC 
CTGACGATAT 
GGGGATTCAA 
ACCACAGAGA 
GTGAATTTGA 
GATTTCATGT 
AGCCAGGGCC 
ATAGGAGTGA 
TCTTTACAOA 
ACCAAATCTG 
TAGATTTTGA 
GTTACGATGA 
TCATCAGTAC 
GAGGTTTCCA 
ATACAAGTAC 
AAAAAAAAAA 
TTGATCTCAC 
TAATTTAGGG 
ATCCCACTGC 
TTGTATTTGT 
„ CCTGCTCTAT 
TGAAATCATT 
ATGAATGTTT 
TAAATATTGT 



11 
I 

TGCTCTGAGA 
GATCATCTTA 
GGATGGAATT 
AGCACGGTCT 
AGGCGGCCAT 
CTGTGCTGCT 
CAACTGATGA 
AAGATGGGAT 
TCCAAAGCGA 
CTACTGGCAC 
CCTTGAAGAT 
TGTCCATGGC 
AGGAAATGTC 
AATCAAATAT 
TACTTCTACT 
AAGGATGATC 
TGTTATTATT 
AAAATTGGAA 
ATAGAAATAA 
GGTATATGTA 
GTACAGTTTT 
GATTATTCTA 
TATGCATTAT 
TGCAATAAAT 



21 
I 

ATTTGTGAGC 
ATTTACTTAT 
TTTCATAACT 
GGCAAATACA 
CTCGCAACTT 
GGATGGATGG 
TTTGGAAAAA 
GCCTATTGCT 
ATTTTTAAAT 
ATTAGACTCA 
GACCCAGGTT 
TTTGTGGGAA 
ATGACCTTGA 
GTTGCAATGG 
GGAAATAAAA 
AAAACACACA 
AACATTTATT 
AATATAGGAA 
CAAGCGTTAA 
TATATGTACC 
GTATTATACT 
CAAAAACATG 
TTAAGCCTGT 
ATCCTT CGGA 



31 

I. 

AGCCCCTAAC 
TTCTCTTGCT 
CCATATGGCT 
AGCTCACCTA 
ACAAGCAGCT 
CTAAGGGCAG 
CTGGCATTAT 
ACAACCCACA 
CTCCAGGCTT 
AGTATGGTCA 
GCTTGGCTGA 
GATACTGTGG 
AGTTTCTAAG 
ATCCTGTATC 
ACTTTTTAGC 
GTGTTTATGT 
TATTATTTTT 
ACTTTAAACG 
CATTTTCATA 
TATATGTATT 
TTTTAAATCT 
ATTTTAAACA 
CTCTATTG TT 
ATTC 



41 

I 

AGGCTGTTAC 
ATGGGAAGAC 
TGAACGAGCA 
CGCAGAAGCT 
AGAGGCAGCC 
AGTTGGATAC 
TGATTATGGA 
CGCAAAGGAG 
CCCAAATGAG 
GCGTATTCAC 
TTATGTTGAA 
AGATGAGCTT 
TGATGCTTCA 
CAAATCCAGT 
TGGAAGATTT 
TGGAATCTTT 
CTAAATGTGA 
AGAAAATGAA 
TTTTTTTCTT 
TGCATTTGAA 
TGAACTTTAT 
GCTGTAAAAT 
GGAATTTCAG 



51 
I 

TTCACTACAA 
ACTCAAGGAT 
GCCGGTGTGT 
AAGGCGGTGT 
AGAAAAATTG 
CCCATTGTGA 
ATCCGTCTCA 
TGTGGTGGCG 
TAOGAAGATA 
CTGAGTTTTT 
ATATATGACA 
CCAGATGACA 
GTGACAGCTG 
CAAGGAAAAA 
AGCCACTTAT 
TGGAACTCCT 
AAGAAATACA 
ACCTCTCATA 
TCAGTCATTT 
ATTTTGGAAT 
GAACATTTTC 
ATTCTATGAT 
GTCATTTTCA 



Seq ID NO i 507 Protein sequence 
Protein Accession ft: NP_ 009046.1 



11 



21 



31 



41 51 
I I 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEP 
EGGHLATYKQ LEAARKIGPH VCAAGWMAKG RVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 
ERWDAYCYNP HAKECGGVFT DPKRIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGPVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGP 
Q I KYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRPSHL 



Seq ID NO: 508 DNA sequence 

Nucleic Acid Accession ft: NM_001044.1 

coding sequencer 129.. 1991 



ACCGCTCCGG 
AAAGCCCAGG 
GTGTGCCCAT 
CTAAGGAGCC 
ACGGAGTGCA 
AGGATCGGGA 
TGGACCTGGC 



11 

t 

AGCGGGAGGG 
CCCGGGCGGC 
GAGTAAGAGC 
CAATGCCGTG 
GCTCACCAGC 
GACCTGGGGC 
CAACGTCTGG 



21 
1 

GAGGCTTCGC 
CAGACCAAGA 
AAATGCTCCG 
GGCCCGAAGG 
TCCACCCTCA 
AAGAAGATCG 
CGGTTCCCCT 



31 
I 

GGAACGCTCT 
GGGAAGAAGC 
TGGGACTCAT 
AGGTGGAGCT 
CCAACCCGCG 
ACTTTCTCCT 
ACCTGTGCTA 



41 
I 

CGGCGCCAGG 
ACAGAATTCC 
GTCTTCCGTG 
CATCCTTGTC 
GCAGAGCCCC 
GTCCGTCATT 
CAAAAATGGT 



51 

I 

ACTCGCGTGC 
TCAACTCCCA 
GTGGCCCCGG 
AAGGAGCAGA 
GTGGAGGCCC 
GGCTTTGCTG 
GGCGGTGCCT 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 



378 



WO 02/086443 

TCCTGGTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCCACTTTTC TACATGGAGC 480 

TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG CCGCTGGTGT CTGGAAGATC TGCCCCATAC 540 

TGAAAGGTGT GGGCTTCACG GTCATCCTCA TCTCACTGTA TGTCGGCTTC TTCTACAACG 600 

TCATCATCGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT CACCACGGAG CTCCCCTGGA 660 

TCCACTGCAA CAACTCCTGG AACAGCCCCA ACTGCTCGGA TGCCCATCCT GGTGACTCCA 720 

GTGGAGACAG CTCGGGCCTC AACGACACTT TTGGGACCAC ACCTGCTGCC GAGTACTTTG 780 

AACGTGGCGT GCTGCACCTC CACCAGAGCC ATGGCATCGA CGACCTGGGG CCTCCGCOGT 840 

GGCAGCTCAC AGCCTGCCTG GTGCTGGTCA TCGTGCTGCT CTACTTCAGC CTCTGGAAGG 900 

GCGTGAAGAC CTCAGGGAAG GTGGTATGGA TCACAGCCAC CATGCCATAC GTGGTCCTCA 960 

CTGCCCTGCT CCTGCGTGGG GTCACCCTCC CTGGAGCCAT AGACGGCATC AGAGCATACC 1020 

TGAGCGTTGA CTTCTACCGG CTCTGCGAGG CGTCTGTTTG GATTGACGCG GCCACCCAGG 1080 

TGTGCTTCTC CCTGGGCGTG GGGTTCGGGG TGCTGATCGC CTTCTCCAGC TACAACAAGT 1140 

TCACCAACAA CTGCTACAGG GACGCGATTG TCACCACCTC CATCAACTCC CTGACGAGCT 1200 

TCTCCTCCGG CTTCGTCGTC TTCTCCTTCC TGGGGTACAT GGCACAGAAG CACAGTGTGC 1260 

CCATCGGGGA CGTGGCCAAG GACGGGCCAG GGCTGATCTT CATCATCTAC CCGGAAGCCA 1320 

TCGCCACGCT CCCTCTGTCC TCAGCCTGGG CCGTGGTCTT CTTCATCATG CTGCTCACCC 1380 

TGGGTATCGA CAGCGCCATG GGTGGTATGG AGTCAGTGAT CACCGGGCTC ATCGATGAGT 1440 

TCCAGCTGCT GCACAGACAC CGTGAGCTCT TCACGCTCTT CATCGTCCTG GCGACCTTCC 1500 

TCCTGTCCCT GTTCTGCGTC ACCAACGGTG GCATCTACGT CTTCACGCTC CTGGACCATT 1560 

TTGCAGCCGG CACGTCCATC CTCTTTGGAG TGCTCATCGA AGCCATCGGA GTGGCCTGGT 1620 

TCTATGGTGT TGGGCAGTTC AGCGACGACA TCCAGCAGAT GACCGGGCAG CGGCCCAGCC 1680 

TGTACTGGCG GCTGTGCTGG AAGCTGGTCA GCCCCTGCTT TCTCCTGTTC GTGGTCGTGG 1740 

TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA CATCTTCCCC GACTGGGCCA 1800 

ACGCGCTGGG CTGGGTCATC GCCACATCCT CCATGGCCAT GGTGCCCATC TATGCGGCCT 1860 

ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAGAAACT GGCCTACGCC ATTGCACCCG 1920 

AGAAGGACCG TGAGCTGGTG GACAGAGGGG AGGTGCGCCA GTTCACGCTC CGCCACTGGC 1980 

TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT CATCCTGCAA TGGGAGAGAC 2040 

ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAG GGCAACTTCT ACTCTTCAAC 2100 

CTCTACTGAA AACACAAACA ACAAAGCAGA AGACTCCTCT CTTCTGACTG TTTACACCTT 2160 

TCCGTGCCGG GAGCGCACCT CGCCGTGTCT TGTGTTGCTG TAATAACGAC GTAGATCTGT 2220 

GCAGCGAGGT CCACCCCGTT GTTGTCCCTG CAGGGCAGAA AAACGTCTAA CTTCATGCTG 2280 

TCTGTGTGAG GCTCCCTCCC TCCCTGCTCC CTGCTCCCGG CTCTGAGGCT GCCCCAGGGG 2340 

CACTGTGTTC TCAGGCGGGG ATCACGATCC TTGTAGACGC ACCTGCTGAG AATCCCCGTG 2400 

CTCACAGTAG CTTCCTAGAC CATTTACTTT GCCCATATTA AAAAGCCAAG TGTCCTGCTT 2460 

GGTTTAGCTG TGCAGAAGGT GAAATGGAGG AAACCACAAA TTCATGCAAA GTCCTTTCCC 2520 

GATGCGTGGC TCCCAGCAGA GGCCGTAAAT TGAGCGTTCA GTTGACACAT TGCACACACA 2 580 

GTCTGTTCAG AGGCATTGGA GGATGGGGGT CCTGGTATGT CTCACCAGGA AATTCTGTTT 2640 

ATGTTCTTGC AGCAGAGAGA AATAAAACTC CTTGAAACCA GCTCAGGCTA CTGCCACTCA 2700 

GGCAGCCTGT GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG AGGAGCGTGT CCTATCCCCG 2760 

GACG CATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG GACGCATGCA GGGCCCCCAC 2820 

AGGAGCATGT CCTATCCCTG GACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG 2880 

AACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC 2940 

TGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG 3000 

GACCGGACGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC CCCAGGACGC ATGCAGGGCC 3060 

CCCACAGGAG CGTGTACTAC CCCAGGATGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC 3120 

CCCAGGACGC ATGCAGGGCC CCCATGCAGG CAGCCTGCAG ACCAACACTC TGCCTGGCCT 3180 

TGAGCCGTGA CCTCCAGGAA GGGACCCCAC TGGAATTTTA TTTCTCTCAG GTGCGTGCCA 3240 

CATCAATAAC AACAGTTTTT ATGTTTGCGA ATGGCTTTTT AAAATCATAT TTACCTGTGA 3300 

ATCAAAACAA ATTCAAGAAT GCAGTATCCG CGAGCCTGCT TGCTGATATT GCAGTTTTTG 3360 

TTTACAAGAA TAATTAGCAA TACTGAGTGA AGGATGTTGG CCAAAAGCTG CTTTCCATGG 3420 

CACACTGCCC TCTGCCACTG ACAGGAAAGT GGATGCCATA GTTTGAATTC ATGCCTCAAG 3480 

TCGGTGGGCC TGCCTACGTG CTGCCCGAGG GCAGGGGCCG TGCAGGGCCA GTCATGGCTG 3540 

TCCCCTGCAA GTGGACGTGG GCTCCAGGGA CTGGAGTGTA ATGCTCGGTG GGAGCCGTCA 3600 

GCCTGTGAAC TGCCAGGCAG CTGCAGTTAG CACAGAGGAT GGCTTCCCCA TTGCCTTCTG 3660 

GGGAGGGACA CAGAGGACGG CTTCCCCATC GCCTTCTGGC CGCTGCAGTC AGCACAGAGA 3720 

GCGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC AGTTTCCCCA TCGCCTTCTG 3780 

GTTGTTGAAG ACAGCACAGA GAGCGGCTTC CCCATCGCCT TCTGGGGAGG GGCTCCGTGT 3840 

AGCAACCCAG GTGTTGTCCG TGTCTGTTGA CCAATCTCTA TTCAGCATCG TGTGGGTCCC 3 900 
TAAGCACAAT AAAAGACATC CACAATGGAA AAAAAAAAAG GAATTC 

Seq ID NO i 509 Protein sequence 
Protein Accession #j NPJ>01035.1 

1 11 21 31 41 51 

] I I I I I 

MSKSKCSVGL MS8WAPAKE PNAVGPKEVE LILVKEQNGV QLTSSTLTNP RQSPVEAQDR 60 

ETWGKKIDFL LSVIGFAVDL ANVWRFPYIiC YKNGGGAFLV PYLLFMVIAG MPLFYMELAL 120 

GQFNREGAAG VWKICPIUCG VGFTVILISI* YVGFFYNVII AWALHYLFSS FTTELPWIHC 180 

NNSWNSPNCS DAHPGDSSGD SSGLNDTPGT TPAAEYFERG VLHLHQSHGI DDLGPPRWQL 240 

TACLVLVIVL IiYFSLWKGVK TSGKWW I TA TMPYWLTAI* LLRGVTLPGA IDGIRAYL9V 300 

DFYRIiCEASV WIDAATQVCP SLGVGFGVLI AFSSYNKFTN NCYRDAIVTT SINSLTSFSS 360 

GFWFSFliGY MAQKHSVPIG DVAKDGPGLI FIIYPEAIAT LPLSSAWAW FFIMLLTLGI 420 

DSAKGGMESV ITGLIDEFQL LHRHRELFTL PIVLATFLLS LFCVTMGGIY VFTLLDHFAA 480 

GTSILFGVLI EAIGVAWFYG VGQFSDDIQQ MTGQRPSLYW RLCWKLVSPC FLLFWWSI 540 

VTFRPPHYGA YIFPDWANAL GWVIATSSMA MVP I YAAYKF CSLPGSPREK LAYAIAPEKD 600 
REIiVDRGEVR QFTLRHWIiKV 

Seq ID NO: 510 DNA sequence 
Nucleic Acid Accession #* NMJ>01216.1 
Coding sequence: 43.. 1422 

1 .11 21 31 

I 1 I I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC 
AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG 
CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT 
TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC 



41 51 
1 I 

GCATGGCTCC CCTGTGCCCC 60 

GCCTCACTGT GCAACTGCTG 120 

TGCCCCGGAT GCAGGAGGAT 180 

TGGGCGAGGA GGATCTGCCC 240 



379 



WO 02/086443 

AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCOGGAG AGGAGGATCT ACCTGGAGAG 300 
GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 
TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 
AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480 
CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540 
CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 
CTCCCGCCGC TCCCAGAACT GCGCCTGOGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 
CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720 
CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC "780 
CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 
GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900 
AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 

CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020 

TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 

CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 1200 

GAGGCCTCGT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 12 60 

AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 13 20 

ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380 

GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 

TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCOGGTA ACTGTCCTGT CCTGCTCATT 1500 
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 



Seq ID NO: 511 Protein sequence 
Protein Accession #: NP_001207.1 

1 ll 21 31 41 51 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLIi LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 

GEEDLPS BED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDIjPTVEAPG 120 

DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 

ELLGPQLPPL PBLRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 

VEGHRFPAEI HWHLSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLIjSRLEEIA 300 

EEGSETQVPG LD IS ALL PSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360 

DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420 
GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 



Seq ID HO: 512 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3 978 

1 ll 21 31 41 51 

llllll 

ATGGTGGGTG AAGGACCCTA CCTTATCTCA GATCTGGACC AGCGAGGCCG GCGGAGATCC 60 

TTTGCAGAAA GATATGACCC CAGCCTGAAG ACCATGATCC CAGTGCGACC CTGTGCAAGG 120 

TTAGCACCCA ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT TTCCTGGCTC 180 

ACGCCGGTGA TGGTGAAAGG CTACCGGCAA AGGCTGACCG TAGAC ACCC T GCCCCCATTG 240 

TCGACATATG ACTCATCTGA CACCAATGCC AAAAGATTTC GAGTCCTTTG GGATGAAGAG 300 

GTAGCAAGGG TGGGTCCTGA GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 360 

ACACGCGTGT TGATGGACAT CGTGGCCAAC AT CCTGTGCA TCATCATGGC AGCCATAGGG 420 

CCGACAGTTC TCATTCACCA AATCCTCCAG CAGACTGAGA GGACCTCTGG GAAAGTCTGG 480 

GTTGGCATTG GACTGTGCAT AGCCCTTTTT GCCACCGAGT TTACCAAAGT CTTCTTTTGG 540 

GCCCTTGCCT GGGCCATCAA CTACCGCACG GCCATCCGGT TGAAGGTGGC GCTCTCCACC 600 

TTGGTTTTTG AAAACCTAGT GTCCTTCAAG ACATTGACCC ACATCTCTGT TGGCGAGGTG 660 

CTCAATATAC TGTCAAGTGA TAGCTATTCT TTGTTTGAAG CTGCCTTGTT TTGTCCTTTG 720 

CCAGCCACCA TCCCGATCCT AATGGTCTTT TGTGCGGCGT ACGCCTTTTT CATTCTGGGG 780 

CCCACAGCTC TCATCGGGAT ATCAGTGTAT GTCATATTCA TACCCGTCCA GATGTTTATG 840 

GCCAAGCTCA ATTCAGCTTT CCGAAGGTCA GCAATTTTGG TGACAGACAA GCGAGTTCAG 900 

ACAATGAATG AGTTTCTGAC CTGCATCAGG CTGATCAAAA TGTATGCCTG GGAGAAATCT 960 

TTTACCAACA CTATCCAAGA TATAAGAAGG AGGGAAAGAA AATTACTGGA AAAAGCTGGA 1020 

TTTGT CCAAA GTGGAAACTC TGCCCTGGCC CCCATCGTGT CCACCATAGC CATCGTGCTG 1080 

ACATTATCCT GCCACATCCT CCTGAGACGC AAACTCACCG CACCCGTGGC ATTTAGTGTG 1140 

ATTGCCATGT TTAATGTAAT GAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 1200 

ATGGCTGAAG CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAGA TAAAAGCCCC 1260 

CCATCTTACA TCACCCAACC AGAAGACCCA GATACTGTCT TGCTTTTAGC AAATGCCACC 1320 

TTGACATGGG AGCATGAAGC CAGCAGGAAA AGTACCCCAA AGAAATTGCA GAACCAGAAA 1380 

AGGCATTTAT GCAAGAAACA GAGGTCAGAG GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440 

GGAGCCACTG GCCCAGAGGA GCAAAGTGAC AG CCTCAAAT CGGTTCTGCA CAGCATAAGC 1500 

TTTGTGGTGA GAAAGTTATG TCGTTATCCC GAAGCCCAGC TCCTGGCTTG GAGGTGGCCA 1560 

GCAGTGTTTG TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTTC TGCTAAAGAC 1620 

AAGGATGAAT CTAGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 1680 

GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAGCTCC 1740 

CTCCTTGCAG CTCTCCTAGG ACAGATGCAG CTGCAGAAAG GGGTGGTGGC AGTCAATGGA 1800 

ACTTTGGCCT ACGTTTCACA GCAGGCATGG ATCTTTCATG GAAATGTGAG AGAAAACATA 1860 

CTCTTTGGAG AAAAGTATGA TCACCAAAGG TATCAGCACA CAGTCCGCGT CTGTGGCCTC 1920 

CAGAAGGACC TGAGCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GCGGGGCCTC 1980 

AACCTCTCTG GGGGGCAGAG GCAGAGGATT AGCCTGGCCC GCGCTGTCTA CTCCGACCGT 2040 

CAGCTCTACC TGCTGGACGA CCCCCTGTCG GCCGTGGACG CCCACGTGGG GAAGCACGTC 2100 

TTTGAGGAGT GCATTAAGAA GACGCTCAGG GGAAAGACAG TCGTCCTGGT GACCCACCAG 2160 

CTACAGTTCT TAGAGTCTTG TGATGAAGTT ATTTTATTAG AAGATGGAGA GATTTGTGAA 2220 

AAGGGAACCC ACAAGGAGTT AATGGAGGAG AGAGGGCGCT ATGCAAAACT GATTCACAAC 2280 

CTGCGAGGAT TGCAGTTCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT GGTGGAAGCC 2340 

TTCAAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTTTCT 2400 

CTCTTCACTG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460 

CTGGGTCTCT GGTTGGACAA GGGCTCACGG ATGACCTGTG GGCCCCAGGG CAACAGGACC 2520 

ATGTGTGAGG TCGGCGCGGT GCTGGCAGAC ATCGGTCAGC ATGTGTACCA GTGGGTGTAC 2580 

ACTGCAAGCA TGGTGTTCAT GCTGGTGTTT GGCGTCACCA AAGGCTTCGT CTTCACCAAG 2640 
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ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700 

CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACCGTTT TTCCAAGGAT 2760 

ATGGACGAGC TGGATGTGAG GCTGCCGTTT CACGCAGAGA ACTTTCTGCA GCAGTTTTTT 2820 

ATGGTGGTGT TTATTCTCGT GATCTTGGCT GCTGTGTTTC CTGCTGTCCT TTTAGTCGTG 2880 

GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACGCATTT TCCACAGAGG AGTCCAGGAG 2940 

CTCAAGAAGG TGGAGAATGT CAGCCGGTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 3000 

CAGGGCCTGG GCATCATTCA CGCCTATGGC AAGAAGGAGA GCTGCATCAC CTATACTTCA 3060 

TCCAAAGGCC TGTCATTGTC ATACATCATC CAGCTGAGCG GACTGCTCCA AGTGTGTGTG 3120 

CGAACGGGAA CAGAGACGCA AGCCAAATTC ACCTCCGTGG AGCTGCTCAG GGAATACATT 3180 

TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG GGACCTGTCC CAAGGACTGG 3240 

CCCAGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATGA GATACAGAGA CAACACCCCC 3300 

CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAOTGGGC AGACAGTCGG GATTGTTGGA 3360 

AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT GGAGCCAGCC 3420 

AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGACCTCAGA 3480 

ACCAAGCTGA CTGTGATCCC ACAGGATCCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3540 

TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGGTTCTGGA GAGAACATTC 3600 

ATGAGAGACA CAATAATGAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660 

GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG CCCGAGCTCT TCTCCGTAAT 3720 

TCAAAGATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG ACTCCAAGAC TGACACCCTG 3780 

GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT CGCCCACCGC 3840 

CTCAACACAG TTCTCAACTG CGATCACGTC CTGGTTATGG AAAATGGGAA GGTGATTGAG 3 900 

TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATTCTG CATTTGCGAT GTTACTAGCA 3960 
GCAGAAGTCA GATTGTAG 

Seq ID NO i 513 Protein sequence 
Protein Accession #: Eos sequence 

1 11 ' 21 31 41 51 

I I I I I I 

MVGEGPYLIS DLDQRGRRRS FAERYDPSLK TMIPVRPCAR LAPNPVDDAG LLSFATFSWL 60 

TPVMVKGYRQ RLTVDTLPPL STYDSSDTNA KRFRVLWDEE VARVGPEKAS LSHWWKFQR 120 

TRVLMDIVAN ILCIIMAAIG PTVLIHQILQ QTERTSGKVW VGIGLCXALF ATEFTKVFFW 180 

ALAWAINYRT AIRLKVALST LVFENLVSFK TLTHISVGEV LNILSSDSYS LFEAALFCPIi 240 

PATIPILMVF CAAYAFFILG PTALIGISVY VIFIPVQMFM AKIiNSAFRRS AILVTDKRVQ 300 

TMNEFLTCIR LIKMYAWEKS FTNTIQDIRR RERKLLEKAG FVQSGNSALA PIVSTIAIVL 360 

TLSCHILLRR KLTAPVAFSV IAMFNVMKFS IAILPFSIKA MAEANVSLRR MKKILIDKSP 420 

PSYITQPEDP DTVLLLANAT LTWEHEASRK STPKKLQNQK RHLCKKQRSE AYSER9PPAK 480 

GATGPEEQSD SLKSVLHSIS FWRKLCRYP EAQLLAWRWP AVFVGRIIRG YRPHGFSAKD 540 

KDESRRIiLTW PQEVDRTQRA AKYLGKILGI CGNVGSGKSS LLAALLGQMQ LQKGWAVNG 600 

TLAYVSQQAW I PHGNVREN I LFGEKYDHQR YQHTVRVCGL QKDLSNLPYG DLTEIGERGL 660 

NLSGGQRQRX S LARA VY SDR QLYLLDDPLS AVDAHVGKHV FEECIKKTLR GKTWLVTHQ 720 

LQFLESCDEV ILLEDGEICE KGTHKELMEE RGRYAKLIHN LRGLQFKDPE HLYNAAMVEA 780 

FKESPAEREE DAGIIGYLIiS LFTVFLFLLM IGSAAFSNWW LGLWLDKGSR MTCGPQGNRT 840 

MCEVGAVLAD IGQHVYQWVY TASMVFMLVF GVTKGFVPTK TTLMASSSLH DTVFDKILKS 900 

PMSFPDTTPT GRLMNRFSKD MDELDVRLPF HAENFIiQQFF MWFIL.VILA AVFPAVLLW 960 

ASLAVGFFIL LRIFHRGVQE LKKVENVSRS PWPTHITSSM OGLGIIHAYG KKESCITYTS 1020 

SKGLSLSYII QLSGLLQVCV RTQTETQAKF TSVELLREYI STCVPECTHP LKVGTCPKDW 1080 

PSCGEITFRD YQMRYRDNTP LVLDSLNLNI QSGQTVGIVG RTGSGKSSLG MALFRLVEPA 1140 

SGTIFIDEVD ICILSLEDLR TKLTVIPQDP VLFVGTVRYN LDPFESHTDE KLWQVLERTF 1200 

MRDTIMKLPE KLQAEVTENG ENFSVGERQL LCVARALliRN SKIILLDEAT ASMDSKTDTL 1260 

VQNTIKDAFK GCTVLTIAHR LNTVLNCDHV LVMENGKVIE FDKPEVLAEK PDSAFAMLLA 1320 
ABVRL 

Seq ID NO i 514 DNA sequence 
Nucleic Acid Accession #: Z31560 
Coding sequence: 1-966 

1 11 21 31 41 51 

I I I 1 I I 

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCXSGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGQAGAA CCCCAAGATG CACAACTCGG AGAT CAG CAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300 

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACAOG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480 

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600 

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACT CGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 780 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 
AAAAA 

Seq ID NO j 515 Protein sequence 
Protein Accession #: CAA83435 

1 11 21 31 41 51 

I I I I I 

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60 

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT 120 

KTLMKKDKYT IiPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180 
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QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 
LGSMGSWKS BASSSPPWT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 
HYQSGPVPGT AINGTLPLSH M 

Seg ID NO: 516 DMA sequence 
Nucleic Acid Accession fh U91618 
Coding sequence: 29. .541 



PCT/US02/12476 



240 
300 



CGGACTTGGC 
CATGCTACTC 
AGCATTAGAA 
TCCCTCTTGG 
AGCTGAGGAA 
TGCTTTAGAT 
TCACAGCAGG 
TGACAAAAAT 
GCTGTATGAG 
AGAGAATAAA 
ATTATATTTG 
ATTGAATGTG 
TCTTCAAAAA 



11 

i 

TTGTTAGAAG 
CTGGCTTTCA 
GCAGATTTCT 
AAGATGACTC 
ACAGGAGAAG 
GGCTTTAGCT 
GCTTTTCAAC 
GGAAAGGAAG 
AATAAACCCA 
TCATTTATTT 
TGTGAAAATG 
TTTTTCTGCA 
AAAAAAAAAA 



21 

I ' 

GCTOAAAGAT 
GCTCCTGGAG 
TGACCAATAT 
TGCTAAATGT 
TTCATGAAGA 
TGGAAGCAAT 
ACTGGGAGTT 
AAGTCATAAA 
OAAGACCCTA 
ACATGTGATT 
TGACAAACAC 
CTAATAGAAA 
AAATGGGGCC 



31 
I 

GATGGCAGGA 
TCTGTGCTCA 
GCATACATCA 
TTGCAGTCTT 
GGAGCTTGTT 
GTTGACAATA 
AATCCAGGAA 
GAGAAAAATT 
CATACTCAAA 
GTGATTCATC 
ACTTATCTGT 
TTAGACTAAG 
GCAATT 



41 
I 

ATGAAAATCC 
GATTCAGAAG 
AAGATTAGTA 
GTAAATAATT 
GCAAGAAGGA 
TACCAGCTCC 
GATATTCTTG 
CCTTATATTC 
AGAGATTCTT 
AT CCCTT AAT 
CTCTTCTACA 
TGTTTTCAAA 



51 
I 

AGCTTGTATG 
AGGAAATGAA 
AAGCACATGT 
TGAACAGCCC 
AACTTCCTAC 
ACAAAATCTG 
ATACTGGAAA 
TGAAACGGCA 
ACTATTACTG 
TAAATATCAA 
ATTGTGGTTT 
TAAATCTAAA 



Seq ID NO: 517 Protein sequence 
Protein Accession ft: AAB50564 



51 



1 11 21 31 41 

11)1)] 
MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALBADPLTN MHTSKISKAH VPSWKMTLLN 
VCSL.VNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 
LIQBDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YIIiKRDSYYY 

Seq ID NO t 518 DNA sequence 

Nucleic Acid Accession #x NM_006536.2 

'Coding sequence: 109.. 2940 



ACCTAAAACC 
ATGTATGCAG 
AGCATTGCAG 
GAACTCCCAT 
ATTGCAATTA 
ATAACTGAAG 
ATAAAGATTT 
T CAT AT G AAA 
TACACCCTAC 
TTCCTACTGA 
GAATGGGCCC 
ATAAATGGGC 
GTGTGTGAAA 
GGATGCACCT 
AGTTTATCTT 
CTACAGAACC 
TTTCACCACA 
GTACAGGCTG 
GCTGACAGAC 
ATTCATACCT 
CACCAAATTA 
TCAGCTAAAA 
AAACTGAATG 
CTTCTTGGCA 
CTGGGTTCAT 
TTCTTTGTTC 
TCTGGAACTG 
AAACCTCACC 
ATGTTTCTAG 
GGACGAAAAT 
TGGATTCCAG 
TCTCTGCAAG 
GCCACTGTGG 
TATGCCAATG 
GAGCCAGAGA 
GTTATAAAAA 
TATAGCTTGA 
CCAGGGAGTC 
GCTCCAAGGA 
AGCTCAGGAG 
CCACCATGCA 
TGGACAGCAC 
AGTAAAAGTC 
AAGCGAAATC 
ACGAATGGAC 
GCAATACGAG 
CCTCTGTTTA 
GGAGTTTTAA 



11 
I 

TTGCAAGTTC 
CAGGCTCAGT 
GTCCTATTTG 
TCCTGGGAGC 
ATCCTCAGGT 
CTTCATTTTA 
TAATACCTGC 
AGGCAAATGT 
AATACAGAGG 
ATGATAACTT 
ACCTCCGTTG 
AAAATCAAAT 
AAGGTCCTTG 
TTATCTACAA 
CTGTGGTTGA 
AGATGTGCAG 
GCTTTCCCAT 
GTGACAAAGT 
TCCTTCAACT 
TCGTGGGCAT 
ACAGCAATGA 
CAGACATCAG 
GAAAAGCTTA 
ATTGCTTACC 
CTGCAGCCCC 
CAGATATATC 
GAGACATTTT 
ATCAATTGAA 
TTACGTGGCA 
ACTACACAAA 
GAACAGCTAA 
CCCTGAAAGT 
AAGCCTTTGT 
TGAAACAGGG 
CTGGAGATCC 
ATGATGGAAT 
AAGTGCATGT 
ATGCTATGTA 
AATCAGTAGG 
GCTCCTTTTC 
AAATTATTGA 
CTGGAGAAGA 
TACAGAATAT 
CTCAGCAAGC 
CTGAACATCA 
CAATGGATAG 
TTCCCCCCAA 
CAGCAATGGG 



21 
I 

AGGAAGAAAC 
GTGAGTGAAC 
CAACCTGAAG 
TGGAGTACAG 
ACCTGAGAAT 
CCTATTTAAT 
CACATGGAAA 
CATAGTGACT 
GTGTGGAAAA 
AACAGCTGGC 
GGGTGTGTTC 
TAAAGTGACA 
CCCCCAAGAA 
TAGCACCCAA 
ATTTTGTAAT 
CCTCAGAAGT 
GAATGGGACT 
GGTCTGTTTA 
ACAACAAGCC 
TGCCAGTTTC 
TGATCGAAAG 
CATTTGTTCA 
TGGCTCTGTG 
CACTGTGCTC 
AAATCTGGAG 
AAACTCCAAT 
CCAGCAACAT 
AAACACAGTG 
GGCCAGTGGT 
TAATTTTATC 
GCCTGGGCAC 
GACAGTGACC 
GGAAAGAGAC 
ATTTTATCCC 
TGTTACGCTG 
TTACTCGAGG 
CAATCACTCT 
TGTACCAGGT 
CAGAAATGAG 
AGTGCTGGGA 
CCTGGAAGCT 
CTTTGATCAG 
CCAAGATGAC 
TGGCATCAGG 
GCCAAATGGA 
GAACTCCTTA 
TTCTGATCCT 
TTTGATAGGA 



31 
I 

CATCTGCATC 
TGGAGGCTTC 
TTTGTGACTC 
CTTCAAGACA 
CAGAACCTCA 
GCTACCAAGA 
GCTAATAATA 
GACTGGTATG 
GAGGGAAAAT 
TACGGATCAC 
GATGAGTATA 
AGGTGTTCAT 
AACTGTATTA 
AATGCAACTG 
GCAAGTACCC 
GCATGGGATG 
GAGCTTCCAC 
GTGCTGGATG 
GCAGAATTTT 
GACAGCAAAG 
TTGCTGGTTT 
GGGCTTAAGA 
ATGATATTAG 
AGCAGTGGTT 
GAATTATCAC 
AGCATGATTG 
ATTCAGCTTG 
ACTGTGGATA 
CCTCCTGAGA 
ACCAATCTAA 
TGGACTTACA 
TCTCGCGCCT 
AGCCTCCATT 
ATTCTTAATG 
AGACTCCTTG 
TATTTTTTCT 
CCCAGCATAA 
TACACAGCAA 
GAGGAGCGAA 
GTTCCAGCTG 
GTAAAAGTAG 
GGCCAGG CTA 
TTTAACAATG 
GAGATATTTA 
GAAACACATG 
CAGTCTGCTG 
GTACCTGCCA 
ATCATTTGCC 



41 

I 

CATATTGAAA 
TCTACAACAT 
TCCTGGTTGC 
ATGGGTATAA 
TCTCAAACAT 
GAAGAGTATT 
ACAGCAAAAT 
GGGCACATGG 
ACATTCATTT 
GAGGCCGAGT 
ACAATGACAA 
CTGACATCAC 
TTAGTAAGCT 
CATCAATAAT 
ACAACCAAGA 
TAATCACAGA 
CTCCTCCCAC 
TGTCCAGCAA 
ATTTGATGCA 
GAGAGATCAG 
CATATCTGCC 
AAGGATTTGA 
TGACCAGCGG 
CAACAATTCA 
GTCTTACAGG 
ATGCTTTCAG 
AAAGTACAGG 
ATACTGTGGG 
TTATATTATT 
CTTTTCGGAC 
CCCTGAACAA 
CCAACTCAGC 
TTCCTCATCC 
CCACTGTCAC 
ATGATGGAGC 
CCTTTGCTGC 
GCACCCCAGC 
ACGGTAATAT 
AGTGGGGCTT 
GCCCCCACCC 
AAGAGGAATT 
CAAGCTATGA 
CTATTTTAGT 
CGTTCTCACC 
AAAGCCACAG 
TATCTAACAT 
GAGATTATCT 
TTATTATAGT 



51 
I 

ACCTGACACA 
GACCCAAAGG 
CTTAAGTTCA 
TGGATTGCTC 
TAAGGAAATG 
TTTCAGAAAT 
AAAACAAGAA 
AGATGATCCA 
CACACCTAAT 
GTTTGTCCAT 
ACCTTTCTAC 
AGGCATTTTT 
TTTTAAAGAA 
GTTCATGCAA 
AGCACCAAAC 
CTCTGCTGAC 
ATTCTCGCTT 
GATGGCAGAG 
GATTGTTGAA 
AGCCCAGCTA 
CACCACTGTA 
GGTGGTTGAA 
AGATGATAAG 
CTCCATTGCC 
AGGTTTAAAG 
TAGAATTTCC 
TGAAAATGTC 
CAACGACACT 
TGATCCTGAT 
AGCTAGTCTT 
TACCCATCAT 
TGTGCCCCCA 
TGTGATGATT 
TGCCACAGTT 
AGGTGCTGAT 
AAATGGTAGA 
CCACTCTATT 
TCAGATGAAT 
TAGCCGAGTC 
TGATGTGTTT 
GACCCTATCT 
AATAAGAATG 
AAATACATCA 
CCAGATTTCC 
AATTTATGTT 
TGCCCAGGCG 
TATATTGAAA 
TGTGACACAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 



382 



WO 02/086443 

CATACTTTAA GCAGGAAAAA GAGAGCAOAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940 

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTOG ACTACAAAAA 3000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 
TACCTAGGAA A 

Seq ZD NO: 519 Protein sequence 
Protein Accession #i NP_006S27.1 

1 11 21 31 41 51 

I I I I I I 

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60 

IKEMITEASP YLFNATKRRV PPRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 

GDDPYTIfQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 

KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LPKEGCTPIY NSTQNATASI 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300 

TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360 

-RAQLHQIN9N DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EWEKLNGKA YGSVMILVTS 420 

GDDKLIiGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAP 480 

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNPITNLTPR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 

AVPPATVEAF VERDSLHPPH PVMIYANVKQ GFYPILNATV TATVEPBTGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFPSFA ANGRYSLKVH VNHSPSISTP AHS IPG SHAM YVPGYTANGN 720 

IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTLSWTAPGE DPDQGQATSY EtRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840 

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 
LILiKGVLiTAM GLIGIICLII WTHHTLSRK KRADKKENGT KLL 

Seq ID NO: 520 DNA sequence 

Nucleic Acid Accession #: NM_000228.1 

Coding sequence i 82 . .3600 

1 11 21 31 41 SI 

I I I I I I 

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60 

GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 120 

CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 180 

CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240 

ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300 

CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 360 

TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

TTCCAGCTTC AAGAAGT CAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480 

GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540 

ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600 

CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660 

ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720 

ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780 

CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840 

GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900 

CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 960 

GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG GCCAGGACGC CCATGAATGC 1020 

CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080 

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200 

GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260 

CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320 

AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440 

CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560 

CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620 

GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680 

TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 1740 

CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800 

CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860 

GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920 

GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG CAAAGAGTAA GATTGAGCAG 1980 

ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040 

GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100 

GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220 

GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280 

GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 2340 

CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400 

ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520 

TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580 

CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640 
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AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGT CAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2680 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940 

CAGGACATTG CGCGTGCCCG CCX3GTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 

AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 

AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 3300 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 

CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 

GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480 

CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGOGGACC TGACAGGACT GGAGAAGCGT 3540 

GTGGAGCAGA TCCGTGACCA CATCAATGGG CGCGTGCTCT ACTATGCCAC CTGCAAGTGA 3600 

TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCCGCC TTTGCTTTTG GTTGGGGGCA 3 660 

GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 3720 

GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780 

GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3840 

CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 3900 

GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG TAAAAATGAA AAGCCAAATA 3960 
AAAATCTTTG G 

Seq ID NO: 521 Protein sequence 
Protein Accession #: NP_000219.1 

1 11 21 31 ' 41 51 

I II I I I 

MRPFFLLCFA LPGLLHAQQA CSRGACYPPV GDLLVGRTRP LRASSTCGLT KPETYCTQYG 60 

EWQMKCCKCD SRQPHNYYSH RVENVASSSG PMRWWQSQND VNPVSLQLDL DRRFQLQEVM 120 

MEFQGPMPAG MLI ERSSDFG KTWRVYQYLA ADCTSTFPRV RQGRPQSWQD VRCQSLPQRP 180 

NARLNGGKVQ LNLMDLVSGI PATQSQKIQE VGEITNLRVN FTRLAPVPQR GYHPPSAYYA 240 

VSQLRLQGSC FCHGHADRCA PKPGASAGPS TAVQVHDVCV CQHNTAGPNC ERCAPFYNNR 300 

PWRPAEGQDA HECQRCDCNG H5ETCHFDPA VPAASQGAYG GVCDNCRDHT EGKNCERCQL 360 

HYFRNRRPGA SIQETCISCE CDPDGAVPGA PCDPVTGQCV CKEHVQGERC DLCKPGFTGL 420 

TYANPQGCHR CDCNXLGSRR DMPCDEESGR CLCLPNWGP KCDQCAPYHW KLASGOGCEP 480 

CACDPHNSPQ PTVQPVHRAV PCREGFGGLM CSAAAIRQCP DRTYGDVATG CRACDCDFRG 540 

TEGPGCDKAS GRCLCRPGLT GPRCDQCQRG YCNRYPVCVA CHPCFQTYDA DLREQALRFG 600 

RLRNATASLW SGPGLEDRGL ASRILDAKSK IEQIRAVLSS PAVTEQEVAQ VASAILSLRR 660 

TLQGLQLDLP LEEETLSLPR DLESLDRSFN GLLTMYQRKR EQFEKISSAD PSGAFRMLST 720 

AYEQSAQAAQ QVSDSSRLLD QLROSRREAE RLVRQAGGGG GTGSPKLVAL RLEMSSLPDL 780 

TPTFNKLCGN SRQMACTPIS CPGELCPQDN GTACGSRCRG VLPRAGGAFL MAGQVAEQLR 840 

GFNAQLQRTR QMIRAAEESA SQIQSSAQRL ETQVSASRSQ MEEDVRRTRL LIQQVRDFLT 900 

DPDTDAATIQ EVSEAVLALW LPTDSATVLQ KMNEIQAIAA RLPNVDLVLS QTKQDIARAR 960 

RLQAEAEEAR SRAHAVEGQV EDWGNLRQG TVALQEAQDT MQGTSRSLRL IQDRVAEVQQ 1020 

VLRPAEKLVT SMTKQLGDFW TRMEELRHQA RQQGAEAVQA QQLAEGASEQ ALSAQEGFER 1080 

IKQKYAELKD RLGQSSMLGE QGARIQSVKT EAEELFGETM EMMDRMKDME LELLRGSQAI 1140 
MLRSADLTGL EKRVEQIRDH INGRVLYYAT CK 

Seq ID NO: 522 DNA sequence 

Nucleic Acid Accession #: NM_001944.1 

Coding sequence: 84.. 3083 

1 11 21 31 41 51 

I I I I I I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG * 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTOGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTAT AT CCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTT CAAT GACAATTGTC . 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 
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GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC QCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 252 0 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGT CCGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2 940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CT AAAAT CAT ATTCGC 

Seq ID NO: 523 Protein sequence 
Protein Accession fh NP_ 00193S.1 

1 11 21 31 41 51 

1. 1 I I I I 

MMGLFPRTTG ALAIPWVIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE WVKPAKPCRE 60 

GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF WDKNTGDIN ITAIVDREET 120 

PSPIjITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180 

ATDADEPNHL NSKIAFKIVS QEPAGTPMFI* LSRNTGEVRT LTNSLDREQA SSYRLWSGA 240 

DKDGEGLSTQ CECNIKVKDV NDNFPMPRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN 300 

WLAVYFFTSG NEGNWFEIQT DPRTNEGILK WKALDYEQL QSVKLSIAVK NKAEFHQSVI 360 

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480 

VYVRVPDFND NCPTAVUEKD AVCSSSPSW VSARTLNNRY TGPYTFALED QPVKLPAVWS 540 

ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 

TTSPGTRYGR PHSGRLGPAA IGLL.LLGLLL LLLAFLLLLT CDCGAGSTGG VTGGFIPVPD 660 

GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS BVCTNTYARG TAVEGTSGME 720 

MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGIC9S GQSGTMRTRH STGGTNKDYA 780 

DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGS PVGS VGCCSFIADD 840 

LDDSFIiDSLQ PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 900 

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

Seq ID NOt 524 DNA sequence 

Nucleic Acid Accession #t XM_058069.2 

Coding sequence: 1..1413 

1 11 21 31 41 51 

I I I I I I 

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60 

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120 

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180 

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240 

ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300 

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360 

TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420 

TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480 

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540 

CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600 

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660 

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720 

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780 

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840 

CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900 

TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 960 

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 1020 

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 1080 

GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140 

GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200 

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380 
ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 

Seq id NOt 525 Protein sequence 
Protein Accession #: P39900 

1 11 21 31 41 51 

I I I I" I I 

MKFliLIIjIiLQ ATASGALPLN SSTSIiEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHP LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 
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YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WPARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GH5LGLGHSS DPKAVMFPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLPKDD KYWLISNLRP 360 

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 

Seq ID NO: 526 DNA sequence 

Nucleic Acid Accession #: NM_024423.1 

Coding sequence: 64.. 2590 

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAQ CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 42 0 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

. CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATOAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAG CACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 13 80 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGOGG IS 00 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

AT AG CACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2580 

TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAATCAG AATGAAGACC GCATGCCATC 2640 

CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 

CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 

ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 2820 

TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 

TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 

AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000 

TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3060 

GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 

ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 

AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 

ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 

TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 

GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 

AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 

GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCAT T ATAA CTGAGTCTAT 3540 

GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 

TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 

CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 

GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 

ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 

TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 390 0 

AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 

ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 

TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 

AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 

CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 

CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 

AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 

TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 

AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 
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CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT T6AGTCCGGT GAGGGATCAG 4500 

CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 

TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620 

AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 

CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 

TGACCCTAAA AT CT ATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 

TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 

CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 

GACTAGAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 

TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220 

GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

T TTTCTT ACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5 760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820 

CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940 

TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000 

TTTAGAGATT . AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 6060 

GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 6120 

TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6360 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA 6540 

AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 
CATATATATA ATCCCGAAAC ATG 

Seq ID NO i 527 Protein sequence 
Protein Accession #: NP_077741.1 

1 11 21 31 41 51 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VfcliEHQKKVS 120 

KTRHTRETVL RRAKRRWAP I PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVbESSRPQT TVGWCATDR DEPDTMHTRI* KYSILQQTPR SPGIiFSVHPS 300 

TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 3 60 

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSW 420 

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL A1DKDDRSCT GTLAVNI EDV NDNPPEILQE YWICKPKMG YTDILAVDPD 600 

EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLiTL VCGVFGATKG 720 

KRFPEDLAQQ NIiIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQBTIE 780 
MMKGGNQTIiE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG 

Seq ZD NO: 528 DNA sequence 

Nucleic Acid Accession #: NM_001941.2 

Coding sequence: 64.. 2754 

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTQAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 



387 



WO 02/086443 

AGCACAGG CG TAATCACCAC AGTCTCTCAT 
TCATTGATAA TGAAAGTACA AGACATGGAT 
ACTTGTATCA TAACAGTAAC AGATTCAAAT 
TATGAAGCAT TTGTAGAGGA AAATGCATTC 
GATAAGGATT TAATTAACAC TGCCAATTGG 
GAAAATGGAC ATTTCAAAAT CAGCACAGAC 
GTAAAGCCAC TGAATTATGA AGAAAACCGT 
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT 
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG 
AATAGAAATG GCAATGGTTT AAGGTACAAA 
ATTGATGAAA TTTCAGGGTC AATCATAACT 
CCCAAAAATG AGTTGTATAA TATTACAGTC 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT 
GAATATGTAG TCATTTGCAA ACCAAAAATG 
GATGAACCTG TCCATGGAGC TCCATTTTAT 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT 
AATGCTGGAT TTCAAGAATA TACCATTCCT 
GCAACAAAAT TATTGAGAGT TAATCTGTGT 
ACTTCAAGGA GTACAGGAGT AATACTTGGA 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA 
GAAATGATGA AAGGAGGAAA CCAGACCTTG 
ACCCTGGACT CCTOCAGGGG AGGACACACG 
GAGTGGCACA GTTTTACTCA ACCCCGTCTC 
GAAGACCGCA TGCCATCCCA AGATTATGTC 
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA 
AATAATTTGG AACCCAAATT TATTACATTA 
AGTGCTACAA TTAGGTCTTT GTCAGACATT 
TTCAATTTCA ACATGTATGT ATATGATGAT 
CCAATTTATA TTTTTAAAGC CAGTTGTTGC 
AACAGACAAC TGGTAAATCT CAAACTCCAG 
TCTTTTTTTT TTTTACGGAT ATTTTAGTAA 
AT AG CTAAGT TATGCTAATA TCACATTATT 
ATAAACAAGA AATATTGAGT ATCACTATGT 
ACTGAATTAA ATTAAAAATG TTGCAGCTCA 
ACCAAATTCA TTTGACTTTG GAGGCAAAAT 
TCTATAGGAA TATAGTTGGA AATAAATGTG 
ATTTAAAATG AAATGAGAAC AAAGAGGAAA 
TAGTTTGTCC TACAATAGAA AAAAGAGAGA 
ATTATAACTG AGTCTATGAG GAAATAGTTC 
GTAAATAAAT TAAACTTTTC TGGTTTCTGT 
TAGCTTTGCT TTGCAGTCTG TTTCAAGATT 
GAATACTCGC TGCAGCTGGG GTTCCCTGCT 
TTTTTTTCGG GGAGCTAATA ACAAAAACAT 
CTCTATTGCT GTTTCTATTC TCTCTTATAG 
TAACCATGTC CTCCTAGAGT TTAGAGGCTA 
GCACCCTGGG GAGATTGATT GTCCTTAAAC 
GTCTGGGAGC TACAAAATTT CATTTTTCTC 
CTGAATCAAG GAAAGCCAGG CCTTGTGGGC 
ACCTCCAGCA GAGATTCCCT TAAGTGACTC 
AATTTTTAAT CAGTTTGCTT TCTCCAGAGA 
TTGAATGTAT AAAAGAAAAA GATCAAGTTG 
AAGCAGCCCA AGTAGGTTAT TTGTACAGTC 
GGGCAAGGAG AGGCCACAAG GAATATGGGT 
CTTTTTCCTA GGCTTGGCAC TOCCTTTTCC 
GTCCGGTGAG GGATCAGCCA ACCTCTTCTC 
AAGGAGACAG AGCTGACTGC ATGATGAGTC 
GTTGTGCAGA ACAAACAAGG CATTCATGGG 
CTGGGCACTA AGAAGGTCTA TGAATTAAAT 
TTTTCTGTTT TCTAATTTGA CCCTAAAATC 
CCCCCCCCCC TTTTTTTTTG AGACGGAGTC 
GCTCCGATCT CTGCTCACTG AAAGCTCCGC 
GCCTCCTGAG TAGCTGGGAC TACAGGCGCC 
TTTAATAGAG ACGGGGTTTC ACTGTGTTAG 
ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG 
CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT 
TGATCATACG AATTGGATCA ATCTTGAAAT 
GGGAGAAAGA ACTCAGGGCA CAAAATATTG 
TTGCTGAAAT TTCCTGCTGT AACCAGAAGC 
ACTGTGTTTT GCTCACTCCC TCACTCACCG 
CTAGTGCCGA TAAACTTTCT CAAAGAGCAA 
TAACCATCTC TTTGTTCTTT GAACATGCTG 
TTTGTAATTC TTTTCTCTCA AATGAAAATT 
CATATGTAGT ATTATTATTT CCTTATATGT 
CAAGAAAATA TATTTTTAAA GCTTTCATTT 
TGTAAATATA CAGAATGTTT TTTCTTACTT 
GGGGTTTGTT TTGCAATGTT TTAAACAGAG 
TGCTTTTAAA GAAACTTGGC TGCTTAAAAT 
TACAGATGTG GGGAGATGTA ATAAAACAAT 
TAGAGATTAA ATAATTCTAA GATGATCACT 
AATAGAAATA CTCAATTATG TCTTTGTTGT 
ATTATCAAAT TGTCGACATC ATTAATATAT 
GAAGCACAGC TTTACAGATG AGTATCTATG 
ATTAAAAGTA TTAGAAGGTG GTTATAATTG 



TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GTGACAG CCT TGAACAGAGC CTTGGTTACA 1440 

GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GGTGAAAAAT . TGCATCGATG TAATCAGAAT 2580 

CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 

AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 

GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 

CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 

TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 

TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 

CACTGGAATT AAGGT CTCTA AAGCATCTGC 3000 

TAAATATGCT GGATAAATAT TAGTCCAACA 3060 

ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 

GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 

TAAAGAATTG GGACTCACCC CTACTGCACT 3240 

GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 

TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 

ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 

CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 

GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 

TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 

TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 

TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 

TGAC CAACAT CTTTTTAATT TAGATCCAAA 3840 

GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3 900 

CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 

CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 

CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 

CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 

AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 

TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 

AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 

GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 

TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 

TATGGCTCAC CTTATTTGGA GTGAGAAAT C 4500 

TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 

AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 

GCCTATCTAA AATTCTGATT TATTCCTACA 4680 

T ATGTGTT TT AGACTTAGAC TTTTTATTGC 4740 

TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 

CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 

CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 

CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 

GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 

AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 

ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 

GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 

CAGTTTTATC TAACGGCTAC TGAAACACCC 5280 

ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 

CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 

AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 

TAATTTTAGG GATTCATTTC TATATTTTCA 5520 

GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 

TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 

TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 

TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 

AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 

ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 

TTGCAAAATT ATGCTTATGG CTGGCATGGA S940 

ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 

ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 

ATACATATGT ATAATAAATT TTGATCGGGT 6120 

CAGAGTATTC CATGAATAGT ACACTGACAC 6180 
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AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC 
GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA 
CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT 
AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA 
AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT 
CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT 
ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT 
TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA 
TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA 
GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT 
ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT 
GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT 
TATATATAAT CCCGAAACAT G 



Seq ID NO: 529 Protein sequence 
Protein Accession Jh NP_001932.1 



1 
I 

MAAAGPRRSV 
ADLIRSSDPD 
KTRHTRETVL 
EPLNLFYIER 
PVFTEAIYNF 
TGVITTVSHY 
EAPVEBNAPN 
KPLNYEENRQ 
KENIAVGSKI 
KNELYNITVL 
EPVHGAPFYP 
TKLLRVNLCE 
KRFPEDLAQQ 
MMKGGNQTIiE 
DRMPSQDYVL 



11 

I 

RGAVCLHIjLL 
FRVLNDGSVY 
RRAKRRWAPI 
DTGNLFCTRP 
EVLESSRPGT 
LDREWDKYS 
VEILRIPIED 
VNLEIGVNNE 
NGYKAYD PEN 
AIDKDDRSCT 
SLPNTSPEIS 
CTHPTQCRAT 
NLIISNTEAP 
SCRGAGHHHT 
TYNYEGRGSP 



21 
I 

TLVIFSRDGE 
TARAVAL SDK 
PCSMQENSLG 
VDREEYDVFD 
TVGWCATDR 
LIMKVQDMDG 
KDLINTANWR 
APPARDIPRV 
RNGNGLRYKK 
GTLAVNIEDV 
RLWSLTKVND 
SRSTGVILGK 
GDDRVCSANG 
LDSCRGGHTE 
AGSVGCCSEK 



31 

I 

ACKKVILNVP 
KRSFTIWLSD 
PFPLFLQQVE 
liIAYASTADG 
DEPDTMHTRL 
QFFGLIGTST 
VNFTILKGNE 
TALNRALVTV 
LHDPKGWITI 
NDNPPEILQE 
TAARLSYQKN 
WA I LAI ZjLG X 
FMTQTTNNSS 
VDNCRYTYSE 
QEEDGLDFLN 



Seq ID NO i 530 DNA sequence 

Nucleic Acid Accession #: NM_016583.2 

Coding sequence : 72 . . 842 



GGAGTGGGGG 
TAAGAGCAAA 
CCATGGCCCA 
ATCCAGCCCT 
ATGGCCTGCT 
TGAAGCCTGG 
CAGTGATTCC 
AACTTGGCCT 
TAAAGCTCCA 
TGGACATCAC 
TTGGTGACTG 
CCCTCCCCAT 
AGTTGGTTCA 
CCCTGGTGCA 
AAGCCTTCCA 
GCCCATGTGC 
TCCCACCAGG 
AAAAAAAAAA 



11 
I 

AGAGAGAGGA 
GATGTTTCAA 
GTTTGGAGGC 
GCCCTTGAGT 
GTCTGGGGGC 
AGGAGGTACT 
TGGCCTGAAC 
TGTGCAGAGC 
AGTGAATACG 
TGCAGAAATC 
CACCCATTCC 
TCAAGGTCTT 
GGQCAACGTG 
TGACATTGTT 
GGAAGGGGCT 
TGGAAGATGA 
CGTGTGTAAC 
AAAAAAAAAA 



21 
I 

GACCAGGACA 
ACTGGGGGCC 
CTGCCCGTGC 
CCCACAGGTC 
CTGTTGGGCA 
TCTGGTGGCC 
AACATCATTG 
CCTGATGGCC 
CCCCTGGTCG 
TTAGCTGTGA 
CCTGGAAGCC 
CTGGACAGCC 
TGCCCTCTGG 
AACATGCTGA 
GGCCTCTGCT 
CACAGTTGCC 
ATCCCATGTG 
AAAAAAAAA 



31 
I 

GCTGCTGAGA 
TCATTGTCTT 
CCCTGGACCA 
TTGCAGGAAG 
TTCTGGAAAA 
TCCTTGGGGG 
ACATAAAGGT 
ACCGTCTCTA 
GTGCAAGTCT 
GAGATAAGCA 
TGCAAATTTC 
TCACAGGGAT 
TCAATGAGGT 
TCCACGGACT 
GAGCTGCTTC 
TTCTCTCCGA 
CCTCACCTAA 



ATGAGTTAAA 
TAATGCCTGA 
GAATCTTTTC 
AATCTATTAG 
CCAGTAACTT 
TTTGTTTGTA 
GAATATTGGG 
ATCCTGGATC 
GCTTTCTCAT 
AGTTAATATA 
GAGGTAGTTG 
AGGGAAATAA 



41 

I 

SKLEADKIIG 
KRKQTQKEVT 
SDAAQNYTVF 
YSADLPLPLP 
KYSILQQTPR 
CIITVTDSND 
NGHFK I STDK 
HVRDLDEGPE 
DEISGSIITS 
YWICKPKMG 
AGFQEYTIPI 
ALLFSVLLTL 
QGFCGTMGSG 
WHSFTQPRLG 
NLEPKFITLA 



41 
I 

CCTCTAAGAA 
CTACGGGCTG 
GACCCTGCCC 
CTTGACAAAT 
CCTTCCGCTC 
ACTGCTTGGA 
CACTGACCCC 
TGTCACCATC 
GTTGAGGCTG 
GGAGAGGATC 
TCTGCTTGAT 
CTTGAATAAA 
TCTCAGAGGC 
ACAGTTTGTC 
CCAGTGCTCA 
GGAACCTGCC 
TAAAATGGCT 



AAGAAAAGCA 
ACTTTAATGA 
CCCATGCAGG 
CTATATCAAA 
CTATTGTAAC 
ATAGTAAAAT 
TCATAATGGT 
TGTCACTTAC 
TAAAATCAAT 
CCTAAAGTAC 
GTAAAATTAT 
AGTTTGTGCA 



6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
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51 
I 

RVNLEECFRS 
VLLEHQKKVS 
YSISGRGVDK 
IRVEDENDNH 
SPGliFSVKPS 
NAPTFRQNAY 
ETNEGVLSW 
CTPAAQYVRI 
KILDREVETP 
YTDILAVDPD 
TVKDRAGQAA 
VCGVFGATKG 
MKNGGQETXE 
EKLHRCNQNE 
EACTKR 



51 
I 

GTCCAGATAC 
TTAGCCCAGA 
TTGAATGTGA 
GCCCTCAGCA 
CTGGACATCC 
AAAGTGACGT 
CAGCTGCTGG 
CCTCTCGGCA 
GCTGTGAAGC 
CACCTGGTCC 
GGACTTGGCC 
GTCCTGCCTG 
TTGGACATCA 
ATCAAGGTCT 
CAGATGGCTG 
CCCTCTCCTT 
CTTCTTCTGC 



Seq ID NO» 531 Protein sequence 
Protein Accession #: NP_057667.1 

11 



51 



21 31 41 

| I I I I I 

MFQTGGIjIVF YGLLAQTMAQ FGGIjPVPIiDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 

SGGLLGILEN LPLLDILKPG GGT SGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 

VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 

THSPGSLQIS IiLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 
DXVNMLIHGIi QFVIKV 



Seq ID NO: 532 DNA sequence 

Nucleic Acid Accession 8: NM_004363.1 

Coding sequence: 115.. 2223 



l 
I 

CTCAGGGCAG 
TCCTGGAACT 
TCTCCCTCGG 
TCACTTCTAA 
TTCAATGTCG 
TTTGGCTACA 
GTAATAGGAA 
CCCAATGCAT 



11 
I 

AGGGAGGAAG 
CAAGCTCTTC 
CCCCTCCCCA 
CCTTCTGGAA 
CAGAGGGGAA 
GCTGGTACAA 
CTCAACAAGC 
CCCTGCTGAT 



21 

i 

GACAGCAGAC 
TCCACAGAGG 
CAGATGGTGC 
CCCGCCCACC 
GGAGGTGCTT 
AGGTGAAAGA 
TACCCCAGGG 
CCAGAACATC 



31 
I 

CAGACAGTCA 
AGGACAGAGC 
ATCCCCTGGC 
ACTGCCAAGC 
CTACTTGTCC 
GTGGATGGCA 
CCCGCATACA 
ATCCAGAATG 



41 
I 

CAGCAGCCTT 
AGACAGCAGA 
AGAGGCTCCT 
TCACTATTGA 
ACAATCTGCC 
ACCGTCAAAT 
GTGGTCGAGA 
ACACAGGATT 



51 
I 

GACAAAACGT 
GACCATGGAG 
GCTCACAGCC 
ATCCACGCCG 
CCAGCATCTT 
TATAGGATAT 
GATAATATAC 
CTACACCCTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



60 
120 
180 
240 



60 
120 
160 
240 
300 
360 
420 
480 



389 
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CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540 

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020 

AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 1500 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1300 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 1920 

TCGTCTTACC TTTOGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTAT C 2040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC CAGAG AT CGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

ACTGCACTCC AGTCTGGCAA CAGAG CAAG A CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 

Seq ID NO i 533 Protein sequence 
Protein Accession #i NP_004354.1 

1 11 21 31 41 51 

I ! I I I I 

MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ 60 

HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI IYPNASLLIQ NIIQNDTGFY 120 

TLHV1 KSDLV NEEATGQFRV YPELPKFSIS SNNSKPVEDK DAVAPTCEPE TQDATYLWWV 180 

NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP 240 

TISPLNTSYR SGBNLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 300 

AHNSDTGLNR TTVTTITVYA EPPKPPITSN NSNPVEDEDA VALTCEPE1Q NTTYLWWVNN 360 

QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPODPTI 420 

SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL IDGNIQQHTQ ELFISNITEK NSGLYTCQAN 480 

NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS 540 

LPVSPRLQLS NGNRTLTLiFN VTRKDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 600 

PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN GTYACFVSNI, 660 
ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA LI 

Seq ID NO: 534 DMA sequence 

Nucleic Acid Accession #: NM_006952.1 

Coding sequence i 11.. 7 93 

1 11 21 31 41 51 

I I I I I I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 

Seq ID NO* 535 Protein sequence 
Protein Accession #i NP_008883.1 

1 11 21 31 41 51 

I I I I ) 1 



390 



WO 02/086443 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ 
IGIPVGICLP CLSVLGIVGI MKSSRKILLA YFILMFIVYA 
FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDKCC 
DADYPWPRQC CVMNNLKEP1* NLEACKLGVP GFYHNQGCYB 
LCWTFWVLLG TMFYWSRIEY 

Seq ID NO: 536 DNA sequence 
Nucleic Acid Accession #: NMJ302638.1 
Coding sequence i 120.. 473 

1 11 21 31 41 51 

I 1 I i I I 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC COGCCCTGGA GCCAGGCCAA 50 

GCTGGACTGC ATAAAGATTQ GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 

TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAQ 180 

AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 

TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 

CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 

TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 

TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480 

CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC S40 

TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 

Seq ID NO i 537 Protein sequence 
Protein Accession #: NP_002629,1 

1 11 21 31 41 51 

I I 1 I .1 I 

MRASSFLIW VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVXGQDKVX 60 
AQEPVKGPV9 TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 538 DNA sequence 

Nucleic Acid Accession #» NM_001793.2 

Coding sequence: 71., 2560 

1 11 21 31 41 51 

I 1 1 I I I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 

CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

AGACACCAAG ATTTTCTACA GCATGACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTG CCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 

CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 12 00 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTT TGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380 

ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 

CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGT CAACG A I860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 

GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2 640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 

TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 



HSLYPLLEAT DNDD I YGAAW 60 

FEVASCITAA TQRDFFTPNL 120 

GVNGPSDWQK YTSAFRTENN 180 

LISGPMNRHA WGVAWFGPAI 240 



391 



5 
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TTTTTTTAAT GCTATCTTCA AAACGTTAGA OAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 
GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 
TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 
GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 
TAAAGAAACT TTTCCCAGAA AAAAA 



PCTYUS02/12476 



3000 
3060 
3120 
3180 



Seq ID NO: S3 9 Protein sequence 
Protein Accession #« NP_001784.2 



1 
I 

MGLPRGPLAS 
QEPALFSTDN 
KGPFPQRLNQ 
YELFGHAVSE 
DEDDAIYTYN 
TDMDGDGST T 
AWRATYLIMG 
PTSTATIWH 
DPAGWLAMDP 
VNDHGPVPEP 
TWLSLKKFL 
GAVLAXjLFLL 
GLEARPEWL 
DYEGSGSDAA 



11 

I 

LLLJjQVCWLQ 
DDFTVRNGET 
LKSNKDRDTK 
NGASVEDPMN 
GWAYSIHSQ 
TAVAWEILD 
GDDGDHFTIT 
VEDVNEAPVF 
DSGQVTAVGT 
RQIT I CNQSP 
KQDTYDVHLS 
LVLLLLVRKK 
RNDVAPTIIP 
SLSSLTSSAS 



21 
1 

CAASEPCRAV 
VQERRSLKER 
IFYSITGPGA 
ISIIVTDQND 
EPKDPHDLMF 
AND NAP MF DP 
THPESNQGIL 
VPPSKWEVQ 
LDREDEQFVR 
VRQVLNITDK 
LSDHGNKEQL 
RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYLN 



31 
I 

FREAEVTLEA 
NPLKIFPSKR 
DSPPEGVFAV 
HKPKFTQDTF 
TIHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
EGIPTGBPVC 
NNIYEVMVLA 
DLSPHTSPFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNFIIE 
EWGSRFKKLA 



.Seq ID NO: 540 DNA sequence . 
Nucleic Acid Accession Eos sequence 
Coding sequence i 1 . . 672 



1 
I 

ATGAGGCTCC 
CGGGGCTCCC 
AAGGGCGGGG 
CTGCTCGCCT 
GCGAGACAAC 
TGTCATGTTT 
ACAGAGCCAT 
AAGCAGTGCT 
CTCCTGGAAG 
TTAGAGGGGC 
AGCTGTGGTG 
AGCCTGTCTT 



11 
I 

AAAGACCCCG 
CCTACCGGCC 
AGGGGGCGCC 
TGCTGCTGGT 
GAGATCCAGA 
GTGAGAGAGA 
ACTGCGTTAT 
CCGCTGGTTG 
AGCCCATGCC 
CACCTATCAA 
GGCTGTGGCT 
GA 



21 
I 

ACAGGCCCCG 
AGACCCGGGG 
GCGCGCTGAC 
CGTGGCCCTA 
GGACTCCCAG 
AAACACTTTC 
AGCGGCCGTG 
TGCAGCGATG 
CTTCTTTTAC 
CTCATCAGTG 
GGCCATCCTC 



31 
] 

GCGGGTGGGA 
AGAGGCGCGC 
CCTCCCTGGG 
CCGCGGGTGT 
CGAACGGACG 
GAGTGCCAGA 
AAAATATTTC 
GAGAGACCCA 
CTCAAGTGTT 
TTCAAAGAAT 
CTGCTGCTGG 



41 

I 

GGAEQEPGQA 
ILRRHKRDWV 
EKETGWLLLN 
RG3VLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNQHTLYVEV 
VYTAEDPDKE 
MDNGSPPTTG 
AQLTDDSDIY 
HGHVETCPGP 
YGEEGGGEED 
NLKAANTDPT 
DMYGGGEDD 



41 
I 

GGCGGGCGCC 
GGAGGCTGOG 
CACCGCTGGG 
GGACAGACGC 
AGGGTGACAA 
ACCCAAGGAG 
CACGTTTTTT 
AGCCAGAGGA 
GTAAAATTCG 
ATGCTGGGAG 
CCTCCATTGC 



51 
I 

LGKVFMGCPG 
VAPISVPENG 
KPLDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
VTDLDAPNSP 
TNEAPFVLKL 
NQKISYRILR 
TGTLLLTLID 
WTAEVNEEGD 
WKGGFILPVL 
QDYDITQLHR 
APPYDTLLVF 



51 
I 

CCGGGGCGGG 
AAGGTTCCAG 
GACGATGGCG 
CAACCTGACT 
TAGAGTGTGG 
GTGCAAATGG 
CATGGTTGCG 
GAAGCGGTTT 
CTACTGCAAT 
CATGGGTGAG 
AGCCGGCCTC 



Seq ID NO: 541 Protein sequence 
Protein Accession #: Eos sequence 

l 11 21 31 41 51 

I I I I I I 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 
LLALLLWAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRW CHVCERENTF ECQNPRRCKW 
TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 
LBGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS 

Seq ID NO: 542 DNA sequence 

Nucleic Acid Accession ft: XM_03S292.2 

Coding sequence: 53 .. 1576 



GCTCGCTGGG 
TGCGGGCCCG 
GGAGAAGATG 
CGTGACCCTG 
TATCGGCTCG 
GCTGGCGCTG 
CGCGGAGCTC 
CTACGGCTCG 
ATCGCAGTAC 
CTGCCCGGTG 
GGCCGTGAAC 
CAAGCTCCTG 
TGTGTCCAAT 
TGTGCTGGCA 
CACAGAGGAA 
CATCGTGACG 
GCAGATGCTG 
GTCCTGGATC 
GTTCACATCC 
CTCCATGATC 
GACGCTGCTC 
CAACTGGCTC 
TGAGCTTGAG 
CCTCTTCCTG 
CATCATCCTC 
GTGGCTCCTC 



11 

I 

CCGCGGCTCC 
AAGCGGCGCG 
CTGGCCGCCA 
CAGCGGAACA 
GGCATCTTCG 
GTGGTGTGGG 
GGCACCACCA 
CTGCCCGCCT 
ATCGTGGCCC 
OCCGAGGAGG 
TGCTACAGCG 
GCCCTGGCCC 
CTAGATCCCA 
TTATACAGCG 
ATGATCAACC 
CTGGTGTACG 
TCGTCCGAGG 
ATCCCCGTCT 
TCCAGGCTCT 
CACCCACAGC 
TACGCCTTCT 
TGCGTGGCCC 
CGGCCCATCA 
ATCGCCGTCT 
AGCGGGCTGC 
CAGGGCATCT 



21 
I 

CGGGTGTCCC 
CGCTAGCGGC 
AGAGCGCGGA 
TCAOGCTGCT 
TGAOGCCCAC 
CCGCGTGCGG 
TCTCCAAATC 
TCCTCAAGCT 
TGGTCTTCGC 
CAGCCAAGCT 
TGAAGGCCGC 
TGATCATCCT 
ACTTCTCATT 
GCCTCTTTGC 
CCTACAGAAA 
TGCTGACCAA 
CCGTGGCCGT 
TCGTGGGCCT 
TCTTCGTGGG 
TCCTCACCCC 
CCAAGGACAT 
TGGCCATCAT 
AGGTGAACCT 
CCTTCTGGAA 
CCGTCTACTT 
TCTCCACGAC 



31 

I 

AGGCCCGGCC 
GCCGGCGGCC 
CGGCTCGGCG 
CAACGGCGTG 
GGGCGTGCTC 
CGTCTTCTCC 
GGGCGGCGAC 
CTGGATCGAG 
CACCTACCTG 
CGTGGCCTGC 
CACCCGGGTC 
GCTGGGCTTC 
TGAAGGCACC 
CTATGGAGGA 
CCTGCCCCTG 
CCTGGCCTAC 
GGACTTCGGG 
GTCCTGCTTC 
GTCCCGGGAA 
CGTGCCGTCC 
CTTCTCCGTC 
CGGCATGATC 
GGCCCTGCCT 
GACACCCGTG 
CTTCGGGGTC 
CGTCCTGTGT 



GAGTGTGGCA 
TGGTGGAAAA 
CAGAAGCTCA 



51 
I 

GCATGGCGGG 
AAGAGGCGCG 
AGGGCGAGGG 
TGGGGACCAT 
GCTCGCCGGG 
CGCTCTGCTA 
TGCTGGAGGT 
TCCGGCCTTC 
TCTTCCCCAC 
TGCTGCTCAC 
TTGCCGCCGC 
GAAAGGGTGA 
TGGGGAACAT 
TGAATTTCGT 
TCTCCCTGCC 
TGTCCACCGA 
TGGGCGTCAT 
ATGGGTCCCT 
CCTCCATCCT 
CGTGTGTGAT 
TCAGCTTCTT 
ACAGAAAGCC 
TCCTGGCCTG 
TCGGCTTCAC 
ACAAGCCCAA 
TGCAGGTGGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



392 



WO 02/086443 

CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC 



PCT/US02/12476 



5 



Seq ID NO: 543 Protein sequence 
Protein Accession ft: XP_035292.2 



1 ll 21 31 41 51 

1(11 I I 

MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAIIV 60 

GTIIGSGIFV TPTGVLKEAG SPGIALWWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120 

10 LEVYGSLPAP LKliWIELLII RPSSQYIVAL VFATYLIiKPL FPTCPVPEEA AKLVACLCVL 180 

LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLD PN FSPEGTKliDV 240 

GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLPLAIII SIiPIVTLVYV LTNLAYFTTIi 300 

STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360 

SILSM1HPQL LTPVPSI/VPT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420 

15 RKPELERPIK VNIALPVFFI LACLFLIAVS FWKTPVECGI GFTIIIiSGLP VYPFGVWWKN 480 
KPKWLLQGIF STTVLCQKLM QWPQET 

Seq ID NO: 544 DNA sequence 
Nucleic Acid Accession NM_005268.1 
20 Coding sequence: 168.. 989 

1 11 21 31 41 51 

I I 1 I I I 

TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACACGGGCTT CCCCGAAAAC CTTCCCCGCT 60 

25 TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 120 

AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCGTGG GTCCACCATG AACTGGAGTA 180 

TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATCTGGC 240 

TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300 

GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 360 

30 TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420 

CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480 

ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 540 

GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGCGTG GACATCGCCT 600 

TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660 

35 ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720 

TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC 780 

TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 840 

TGTGCACAGG TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900 

CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 960 

40 GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020 

CCTGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080 

CATGAGGTAG GGGCAGGGAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140 

TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200 
GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC 



45 



60 



80 



Seq ID NO: 545 Protein sequence 
Protein Accession # : NP_005259.1 



1 11 21 31 . 41 51 

50 | | | | | 1 

MNWSIPEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60 
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 120 
GKXRGGLWWT YVCSLVPKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCPISKP 180 
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTS SCKQ 240 
55 DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 



Seq ID NO: 546 DNA sequence 

Nucleic Acid Accession #: NM_002391.1 

Coding sequence: 2 6.. 457 



1 11 21 31 41 51 

I I I I I I 

CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60 

CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120 

65 CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

CGGCGTGGGT TTCCGCGAGG OCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240 

GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300 

TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360 

CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 420 

70 AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480 

GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540 

CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 720 

75 ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 
TAATAT 



Seq ID NO: 547 Protein sequence 
Protein Accession #: NP_002382.1 



1 11 21 31 41 ' 51 

I I I I I I 

MQHRGFLLLT LIALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60 
_ CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120 
85 RVTKPCTPKT KAKAKAKKGK GKD 

Seq ID NO: 548 DNA sequence 



393 



WO 02/086443 

Nucleic Acid Accession 8: 
Coding sequence: 1..766 



PCT/US02/12476 



KM 006783.1 



5 

10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



ATGGATTGGG 
GGGAAGGTGT 
CAGGAAGTGT 
AAAAATGTGT 
CTGATCTTCG 
GAAACCACTC 
ATTAAAAAGC 
TTTTTCCGAA 
TACCACCTGC 
TTTATTTCTA 
ATTTGCATGC 
AGATCAAAGA 
CAGAATGAAA 
AGCTAA 



11 
I 

GGACGCTGCA 
GGATCACAGT 
GGGGTGACGA 
GCTATGACCA 
TCTCCACCCC 
GCAAGTTCAG 
ACAAGGTTCG 
TCATCTTTGA 
CCTGGGTGTT 
GGCCAACAGA 
TGCTTAACGT 
GAGCACAGAC 
TGAATGAGCT 



21 
I 

CACTTTCATC 
CATCTTTATT 
GCAAGAGGAC 
CTTTTTCCCG 
AGCGCTGCTG 
GCGAGGAGAG 
GATAGAGGGG 
AGCAGCCTTT 
GAAATGTGGG 
GAAGACCGTG 
GGCAGAGTTG 
GCAAAAAAAT 
GATTTCAGAT 



31 

I 

GGGGGTGTCA 
TTCCGAGTCA 
TTCGTCTGCA 
GTGTCCCACA 
GTGGCCATGC 
AAGAGGAATG 
TCGCTGTGGT 
ATGTATGTGT 
ATTGACCCCT 
TTTACCATTT 
TGCTACCTGC 
CACCCCAATC 
AGTGGT CAAA 



41 

I 

ACAAACACTC 
TGATCCTAGT 
ACACACTGCA 
TCCGGCTGTG 
ATGTGGCCTA 
ATTTCAAAGA 
GGACGTACAC 
TTTACTTCCT 
GCCCCAACCT 
TTATGATTTC 
TGCTGAAAGT 
ATGCCCTAAA 
ATGCAATCAC 



51 
I 

CACCAGCATC 
GGTGGCTGCC 
ACCGGGATGC 
GGCCCTCCAG 
CTACAGGCAC 
CATAGAGGAC 
CACCAGCATC 
TTACAATGGG 
TGTTGACTGC 
TGCGTCTGTG 
GTGTTTTAGG 
GGAGAGTAAG 
AGGTTTCCCA 



Seq ID NO i 549 Protein sequence 
Protein Accession #i NP_006774.1 



51 



MDWGTLHTFI 
KNVCYDHFFP 
IKKHKVRIEG 
PISRPTEKTV 
QNEMNELISD 



11 21 31 41 

I I I I I 

GGVNKHSTSI GKVWITVIFI FRVMILWAA QEVWGDEQED FVCNTLQPGC 

VSHIRLWALQ. LIFVSTPAIjIj VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 

SLWWTYTSSI PFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 

FTIPMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 

SGQNAITGFP S 



Seq ZD NOi 550 DNA sequence 

Nucleic Acid Accession #: NM_00257l.l 

Coding sequence! 99.. 587 



CATCCCTCTG 
TCACCCTGGG 
AGGACCTGGA 
ACATCTCCCT 
CCACCCCCGA 
AGAAGAAGGT 
TGGCGAACGA 
AGGACACCAC 
AGGACGATGA 
GGTACTTGCT 
CCAGGAAGAC 
TTTCAAAGAA 
TCCTGCTGCA 
GCAGAGGTTA 



11 

I 

GCTCCAGAGC 
CGTGGCCCTG 
GCTCCCAAAG 
CATGGCGACA 
GGACAACCTG 
CCTTGGAGAG 
GGCCACGCTG 
CACCCCCATC 
GATCATGCAG 
GGACTTGAAA 
CAGACTCCCA 
TAACCACAGC 
CACCTGCACC 
TTAATAAACC 



21 

I 

TCAGAGCCAC 
GTCTGTGGTG 
TTGGCAGGGA 
CTGAAGGCCC 
GAGATCGTTC 
AAGACTGGGA 
CTCGATACTG 
CAGAGCATGA 
GGATTCATCA 
CAGATGGAAG 
CCCTTCCACA 
TCAGAAGACG 
ATTGCCATGG 
CTTGGAGCAT 



31 

I 

CCACAGCCGC 
TCCCGGCCAT 
CCTGGCACTC 
CTCTGAGGGT 
TGCACAGATG 
ATCCAAAGAA 
ACTACGACAA 
TGTGCCAGTA 
GGGCTTTCAG 
AGCCGTGCCG 
CCTCCAGAGC 
ATGACGTGGT 
GGAGGCTGCT 
G 



41 

I 

AGCCATGCTG 
GGACATCCCC 
CATGGCCATG 
CCACATCACC 
GGAGAACAAC 
GTTCAAGATC 
TTTCCTGTTT 
CCTGGCCAGA 
GCCCCTGCCC 
TTTCTAGCTC 
AGTGGGACTT 
CATCTGTGTC 
CCCTGGGGGC 



51 
I 

TGCCTCCTGC 
CAGACCAAGC 
GCGACCAACA 
TCACTGTTGC 
AGCTGTGTTG 
AACTATACGG 
CTCTGCCTAC 
GTCCTGGTGG 
AGGCACCTAT 
ACCTCCGCCT 
CCTCCTGCCC 
GCCATCCCCT 
AGAGTCTCTG 



Seq ID NO: 551 Protein sequence 
Protein Accession NP_002562.1 

1 11 21 31 41 51 

I I I I I I 

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR 
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 
YLARVLVEDD EIMQGFIRAF RPLPRHLWYI* LDLKQMEEPC RF 

Seq ID NO i 552 DNA sequence 

Nucleic Acid Accession #> NM_006500.l 

Coding sequence: 27.. 1967 



1 
I 

ACTTGCGTCT 
TCGCCGCCTG 
CGCCTGAGCT 
AGTCCCAAGG 
TCATCTTCCG 
TCAGCCTCCA 
GCATCTTCTT 
TCTACAAAGC 
GTAAGGAGCC 
TCATCTGGTA 
CGTCCCAGAC 
TGGTTAAAGA 
GGAACCACAT 
TGTGGCTGGA 
GTTTGGCTGA 
GGGAGGCAGA 
AGGAACACAG 
TGAGTGAACC 
CCCCTGAGAG 
ACCTCGAGTT 



11 
I 

CGCCCTCCGG 
CTGCTGCTGT 
GGTGGAGGTG 
CAACCTCAGC 
TGTGCGCCAG 
GGACAGAGGG 
GTGCCAGGGC 
TCCGGAGGAG 
TGAGGAGGTC 
CAAGAATGGC 
TGTGGAGTCG 
AGACAAAGAT 
GAAGGAGTCC 
AGTGGAGCCC 
TGGCAACCCT 
GGAAGAGACA 
TGGGCGCTAT 
ACAGGAACTA 
ACAGGAAGGC 
CCAGTGGCTG 



21 

I 

CCAAGCATGG 
CCTCQCQTCG 
GAAGTGGGCA 
CATGTCGACT 
GGCCAGGGCC 
GCTACTCTGG 
AAGCGCCCTC 
CCAAACATCC 
GCTACCTGTG 
CGGCCTCTGA 
AGTGGTTTGT 
GCCCAGTTTT 
AGGGAAGTCA 
GTGGGAATGC 
CCACCACACT 
ACCAACGACA 
GAATGTCAGG 
CTGGTGAACT 
AGCAGCCTCA 
AGAGAAGAGA 



31 
I 

GGCTTCCCAG 
CGGGTGTGCC 
GCACAGCCCT 
GGTTTTCTGT 
AGAGCGAACC 
CCCTGACTCA 
GGTCCCAGGA 
AGGTCAACCC 
TAGGGAGGAA 
AGGAGGAGAA 
ACACCTTGCA 
ACTGTGAGCT 
CCGTCCCTGT 
TGAAGGAAGG 
TCAGCATCAG 
ACGGGGTCCT 
CCTGGAACTT 
ATGTGTCTGA 
CCCTGACCTG 
CAGACCAGGT 



41 
I 

GCTGGTCTGC 
CGGAGAGGCT 
TCTGAAGTGC 
CCACAAGGAG 
TGGGGAGTAC 
AGTCACCCCC 
GTACCGCATC 
CCTGGGCATC 
CGGGTACCCC 
GAACCGGGTC 
GAGTATTCTG 
CAACTACCGG 
TTTCTACCCG 
GGACCGCGTG 
CAAGCAGAAC 
GGTGCTGGAG 
GGACACCATG 
CGTCCGAGTG 
TGAGGCAGAG 
GCTGGAAAGG 



51 
I 

GCCTTCTTGC 
GAGCAGCCTG 
GGCCTCTCCC 
AAGCGGACGC 
GAGCAGCGGC 
CAAGACGAGC 
CAGCTCCGCG 
CCTGTGAACA 
ATTCCTCAAG 
CACATTCAGT 
AAGGCACAGC 
CTGCCCAGTG 
ACAGAAAAAG 
GAAATCAGGT 
CCCAGCACCA 
CCTGCCCGGA 
ATATCGCTGC 
AGTCCCGCAG 
AGTAGCCAGG 
GGGCCTGTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540. 
600 
660 
720 
780 



60 
120 
160 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



394 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

TTCAOTTGCA TOACCTGAAA CGGGAGGCAG GAGGCOGCTA 
CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA 
GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA 
AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA 
TGTTGGAGAC AGGTGTTGAA TGCACQGCCT CCAACGACCT 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA 
TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG 
TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT 
TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA 
GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA 
GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT 
CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC 
GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC 
GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC 
CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC 
CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG 
GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT 
TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA 
TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT 
CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT 
CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG 
TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC 
GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC 
AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG 
CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT 
TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA 
AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA 
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCAGACAA 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA 
CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA 

Seq ID NO: 553 Protein sequence 
Protein Accession ft: NP_006491.1 



TCGCTGCGTG 
GGCCATTTTT 
TATGGTGTTG 
CGTCAACGGC 

GGGCAAAAAC 
CTCCAACACA 
CACCTCCACA 
GATTGTGTGC 
GGGCAAGCTG 
GACCGAACTT 
GGGCAGCAGC 
GCATTAGCCC 
CACTCTTCTC 
CTGCACACCC 
AAGCCGCTTT 
AAGCAAGGAG 
TTTACACACA 
TCTGAGCTGG 
AGGACACACC 
AGAGCACCCC 
CCTCTTCAAA 
CCTTAAAAGA 
GGGAGGCCGA 
AACCCTGTCT 
AGTCCCAGCT 
GCAGTGAGCC 
CGAGGAAAAA 
TTCAGGTGAA 
ACGTAAAACT 
GGATTAAAGC 
CCTAGAAGGG 
CTTCGGGTGT 
CAGGTGTGTA 
TATATATATA 
TTCTACATGG 
AACCGTTTCC 
AGCTCTACCA 
GCACGAAGGG 
CTT 



GCGTCTGTGC 
GGCCCCCCTT 
AATCTGTCTT 
ACGGCAAGTG 
ACCCCGGAGC 
ACCAGCATCC 
ACCACTGGCC 
GAGAGAAAGC 
ATCCTGGTCC 
CCGTGCAGGC 
GTAGTTGAAG 
GGTGACAAGA 
CGAATCACTT 
TCAGCCAAAG 
CCTTTCAGAG 
TCAGGGACCA 
CCCCAGTCTC 
TTATGGCTGT 

GGAGCCAGGC 
AGCGGCATCC 
GTCTCCTGTG 
TACGTGCCGG 
GGCGGGCGGA 
CTACTAAAAA 
ACTCGGAAGG 
GAGACCGTGC 
AAAAGAAAAG 
TTAGCCTCAA 
GAAAGGCAGC 
TATGGTTATA 
CCCAAATGAG 
GTGTGTCTGT 
AATTTGCAAA 
TATGAAAAAT 
GTACCACAGG 
AGTTGGCAGC 
GAGCAGACAG 
CCTGGCAGGC 



GLPRLVCAFL 
WFSVHKEKRT 
RSQBYRIQLR 
KEEKNRVKIQ 
TVPVFYFTEK 
NGVLVLE PAR 
TLTCEAESSQ 
LVKLAIFGPP 
STLNVLVTPE 
RANSTSTERK 
PSRKTELWE 



11 
I 

LAACCCCPRV 
LIFRVRQGQG 
VYKAPEEPNI 
SSQTVESSGI. 
VWLEVEPVGM 
KEHSGRYECQ 
DLEFQWLREE 
WMAFKERKVW 
LLETGVECTA 
LPEPESRGW 
VKSDKXjPEEM 



21 
I 

AGVPGEAEQP 
QSEPGEYEQR 
QVNPLGIPVN 
YTLQSILKAQ 
LKEGDRVEIR 
AWNLDTMISL 
TDQVLERGPV 
VKENMVLNLS 
SNDLGKNTSI 
IVAVIVCILV 
GLLQGSSGDK 



31 
I 

APELVEVEVG 
LSLQDRGATL 
SKEPEEVATC 
LVKEDKDAQF 
CliADGNPPPH 
LSEPQELLVN 
LQLHDIiKRBA 
CEASGHPRPT 
LFLELVNIjTT 
LAVLGAVLYF 
RAPGDQGEKY 



Seq ZD NO i 554 DNA sequence 

Nucleic Acid Accession #« NM_003183.3 

Coding sequencer 165.. 2639 



TCGAGCCTGG 
GCTAGGCCGG 
CGAAGGCTGC 
TATTCCTGAC 
TCGGCCCCCA 
CTTTATCTAA 
TAGAAACACT 
GTACTGAACG ' 
AGTACACTGC 
GGGTTCTAGC 
AATATAACAT 
TTTATAAATC 
ATTTAAAAGT 
AAGAGCTTGT 
AATTATTGGT 
CAACTACAAA 
CATGGGATAA 
AGTCTCCACA 
ATGAAGAAAA 
CTGAGGAAGC 
GAACTCTTGG 
CAAAGGCTTA 
GCACAAAGAA 



11 
I 

CGGTAGAATC 
GCGGATCCCG 
CCAGAGAGGT 
CAGCGTGGTT 
CCAGAGACTC 
TATCCAGCAG 
ACTAACTTTT 
TTTTTCACAA 
AAAATGGCAG 
CCACATAAGA 
AGAGCCACTT 
TGAAGATATC 
GGATAATGAA 
TCATCGAGTG 
GGTAGCAGAT 
TTACTTAATA 
TGCAGGTTTT 
AGAGGTAAAA 
GGATGCTTGG 
ATCTAAAGTT 
ATTAGCTTAT 
TTATAGCCCA 
TTATGGTAAA 



21 
I 

TTCCCAGTAG 
TCCTCCCCCG 
GGAGTCGGTA 
CCTTTCGTGC 
GAGAAGCTTG 
CATTCGGTAA 
TCAGCTTTGA 
AATTTCAAGG 
GACTTCTTCA 
GATGATG AT G 
TGGAGATTTG 
AAGAATGTTT 
GAGTTGCTCC 
AAAAGAAGAG 
CATCGCTTCT 
GAGCTAATTG 
AAAGGCTATG 
CCTGGTGAAA 
GATGTGAAGA 
TGCTTGGCAC 
GTTGGCTCTC 
GTTGGGAAGA 
ACCATCCTTA 



31 
I 

GCGGCGCGGG 
ATGTGAGCAG 
GCGGGGCCGG 

ATTCTTTGCT 
GAAAAAGAGA 
AAAGGCATTT 
TCGTGGTGGT 
CTGGACACGT 
TTATAATCAG 
TTAATGATAC 
CACGTTTGCA 
CAAAAGGGTT 
CTGACCCAGA 
ACAGATACAT 
ACAGAGTTGA 
GAATACAGAT 
AGCACTACAA 
TGTTGCTAGA 
ACCTTTTCAC 
CCAGAGCAAA 
AAAATATCTA 
CAAAGGAAGC 



41 
I 

STALLKCGLS 
ALTQVTPQDE 
VGRNGYPIPQ 
YCELNYRLPS 
F8ISKQNPST 
YVSDVRVSPA 
GGGYRCVASV 
ISWNVNGTAS 
LTPDSNTTTG 
LYKKGKLPCR 
IDLRH 



41 

I 

AGGGAAAAGA 
TTTTCCGAAA 
GAACATGAGG 
ACCTCCGGAT 
CTCAGACTAC 
TCTACAGACT 
TAAATTATAC 
GGATGGTAAA 
GGTTGGTGAG 
AATCAACACA 
CAAAGACAAA 
GTCTCCAAAA 
AGTAGACAGA 
TCCCATGAAG 
GGGCAGAGGG 
TGACATCTAT 
AGAGCAGATT 
CATGGCAAAA 
GCAATTTAGC 
ATACCAAGAT 
CAGCCATGGA 
TTTGAATAGT 
TGACCTGGTT 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2680 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
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51 
I 

QSQGNLSHVD 
RIFLCQGKRP 
VIWYKNGRPIi 
GNHMKESREV 
REAEEETTND 
APERQEGSSIi 
PSIPGLNRTQ 
EQDQDPQRVL 
LSTSTASPHT 
RSGKQEITLP 



SI 
I 

GGATTGAGGG 
CCCCGTCAGG 
CAGTCTCTCC 
GACCCGGGCT 
GATATTCTCT 
TCAACACATG 
CTGACATCAA 
AACGAAAGCG 
CCTGACTCTA 
GATGGGGCCG 
AGAATGTTAG 
GTGTGTGGTT 
GAACCACCTG 
AACACGTGTA 
GAAGAGAGTA 
CGGAACACTT 
CGCATTCTCA 
AGTTACCCAA 
TTTGATATAG 
TTTGATATGG 
GGTGTTTGTC 
GGTTTGACGA 
ACAACTCATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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AATTGGGACA TAATTTTGQA GCAGAACATG ATCCGGATGG TCTAGCAGAA TGTGCCCCGA 1440 

ATGAGGACCA GGGAGGGAAA TATGTCATGT ATCCCATAGC TGTGAGTGGC GATCACGAGA 1500 

ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAATCTA TAAGACCATT GAAAGTAAGG 1560 

CCCAGGAGTG TTTTCAAGAA CGCAGCAATA AAGTTTGTGG GAACTCGAGG GTGGATGAAG 1620 

GAGAAGAGTG TGATCCTGGC ATCATGTATC TGAACAACGA CACCTGCTGC AACAGCGACT 1680 

GCACGTTGAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAG TCCTTGCTGT AAAAACTGTC 1740 

AGTTTGAGAC TGCCCAGAAG AAGTGCCAGG AGGOGATTAA TGCTACTTGC AAAGGCGTGT 1800 

CCTACTGCAC AGGTAATAGC AGTGAGTGCC CGCCTCCAGG AAATGCTGAA AATGACACTG 1860 

TTTGCTTGGA TCTTGGCAAG TGTAAGGATG GGAAATGCAT CCCTTTCTGC GAGAGGGAAC 1920 

AGCAGCTGQA GTCCTGTGCA TGTAATGAAA CTGACAACTC CTGCAAGGTG TGCTGCAGGG 1980 

ACCTTTCTGG CCGCTGTGTG CCCTATGTCG ATGCTGAACA AAAGAACTTA TTTTTGAGGA 2040 

AAGGAAAGCC CTGTACAGTA GGATTTTGTG ACATGAATGG CAAATGTGAG AAACGAGTAC 2100 

AGGATGTAAT TGAACGATTT TGGGATTTCA TTGACCAGCT GAGCATCAAT ACTTTTGGAA 2160 

AGTTTTTAGC AGACAACATC GTTGGGTCTG TCCTGGTTTT CTCCTTGATA TTTTGGATTC 2220 

CTTTCAGCAT TCTTGTCCAT TGTGTGGATA AGAAATTGGA TAAACAGTAT GAATCTCTGT 2280 

CTCTGTTTCA CCCCAGTAAC GTCGAAATGC TGAGCAGCAT GGATTCTGCA TCGGTTCGCA 2340 

TTATCAAACC CTTTCCTGCG CCCCAGACTC CAGGCCGCCT GCAGCCTGCC CCTGTGATCC 2400 

CTTCGGCGCC AGCAGCTCCA AAACTGGACC ACCAGAGAAT GGACACCATC CAGGAAGACC 2460 

CCAGCACAGA CTCCCATATG GACGAGGATG GGTTTGAGAA GGACCCCTTC CCAAATAGCA 2520 

GCACAGCTGC CAAGTCATTT GAGGATCTCA CGGACCATCC GGTCGCCAGA AGTGAAAAGG 2580 

CTGCCTCCTT TAAACTGCAG CGTCAGAATC GTGTTAACAG CAAAGAAACA GAGTGCTAAT 2640 

TTAGTTCTCA GCTCTTCTGA CTTAAGTGTG CAAAATATTT TTATAGATTT GACCTACAAA 2700 

TCAATCACAG CTTGTATTTT GTGAAGACTG GGAAGTGACT TAGCAGATGC TGGTCATGTG 2760 

TTTGAACTTC CTGCAGGTAA ACAGTTCTTG TGTGGTTTGG CCCTTCTCCT TTTGAAAAGG 2820 

TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT TTTAGTTTTT AAAATATCTT 2880 

TTGACCTGTG GTGCAAAAGC AGAAAATACA GCTGGATTGG GTTATGAATA TTTACGTTTT 2 940 

TGTAAATTAA TCTTTTATAT TGATAACAGC ACTGACTAGG GAAATGATCA GTTTTTTTTT 3000 

ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA GAAAAGTGGA" 3060 

ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA GATAAATTTA 3120 

GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT TTTTATGTAG 3180 

CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC TTCATAATTC 3240 

TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA TGGTAGCCAG 3300 

TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG TTTTTCTGTA 3360 

TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT AGAAAATTCA 3420 

CTATTGGCTG GGAOTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG GCTGAGGTTG 3480 
CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 

Seq ID NOx 555 Protein sequence 
Protein Accession #* NP_003174,2 

1 11 21 31 41 51 

I I I I I I 

MRQSLLFLTS WPFVLAPRP PDDPGFGPHQ RLBKLDSLLS DYDILSLSNI QQHSVRKRDL 60 

QTSTHVETLL TFSALKRHFK LYLTSSTERF SQKFKWWD GKNESEYTAK WQDFFTGHW 120 

GEPDSRVLAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 180 

PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKNTCK1»LW ADHRFYRYMG 240 

RGEESTTTNY LIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPOE VKPGEKHYNM 300 

AKSYPNEEKD AWDVKMLLEQ FSFDIAEEAS KVdiAHtiFTY QDFDMGTLGL AYVGSPRANS 360 

HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN PGAEHDPDGL 420 

AECAPNEDQG GKYVMYPIAV SGDHENNKMF SNCSKQSIYK TIESKAQECF QBRSNKVCGN 480 

SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA 540 

TCKGVSYCTG NS8ECPPPGN AENDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDNSC 600 

KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDQLS 660 

INTFGKF1AD NIVGSVLVFS LIFWIPFSIIj VHCVDKKLDK QYESLSLFHP SNVEMLSSMD 720 

SASVRIIKPP PAPQTPGRIiQ PAPVIPSAPA APKLDHQRMD TIQEDPSTDS HMDEDGFEKD 780 
PFPNSSTAAK S F EDLTDH P V ARSEKAASFK LQRQNRVNSK ETEC 

Seq ZD NO: 556 DNA sequence 

Nucleic Acid Accession Jh NM_021832.1 

Coding sequence: 164.. 2248 

1 11 21 31 41 51 

I I I i I I 

TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGAAAAGAG GATTGAGGGG 60 

CTAGGCCGGG CGGATCCCGT CCTCCCCCGA TGTGAGCAGT TTTCCGAAAC CCCGTCAGGC 120 

GAAGGCTGCC CAGAGAGGTG GAGTCGGTAG CGGGGCCGGG AACATGAGGC AGTCTCTCCT 180 

ATTCCTGACC AGCGTGGTTC CTTTCGTGCT GGCGCCGCGA CCTCCGGATG ACCCGGGCTT 240 

CGGCCCCCAC CAGAGACTCG AGAAGCTTGA TTCTTTGCTC TCAGACTACG AT ATT CT CTC 300 

TTTATCTAAT ATCCAGCAGC ATTCGGTAAG AAAAAGAGAT CTACAGACTT CAACACATGT 360 

AGAAACACTA CTAACTTTTT CAGCTTTGAA AAGGCATTTT AAATTATACC TGACATCAAG 420 

TACTGAACGT TTTTCACAAA ATTTCAAGGT CGTGGTGGTG GATGGTAAAA ACGAAAGCGA 480 

GTACACTGTA AAATGGCAGG ACTTCTTCAC TGGACACGTG GTTGGTGAGC CTGACTCTAG 540 

GGTTCTAGCC CACATAAGAG ATGATGATGT TATAATCAGA AT CAACACAG ATGGGGCCGA 600 

ATATAACATA GAGCCACTTT GGAGATTTGT TAATGATACC AAAGACAAAA GAATGTTAGT 660 

TTATAAATCT GAAGATATCA AGAATGTTTC ACGTTTGCAG TCTCCAAAAG TGTGTGGTTA 720 

TTTAAAAGTG GATAATGAAG AGTTGCTCCC AAAAGGGTTA GTAGACAGAG AACCACCTGA 780 

AGAGCTTGTT CATCGAGTGA AAAGAAGAGC TGACCCAGAT CCCATGAAGA ACACGTGTAA 840 

ATTATTGGTG GTAGCAGATC ATCGCTTCTA CAGATACATG GGCAGAGGGG AAGAGAGTAC 900 

AACTACAAAT TACTTAATAG AG CT AATTG A CAGAGTTGAT GACATCTATC GGAACACTTC 960 

ATGGGATAAT GCAGGTTTTA AAGGCTATGG AATACAGATA GAGCAGATTC GCATTCTCAA 1020 

GTCTCCACAA GAGGTAAAAC CTGGTGAAAA GCACTACAAC ATGGCAAAAA GTTACCCAAA 1080 

TGAAGAAAAG GATGCTTGGG ATGTGAAGAT GTTGCTAGAG CAATTTAGCT TTGATATAGC 1140 

TGAGGAAGCA TCTAAAGTTT GCTTGGCACA CCTTTTCACA TACCAAGATT TTGATATGGG 1200 

AACTCTTGGA TTAGCTTATG TTGGCTCTCC CAGAGCAAAC AGCCATGGAG GTGTTTGTCC 1260 

AAAGGCTTAT TATAGCCCAG TTGGGAAGAA AAATATCTAT TTGAATAGTG GTTTGACGAG 1320 

CACAAAGAAT TATGGTAAAA CCATCCTTAC AAAGGAAGCT GACCTGGTTA CAACTCATGA 1380 

ATTGGGACAT AATTTTGGAG CAGAACATGA TCCGGATGGT CTAGCAGAAT GTGCCCCGAA 1440 
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TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTGAGTGGCG ATCACGAGAA 1500 

CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 

CCAGGAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 1620 

AGAAGAGTGT GATCCTGGCA TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGCGACTG 1680 

CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA 1740 

GTTTGAGACT GCCCAGAAGA AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 1800 

CTACTGCACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA AATGCTGAAG ATGACACTGT I860 

TTGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGCG AGAGGGAACA 1920 

GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 1980 

CCTTTCCGGC CGCTGTGTGC CCTATGTOGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA 2040 

AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 

GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AGCATCAATA CTTTTGGAAA 2160 

GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC TCCTTGATAT TTTGGATTCC 222 0 

TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 2280 

CGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 2340 

CTGTGATCCC TTCGGCGCCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 2400 

AGGAAGACCC CAGCACAGAC TCACATATGG ACGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 

CAAATAGCAG CACAGCTGCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2520 

GTGAAAAGGC TGCCTCCTTT AAACTGCAGC GTCAGAATCG TGTTGACAGC AAAGAAACAG 2580 

AGTGCTAATT TAGTTCTCAG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 

ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGATGCTG 2700 

GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 2760 

TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 2820 

AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGG TTATGAGTAT 2880 

TTACGTTTTT GTAAATTAAT CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 2940 

GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 3000 

GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA 3060 

GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 3120 

TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 

TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 32 40 

TGGTAGCCAG TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 3300 

TTTTTCTGTA TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 33 60 

AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
3421 GCTGAGGTTG CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 

Seq ID NO: 557 Protein sequence 
Protein Accession ft: NP_068604.1 

1 11 21 31 41 51 

I I I I 1 I 

MRQSLfcFLTS WPFVIAPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL 60 

QTSTHVETLL TPSALKRHPK LYLTSSTERP SQNFKVWVD GKNE9EYTVK WQDFFTGHW 120 

GEPDSRVLAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 180 

PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKNTCKLLW ADHRFYRYMG 240 

RGEESTTTNY LIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYNM 300 

AKSYPNEEKD AWDVKMLLEQ PSFDIAEEAS KVCLAHLFTY QDFDMGTLGL AYVGSPRANS 360 

HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGIi 420 

AECAPNEDQG GKYVMYPIAV SGDHENNKMF SNCSKQSIYK TIESKAQECF QERSNKVCGN 480 

SRVDEGEECD PGIMYLNNOT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA 540 

TCKGVSYCTG NSSECPPPGN AEDDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDNSC 1 600 

KVCCRDLSGR CVPYVDAEQK NIiFLRKGKPC TVGFCDMNGK CEKRVQDVIE RPWDFIDQLS 660 
INTFGKFLAD NIVGSVI»VFS LIPWIPPSIL VHCV 

Seq ID NO: 558 DNA sequence 

Nucleic Acid Accession #: NM_004994.1 

Coding sequence: 20. . 2143 

1 11 21 31 41 51 

I I I I I I 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180 

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240 

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300 

GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420 

GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480 

CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540 

GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660 

GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780 

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840 

GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960 

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020 

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200 

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560 

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 
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CCTTATCGCC GACAAGTGGC CCGOGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740 

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800 

GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860 

CGGGGCCCTC OGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGOGGGCGGC GCCTCTGGAG 1920 

GTTOGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980 

CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 

GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 

CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280 
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 

Seq ID MO: 559 Protein sequence 
Protein Accession #i NP_004985.1 

1 11 21 31 41 51 

I I I I I I 

MSLWQPLVI>V liLVLGCCPAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 60 

RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN 120 

ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY 3RD AD I VI QF GVAEHGDGYP 180 

FDGKDGLLAH AFPPGPGIQG DAHFDDDELW SLGKGVWPT RFGNADGAAC HFPFIFEGRS 240 

YSACTTDGRS DGLPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 300 

ACTTDGRSDG YRWCATTAMY DRDKLFGFCP TRADSTVMGG NSAGELCVFP FTFLGKEYST 360 

CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 420 

PMYRFTEGPP LH KDDVNG I R HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480 

PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ LYLFKDGKYW 540 

RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKXLF FFSGRQVWVY TGASVLGPRR 600 

LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 660 
THDVFQYREK AYFCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED 

Seq ID NOt 560 DNA sequence 

Nucleic Acid Accession ttt NM_000213.1 

Coding sequence: 127.. 5385 

1 11 21 31 41 51 

I I I I I I 

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60 

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 120 

AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180 

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300 

CGGCGCTGCA ACACCCAGGC GOAGCTGCTG GCOGCGGGCT GCCAGCGGGA GAGCATCGTG 360 

GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 

AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 

TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600 

GTCCTG AG CC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660 

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720 

CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840 

ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900 

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960 

GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020 

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 1200 

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260 

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320 

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 13 80 

GATGGGAGGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440 

TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 1500 

CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560 

TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620 

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680 

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800 

ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 

AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 1920 

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980 

TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040 

GGCACCGGCG AQAAGAAGGG . GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100 

GACGAGCTTA AGAGAGCCGA GGAGGTGGTG QTQCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 2220 

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 

CTCCTCCTCC TGCCGCTCCT GG CCCTGCT A CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 

TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 

GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 

TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820 

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880 

GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940 

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 

CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060 



398 
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GTAAACATCA CCATCATCAA GQAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 

TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 31B0 

GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240 

TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 

GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 

TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 

ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480 

TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCOCAATGC TAAGGCCGCT 3540 

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 

GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660 

CCCTCAGTGG AGCTCACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 

TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780 

GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840 

AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900 

CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960 

AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020 

AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080 

CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATOG 4200 

GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 42 60 

TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 

TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 4380 

ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 44 40 

GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 

GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CAT CTCTCAG AGTGAGCTGG 4560 

CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 462 0 

GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680 

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 

TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920 

GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 

GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 

AACGTGCCCT ACAAGTT CAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 

GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 

GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 

AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 

GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 

CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 

TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 

AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
TACTG 

Seq ID NO: 561 Protein sequence 
Protein Accession #: NP_000204.l 

1 11 21 31 41 51 

I I I I' I I 

MAGPRPSPWA RLIiLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60 
CNTQAELLAA GCQRESIWM ESSFQITEET QIDTTLRRSQ MSPQGLRVRL RPGEERHFEL 120 
BVFEPLESPV DLYILMDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180* 
PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240 
QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 300 
TQDYPSVPTL VRLLAKHNII PIFAVTNYSY SYYEKLHTYF PVSSLGVLQE DSSNTVELLE 360 
EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420 
THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCTCELQ KEVRSARCSF NGDFVCGQCV 480 
CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 540 
FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSMA TCIDSNGGIC NGRGHCECGR 600 
CHCHQQSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 660 
LKRAEEVWR CSFRDEDDDC TYSYTMEGDG AFGPNSTVLV HKKKDCPPGS FWWLIPLLLL 720 
LLPLLALLLL LCWKYCACCK ACL ALL PC CN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780 
SGNLKGRDW RWKVTNNMQR PGFATHAAS I NPTELVPYGL 8LRLARLCTE NLLKPDTRBC 840 
AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 900 
LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960 

LVEAIDVPAG TATLGRRLVN ITIIKEQARD WSFEQPEFS VSRGDQVARI PVIRRVLDGG 1020 

KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 1080 

VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140 

RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 1200 

AQGEGPYS9L VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 1260 

NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIINLATQP 1320 

KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 1380 

FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440 

STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLLNGG 1500 

E LHRLN I PNP AQTSWVEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 1560 

LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620 

RVDGDSPESR LTVPGLSENV PYKFKVQART "TEGFGPEREG IITIESQDGG PFPQLGSRAG 1680 

LFQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHLEAGG 8LTRHVTQEF VSRTLTTSGT 1740 
LSTHMDQQFF QT 

Seq ID NO: 562 DNA sequence 

Nucleic Acid Accession »: NM_013332.1 

Coding sequence : 1 . . 63 

1 ii 21 31 41 51 

I I I I I I 
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GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC 
AGTAACCGAC TTTCCTCCGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GATCCACTCC 
CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 
GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 
GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 
GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 
CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 
ATATTTTGGA ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 
TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG 
CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGCGGTG GCTCATGCCT GTAATCCTAG 
CACTTTGGGA GGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 
CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 
GGCCTGTAAT CCCAGTTCCT TGGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCCGGGGACG 
GAGGTTGCAG TGAACCGAGA TCGCACTGCT GTACCCAGCC TGGGCCACAG TGCAAGACTC 
CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CTGTTTAATG CACAGGTGTG AGTGGATTGC 
TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCOGGG GTCTGGTGTA TGCTGTGCTT 
TCCTCAGCAG TATGGCTCTG ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGAGATGG 
TGATATTTTC AACCCTACTT CCTAAACATC TGTCTGGGGT TCCTTTAGTC TTGAATGTCT 
TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCACAAGA GCTCCTCCAT GTTTGGATAG 
CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 
TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 
GGTGGGATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CCACCATGTG GCTTAAATGA 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 563 Protein sequence 
Protein Accession #: NP_037464.1 

l ii 21 31 41 51 

I ! i I I I 

MKHVLNLYLL GWLTLLSIF VRVME9LEGL LBSPSPGTSW TTHSQLANTE PTKGLPDHPS 
RSM 

Seq ID NOi 564 DNA sequence 

Nucleic Acid Accession ft: NM_023 915.1 

Coding sequence: 250.. 1326 
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60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 



GGCACGAGGG 
TCAAAGCTTA 
GTGAATGGAC 
CCCACGCCTC 
AACTGAAGAA 
CAAGAGAGTC 
AATGAATTTG 
TTGCTGAATG 
TTCTATCTCA 
ATAGTCCATG 
TCAGTTTTGT 
GATCGCTATC 
ACGAAGGTTT 
ATCCTGACAA 
CCTTTGGGGG 
GTGCTGGTGA 
AGGCAATTCA 
GTGGCTGTGT 
AGTCACTTAG 
ATTACACTTT 
TGTAGGTCAT 
ATCAGATCAC 
GTGTAGGCCT 
TTCATTATCC 



11 
I 

TTTCGTTTTC 
TTCTTAATTA 
AGCCAGCCAC 
AATCGTCCCC 
TGGGGTTCAA 
ACAATTCAGG 
ACACAATTGT 
GTTTAGCAGT 
AAAACATAGT 
ATGCAGGATT 
TTTATGCAAA 
TGAAGGTGGT 
TATCTGTTTG 
ATGGTCAGCC 
TCAAATGGCA 
TTCTGATCGG 
TAAGTCAGTC 
TTTTTACCTG 
ACAGGCTTTT 
TCTTGTCTGC 
TTTCAAGAAG 
TGCAAAGTGT 
TTTATTGTTT 
TTAAAAAAAA 



21 
I 

ATGCTTTACC 
GAGACAAGAA 
CACAATGAAA 
AAGTGTTTCC 
CTTGACGCTT 
CAACAGGAGC 
CTTGCCGGTG 
GTGGATCTTC 
GGTTGCAGAC 
TGGACCTTGG 
CATGTATACT 
CAAGCCATTT 
TGTTTGGGTG 
AACAQAGGAC 
TACGGCAGTC 
ATGTTACATA 
AAGCCGAAAG 
CTTTCTACCA 
AGATGAATCT 
GTGTAATGTT 
GCTGTTCAAA 
GAGAAGATCG 
GTTGGAATCG 
AA 



31 

I 

AGAAAATCCA 
ACCTGTTTCA 
GAAATCAAAC 
TGACACGCAT 
GCAAAATTAC 
GACGGGCCAG 
CTTTATCTCA 
TTCCACATTA 
CTCATAATGA 
TACTTCAAGT 
TCCATCGTGT 
GGGGACTCTC 
ATCATGGCTG 
AATATCCATG 
ACCTATGTGA 
GCCATATCCA 
CGAAAACATA 
TATCACTTGT 
GCACAAAAAA 
TGCCTGGATC 
AAATCAAATA 
GAAGTTCGCA 
ATATGTACAA 



41 
I 

C TTCC CTGCC 
ACTTGAAGAC 
CAGGAATAAC 
CTTTGCTTAC 
CAAATAACGA 
GAAAGAACAC 
TTATATTTGT 
GGAATAAAAC 
CGCTGACATT 
TTATTCTCTG 
TCCTTGGGCT 
GGATGTACAG 
TTTTGTCTTT 
ACTGCTCAAA 
ACAGCTGCTT 
GGTACATCCA 
ACCAGAGCAT 
GCAGAATTCC 
TCCTATATTA 
CAATAATTTA 
TCAGAACCAG 
TATATTATGA 
AGTGTAAATA 



51 
I 

GACCTTAGTT 
ACCGTATGAG 
CTATGCTGAA 
AGTGCATCAC 
GCTGCACGGC 
CACCCTTCAC 
GGCAAGCATC 
CAGCTTCATA 
TCCATTTCGA 
CAGATACACT 
GATAAGCATT 
CATAACCTTC 
GCCAAACATC 
ACTTAAAAGT 
GTTTGTGGCC 
CAAATCCAGC 
CAGGGTTGTT 
TTTTACTTTT 
CTGCAAAGAA 
CTTTTTCATG 
GAGTGAAAGC 
TTACACTGAT 
AATGTTTCTT 



Seq ID NO: 565 Protein sequence 
Protein Accession #: NP_076404 



MGFNLTLAKL 
GLAVWIFFHI 
PYANMYTSIV 
NGQPTEDNIH 
ISQSSRKRKH 
FLSACNVCLD 



11 
I 

PNNELHGQES 
RNKTSFIFYI* 
FLGLISIDRY 
DCSKLKSPLG 
NQSIRVWAV 
PIIYFFMCRS 



21 
I 

HNSGNRSDGP 
KNIWADLIM 
LKWKPFGDS 
VKWHTAVTYV 
FFTCFLPYHL 
FSRRLFKKSN 



31 
I 

GKNTTLHNEF 
TLTFPPRIVH 
RMYSITFTKV 
NSCIiFVAVLV 
CRIPFTFSHL 
IRTRSESIRS 



41 

I 

DTIVLPVLY1* 
DAGFGPWYFK 
LSVCVWVIMA 
ILIGCYIAIS 
DRLLDESAQK 
LQSVRRSEVR 



51 
I 

IIFVASILLN 
FILCRYTSVL 
VLSLPNIILT 
RYIHKSSRQF 
ILYYCKEITL 
IYYDYTDV 



Seq ID NO: 566 DNA sequence 

Nucleic Acid Accession #: NM_005365.l 

Coding sequence: 1..948 



1 
I 

ATGTCTCTCG 
GAGGACTTGG 
TCCTCTGACA 
CCTCAGGGAG 
GAGGGCTCCA 
GAGTTCATGT 



11 

I 

AGCAGAGGAG 
GCCTGATGGG 
GCAAGGAGGA 
GCGCTTCCTC 
GCAGTCAAGA 
TCCAAGAAGC 



21 
I 

TCCGCACTGC 
TGCACAGGAA 
GGAGGTGTCT 
CTCCATTTCC 
AGAGGAAGAG 
ACTGAAATTG 



31 

I 

AAGCCTGATG 
CCCACAGGCG 
GCTGCTGGGT 
GTCTACTACA 
CCAAGCTCCT 
AAGGTGGCTG 



41 

I 

AAGACCTTGA 
AGGAGGAGGA 
CAT CAAGT CC 
CTTTATGGAG 
CGGTCGACCC 
AGTTGGTTCA 



51 
I 

AGCCCAAGGA 
GACTACCTCC 
TCCCCAGAGT 
CCAATTCGAT 
AGCTCAGCTG 
TTTCCTGCTC 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
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CACAAATATC GAGTCAAGGA GCCGGTCACA AAGGCAQAAA 
AATTACAAGC GCTACTTTCC TGTGATCTTC GGCAAAGCCT 
TTTGGCACTG ATGTGAAGGA GGTGGACCCC GCCGGCCACT 
CTTGGCCTCT CGTGCGATAG CATGCTGGGT GATGGTCATA 
CTGATCATTG TCCTGGGTOT GATCCTAACC AAAGACAACT 
TGGGAAGCGT TGAGTGTGAT GGGGGTGTAT GTTGGGAAGG 
CCCAGGAAGC TGCTCACCCA AGATTGGGTG CAGGAAAACT 
CCCGGCAGTG ATCCTGCGCA CTACGAGTTC CTGTGGGGTT 
AGCTATGAGA AGGTCATAAA TTATTTGGTC ATGCTCAATG 
CCATCCCTTT ATGAAGAGGT TTTGGGAGAG GAGCAAGAGG 

Seq ID NO: 567 Protein sequence 
Protein Accession #* NP_0 053 56.1 
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1 

[ 

MSLEQRSPHC 
PQGGASSSIS 
HKYRVKEPVT 
LGLSCDSMLG 
PRKLLTQDWV 
PSLYEEVLGE 



11 
I 

KPDEDLEAQG 
VYYTLWSQFD 
KAEMLESVIK 
DGHSMPKAAL 
QENYLEYRQV 
EQEGV 



21 31 

I I 

EDLGLMGAQE PTGEEEETT9 

EGSSSQEEEE PSSSVDPAQL 

NYKRYFPVIF GKASEFMQVI 

LIIVLGVILT KDNCAPEEVI 

PGSDPAHYEF LWGSKAHAET 



Seq ID NO: 568 DNA sequence 
Nucleic Acid Accession #: NM_014400 
Coding sequence! 86.. 1126 



GGTTACTCAT 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 
CGGCCTGGAT 
CTGCAACGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTGTGTCCAG 
TGGCTCCTGT 
CCCTCGAATC 
CACATCTGTC 
GCCAGCGCCA 
GGAGCCCAGG 
TCCTGCAAAA 
ATTGGCAGCC 
AAATTTCCCT 
CCCACCACTG 
CTTCTGCTGC 
GGGTGTTCTA 
TCCTCTTGTG 
AGGATGCTAA 
GGTGGGACAA 
ATCGGTTCCC 
CTTATGTCTG 
TTGTATAGTG 



11 
I 

CCTGGGCTCA 
GAG CAGGACG 
GCAGGCTGGC 
TGCGTGCAGA 
CCGGGCGTGG 
TCGCTGGCAG 
CTTCACGGGC 
AAGCTCAACC 
AACGGCGTGG 
CCGCCGGTCG 
AACGTCACCT 
GATGAATTCT 
TGCCAGGGGT 
CCACCCCTTG 
ACCACTTCTA 
ACCAGTCAGA 
TTGACTGGAG 
GGGGGGCCCC 
CTTCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC 
GCTTTTTGAG 
ATGTTAGGAC 
GCTTCCTACT 
TGGCTCCCCA 
CATATGTCTT 
TGTGTGATCA 
AAAAAAAA 



21 
I 

GGTAAGAGGG 
GAGCCATGGA 
TGCTGCTGCT 
AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT 
TCACCTCGCG 
AGTGCTACAG 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCCGCTGTAA 
TCCGGCTGCC 
CCTCGGCCCC 
CTCCGAGACA 
GCGCCGCTGG 
AGCAGCCCCA 
CCGTGGCTGC 
TCTCTGGCCC 
GCCCAGCCCC 
GGCTTTGGGA 
GACAGCTCCT 
AGAGTGAGAG 
CACTTTCTCC 
CTCTAAGCAC 
CCTTACTAGA 
GTTTCTGGCA 



31 

I 

CCCGAGCTCG 
CCCCGCCAGG 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CATCCAGCTG 
GGCGCTCGAC 
CTGTGTGGGC 
CAACGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGACCTC 
CCCTCCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGT CCTA 
TGGGTACCCC 
TGTTTTTCCA 
AATAAAATAC 
GTATCCTTCT 
AAGTCAGCTG 
TAGCCAGCCT 
TGCCTCCCCT 
CTGTGAGCTC 
CATAAATGCC 



TGCTGGAGAG 
CCGAGTTCAT 
CCTACATCCT 
GCATGCCCAA 
GCGCCCCTGA 
AGCACATGTT 
ACCTGGAGTA 
CCAAGGCCCA 
CAAGAGAGCC 
GAGTCTGA 



41 

1 

SSDSKEEEVS 
EFMFQEALKIj 
FGTDVKEVDP 
WEALSVMGVY 
SYEKVINYLV 



41 
I 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGCGGTGG 
CTCCCCGGCA 
CAGCAATGCG 
CCGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCCAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATCCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CTGTGAGCTT 
TCTTCTCATC 
ACATTCCCCA 
CGTTGTATAT 
CATCCTTGTC 
TCACGGGGAA 
GGACTTTGGA 
ACTCCCCGCA 
CTCGAGGGCA 
TCAATAAAGA 



CGTCATCAAA 
GCAGGTGATC 
TGTCACTGCT 
GGCCGCCCTC 
AGAGGTTATC 
CTACGGGGAG 
CCGGCAGGTG 
CGCTGAAACC 
CATCTGCTAC 



51 
I 

AAGSSSPPQS 
KVAELVHFLL 
AGHSYILVTA 
VGKEHMFYGE 
MLNAREPICY 



51 
I 

ACCCAGGGGG 
CCCAGGCCAT 
AGGCCCTGGA 
TGAAGACAGT 
AGACCATCCA 
AGAATGACCG 
CTCAGGATCG 
ATGAGAGTGC 
AGGCGTGCCA 
ACAAGGGCTG 
CTGTCCGGGG 
TCACGCTCAG 
CCTACTTCTC 
TGGCCTCAAC 
CCAAACCCAT 
CCCGGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG 
TCTCCGCTTG 
GGTGAGAGAG 
GCGTGGGGTG 
TCTTTGGGGA 
GGGACCGTGC 
TTTAATTACT 



Seq ID NO: 569 Protein sequence 
Protein Accession #t NP_055215 



1 
I 

MDPARKAGAQ 
CTEAVGAVET 
SRAIiDPAGNE 
AANVTVSLPV 
LPPPEPTTVA 



11 

I 

AMIWTAGWLL 
IHGQFSLAVX 
SAYPPNGVEC 
RGCVQDBFCT 
STTSVTTSTS 
QYPAKGGPQQ 



21 

I 

LLLLRGGAQA 
GCGSGLPGKN 
YSCVGLSREA 
RDGVTGPGFT 
APVRPTSTTK 
PHNKGCVAPT 



31 

I 

LECYSCVQKA 
DRGLDIiHGLL 
CQGTSPPWS 
LSGSCCQGSR 
PMPAPTSQTP 
AGLAALLLAV 



Seq ID NO: 570 DNA sequence 

Nucleic Acid Accession #< NMJ>05329.1 

Coding sequence: 1..1662 



ATGCCGGTGC 
GTGCTGGGTG 
CACTACCTGT 
CTTTTTGCCT 
TCCCCGCGGC 
TTGCGCAAGT 
GTGGTGGATG 
GGCGGCACCG 
GGTGAGACGG 
AGCACCTTCT 



11 
I 

AGCTGACGAC 
GCATCCTGGC 
CCTTCGGCCT 
TCCTGGAGCA 
GGGGCTCGGT 
GCCTGCGCTC 
GCAACCGCCA 
AGCAGGCCGG 
AGGCCAGCCT 
CGTGCATCAT 



21 
I 

AGCCCTGCGT 
AGCCTATGTG 
GTACGGCGCC 
CCGGCGCATG 
GGCACTGTGC 
GGCCCAGCGC 
GGAGGACGCC 
CTTCTTTGTG 
GCAGGAGGGC 
GCAGAAGTGG 



31 
I 

GTGGTGGGCA 
ACGGGCTACC 
ATCCTGGGCC 
CGACGTGCCG 
ATTGCCGCGT 
ATCTCCTTCC 
TACATGCTGG 
TGGCGCAGCA 
ATGGACCGTG 
GGAGGCAAGC 



41 
I 

DDGCSPNKMK 
AFIQLQQCAQ 
CYNASDHVYK 
CNSDLRNKTY 
RQGVEHEASR 
AAGVLL 



41 

I 

CCAGCCTGTT 
AGTTCATCCA 
TGCACCTGCT 
GCCAGGCCCT 
ACCAGGAGGA 
CTGACCTCAA 
ACATCTTCCA 
ACTTCCATGA 
TGCGGGATGT 
GCGAGGTCAT 



420 
4B0 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



51 
I 

TVKCAPGVDV 
DRCNAKLNLT 
GCFDGNVTLT 
FSPRIPPLVR 
DEEPRLTGGA 



51 
I 

TGCCCTGGCA 
CACGGAAAAG 
CATTCAGAGC 
GAAGCTGCCC 
CCCTGACTAC 
GGTGGTCATG 
CGAGGTGCTG 
GGCAGGCGAG 
GGTGCGGGCC 
GTACACGGCC 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTO 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT. CCTGAGCAGC 780 

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC AT.CAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACT CTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG T.TTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440 

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGGAG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT OA 

Seq ID NO: 571 Protein sequence 
Protein Accession #: NP__005320.1 

1 11 21 31 41 51 

I I I 1 I I 

MPVQLTTALR WGTSLPALA VLGGILAAYV TGYQPIHTEK HYLSFGLYGA ILGLHLLIQS 60 

LFAFLEHRRM RRAGQALKDP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISPPDIiKWM 120 

WDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDWRA 180 

STFSCIMQKW GGKREVMYTA FKAIX3DSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240 

VGGDVQILNK YDSWISPLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSIiLQQFIiE 300 

DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360 

YFREWLYNSL WFHKHHIiWMT YESWTGPPP FFIjIATVIQIj PYRGRIWNIL LFLLTVQLVG 420 

1 1 KATYACFL RGNAEMIPMS LYSLLYMSS1* LPAKIPAIAT IKKSGWGTSG RKTIWNFIG 480 

LIPVSIWVAV LLGGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540 
KKPEQYSLAF AEV 

Seq ID NO: 572 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-7095 

l 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA • TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 
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AGCCTTGCTC AGTATTCTGA TQTGCTGTCC ACTACTCATG CTQCTTCA6A QACQCTGGAA 2820 

TTTGGTAGTO AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2 940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3S40 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3 600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3 660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3 900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3 960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACT CACT ATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGG TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC S700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC A CAT AT AT TG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTT CCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAG CAAT TAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAG TGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7600 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 
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Seq ID NOt 573 Protein' sequence : 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLiKKL KFQGWDKTSL ENTF I HNTGK TVEINIiTNDY RVSGGVSEMV 120 

FKASKITPHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRPSSPEBAV KGKGKLRAL 5 180 

I LFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIKEAV CS9EPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PAT8AIPFIS 600 

ENISQGYIFS SENPETITYD V L I PES ARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESPLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDIiVST VNWYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLI*LDNQI 780 

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL 9TTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHTF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020 

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP AS SD PASS EM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHliIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASIiQGL 1260 

TISYASEKYE PVLLKSESSH QWPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440 

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV S SDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560 

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620 

HESRIGXiAEG LESEKKAVI P LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR 1680 

VISTPPTPIF PISDDVGAIP I KHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740 

TADSSNHPDN KHKNRYINIV AYDHSRVKIA QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800 

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFIjVTQKSVQ 1860 

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920 

AKRHAVGPW VHCSAGVGRT GTYIVIiDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980 

QYVFIHDTLV EAILSKETEV LDSHIHAYVN AI*LIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040 

SAALKQOJRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100 

HTIKDFWRMI WDHNAQLWM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160 

NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220 

VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFAD I E QYQFLYKVIL 2280 
SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 

Seq ID NO t 574 DNA sequence 

Nucleic Acid Accession ttt Eos sequence 

Coding sequence: 148-4518 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AG CAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCT CAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATG CTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA AT CCTT AAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
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GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATQTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2 5 BO 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 27 00 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATOGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 



Seq ID NO: 575 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I 1 I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKLi KFQGWDKTSL ENTPIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSliEGQKF? LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRPGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KF8RQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSIiLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPB 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ I STTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVFNT8L N9TSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESI* NTVS ITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SEN PET IT YD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780 

PLVIVSALTF ICLWLVGIIj IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900 

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 

GSQKGRPSGR WTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV WHCSAGVGR 1080 

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGEGTDYIN ASYIMGYYQ3 NEFIITQHPL LHT1KDFWRM IWDHNAQLW 1260 

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY 1320 

VLEVRHFQCP KWPNPDSPIS KTFEIilSVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 
AALPDGNIAE SLESLV 
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Seq ID NO i 576 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 

1 11 21 31 41 51 

I I i I I I 

CACACATACQ CACGCACQAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGOAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TT CAT AACAC TGGGAAAACA GTGGAAATTA ATCT CACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

QAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGT AACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGG CCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 

GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 

ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 

CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 

G CTG CCCAAO GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3 000 

AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 

CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 

GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 

AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 

TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 

GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 

AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 

GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 

GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 

GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 

GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 

GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 

CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3 760 

AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3 840 

CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3 900 

GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960 

GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 

CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 

TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 

AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 

ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 

CTTATGCACC AACTAGAAAA AQAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 432 0 

AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 

GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 

AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 

ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 

GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 

TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 

GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 
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AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 
AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 
GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 
GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 
ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 
CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 
CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 
AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 
TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 



Seq ID NO: 577 Protein sequence: 
Protein Accession #: EOS sequence 



PCTAJS02/12476 



i 
I 

MRILKRFLAC 
QSPINIDEDL 
FKASKITPHW 
ILFEVGTEEN 
TDTVDWIVPK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEF9 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVG5GR 
TEVTPHAFTP 
LVIVSALTFI 
I KHFPKHVAD 
KLAQLAEKDG 
NLVEKGRRKC 
GRWTQYHYT 
SMLQQIQHEG 
YVNALLIPGP 
SSLSGEGTDY 
AEDEFVYWPN 
CPKWPNPDSP 
SVDVYQVAKM 
AESLESLV 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNIjLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 
LHASSGFTEE 
KLTDYINANY 
DQYWPADGSE 
QWPDMGVPEY 
TVNIFGFLKH 
AGKTKLEKQF 
INASYIMGYY 
KDEPINCESF 
ISKTFEIiISV 
INLMRPGVFA 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FEEVQSCTVD 
VDGYNRPKAY 
EYGNFLVTQK 
SItPVLT FVRK 
IRSQRNYLVQ 
QLLSQSNIQQ 
QSNEFIITQH 
KVTLMAEEHK 
IKEEAANRDG 
DIEQYQFLYK 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILiLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKD I SLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
LGITADSSNH 
IAAQGPLKST 
SVQVLAYYTV 
AAYAKRHAVG 
TEEQYVFIHD 
SDYSAALKQC 
PLLiHTIKDFW 
CLSNEEKLII 
PMIVHDEHGG 
VILSLVSTRQ 



CTGTATTTGT 
AATAAAACAC 
GAAATAATCT 
TTATAATTGT 
TTTAGTTTAA 
TGTGTTACCT 
AAATACCTTC 
ATGGTTTTTA 
AAAAAAAAAA 



41 
I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTKYNRI 
QTVTELPPHT 

tgaedssgss 
e6lkdpsmeg 
qgpsvtdlem 
hesriglaeg 
vistpptpif 
pdnkhknryi 
aedfwrmiwe 

RNFTLRNTKI 
PWVHCSAGV 
TLVEAILSKE 
NREKNRTSSI 
RMIWDHNAQL 
QDFILEATQD 
VTAGTFCALT 
EENPSTSLDS 



AGCAATTATC 
TCTTCCATAT 
GTTACTTATT 
AGATTTTTAT 
TGACGTAGTT 
AAGTCATTAA 
ATTTTGAAAG 
TCCAAGGAAT 
AAAAAAAAAA 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNG9LTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSAS LND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
NIVAYDHSRV 
HNVEVIVMIT 
KKGSQKGRPS 
GRTGTYIVLD 
TEVLDSHIHA 
IPVERSRVGI 
WMIPDGQNM 
DYVLEVRHFQ 
TLMHQLEKEN 
NGAALPDGNI 



4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



45 
50 
55 
60 
65 
70 
75 
80 
85 



Seg ID NO: 578 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence * 501 -4514 



1 
I 

CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AATATCCAAC 
AAGTAAATGT 
ACACATTCAT 
TCAGCGGAGG 
AATGCAATAT 
AGATGCAAAT 
GAAAAGGGAA 
ATTTCAAAGC 
TAGATCCATT 
ATGGCTCATT 
CAGTTAGCAT 
CTGGTTATGT 
TCTCTAGACA 
GTTCAGAACC 
GGGAAAGACC 
AGTTGGATGG 
GTGCTATTCT 
GCACTAATGG 
ATCCTGAACT 
AAGAGGGAAA 
ACCAAATCAG 
CGAAATACAA 
AGGGTGATGT 
CAGAAAAAGA 
AAGGTACTTC 
ACTTGTCGGG 
GTTTATTGAC 
CAACTTCTGC 
AAAACCCAGA 
AAGATTCAAC 
TGTGGTTTCC 
GCTTTCTCCA 
CCTTTTCTGC 



11 
I 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG 
GTGTTTGCCG 
AGATTGGCTG 
ATGTAATAGC 
GAATCTTAAG 
TCATAACACT 
AGTTTCAGAA 
GTCATCTGAT 
CTACTGCTTT 
GTTAAGAGCT 
GATTATTGAT 
CATACTGTTG 
GACATCTCCT 
CTCTGAAAGC 
CATGCTGATG 
GGTGTTTTCC 
AGAAAATGTT 
TCGAGTCGTT 
AGAGGACCAA 
CAATAATTTG 
CTTATATGGA 
TGATCTTTTC 
AGACATTGAA 
GAAAAAGGAA 
TGAAGCCAAG 
TCCCAATACA 
TATTTCCTTG 
AGCCTCTTTA 
GACTGCAGAA 
CAGTTTCAAG 
TATCCCATTC 
GACAATAACA 
TTCATCAGGT 
TAGCTCTACA 
GACTAATTAC 
AGGCCCAGTG 



21 

I 

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCAAAACAAT 
AAACTTAAAT 
GGGAAAACAG 
ATGGTGTTTA 
GGATCAGAGC 
GATGCGGACC 
TTATCCATTT 
GGAGTCGAAA 
AACCTTCTGC 
CCCTGCACAG 
CAGTTGGCTG 
GACTACTTAC 
TCATACACTG 
CAGGCTGACC 
TATGATACCA 
ACCAAGCATG 
CTACCCAATA 
AAATACAGCG 
CCTGAATTAA 
GAAGGCGCTA 
CCCCAGATTT 
ACTAACCGAT 
TCTTTAAATT 
ACTTCTCAGA 
AATGATGGCT 
TCCTTAAATA 
CTTGATACTG 
ATCTCTGAGA 
TATGATGTCC 
TCAGAAGAAT 
GACATAACAG 
ACTGAGATAC 
ATGTCACAGG 



31 
I 

TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
CTCCTATCAA 
TTCAGGGTTG 
TGGAAATTAA 
AAGCAAGCAA 
ATAGTTTAGA 
GATTTTCAAG 
TGTTTGAGGT 
GTGTTAGTCG 
CAAACTCAAC 
ACACAGTTGA 

AAAACAATTT 
GAAAGGAAGA 
CAGAGAATTA 
TGATTGAGAA 
AATTTTTGAC 
TGAGTTATGT 
ACCAACTGAT 
TTGGAACTGA 
TTGTGAATCC 
CTACCACAAC 
CCCCAACAAG 
CCACTTCCCA 
CTGTGACTGA 
CTAAAACTGT 
CAGTTT CTAT 
GAGCTGAAGA 
ACATATCCCA 
TTATACCAGA 
CACTAAAGGA 
CACAGCCCGA 
GTGTTGATGA 
GTCCCTCAGT 



41 

f 

GGAGGATTAA 
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAT 
TATTGATGAA 
GGATAAAACA 
TCTCACTAAT 
GATAACTTTT 
AGGACAAAAA 
TTTTGAGGAA 
TGGGACAGAA 
TTTTGGGAAG 
TGACAAGTAT 
CTGGATTGTT 
AGTTCTTACA 
TCGAGAGCAA 
GATTCATGAA 
TACCAGCCTT 
GTTTGCAGTT 
AGATGGCTAT 
TCTTCAGATA 
TGTCGACATG 
AGAAATAATC 
TGGTAGAGAC 
ACACTACAAT 
AGGAAGTGAA 
ACCAGTCACT 
ACTGCCACCT 
TCTTAGATCT 
AACAGAATAT 
TTCTTCAGGC 
AGGGTATATA 
ATCTGCTAGA 
TCCTTCTATQ 
TGTTGGATCA 
ATCTGAGAAG 
TACAGATCTG 



51 
I 

AACAAACAAA 
AGGAGCCGCA 
CGCTTGCATT 
ACAGAGAAAA 
TGGGGAAAGA 
GATCTTACAC 
TCATTGGAAA 
GACTACCGTG 
CACTGGGGAA 
TTTCCACTTG 
GCAGTCAAAG 
GAAAATTTGG 
CAGGCTGCTT 
TACATTTACA 
TTTAAAGATA 
ATGCAACAAT 
CAGTACAAGT 
GCAGTTTGTA 
CTTGTTACAT 
TTGTACCAGC 
CAAGACTTGG 
GTAGCCATAT 
CCTACTGATA 
AAGGAGGAGG 
AGTGCTACAA 
CGCATAGGGA 
TTCTCTGGAA 
AAATTAGCCA 
CACACTGTGG 
CCACATATGA 
GAGGAGGAGA 
TCCAGTCCCG 
TTTTCCTCCG 
AATGCTTCCG 
GAGGGAAATG 
GGCAGAGAGA 
ACAACCAAGT 
GAAATGCCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



407 



WO 02/086443 

ATTATTCTAC CTTTGCCTAC TTCCCAACTO 
CCAGACAACA GGATTTGGTC TCCACGGTCA 
TATACAATGA GGCCAGTAAT AGTAGCCATG 
AATCCGAGAA GAAGGCAGTT ATACCCCTTG 
TAGTGGTTCT TGTGGGTATT CTCATCTACT 
ACTTAGAGGA CAGTACATCC CCTAGAGTTA 
TTTCAGATGA TGTCGGAGCA ATTCCAATAA 
ATGCAAGTAG TGGGTTTACT GAAGAATTTG 
AGAGCTGTAC TGTTGACTTA GGTATTACAG 
ACAAGAATCG ATACATAAAT ATCGTTGCCT 
TTGCTGAAAA GGATGGCAAA CTGACTGATT 
ACAGACCAAA AGCTTATATT GCTGCCCAAG 
GGAGAATGAT ATGGGAACAT AATGTGGAAG 
AAGGAAGGAG AAAATGTGAT CAGTACTGGC 
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC 
TAAGAAACAC AAAAATAAAA AAGGGCTCCC 
CACAGTATCA CTACACGCAG TGGCCTGACA 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA 
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA 
AGATTCAACA CGAAGGAACT GTCAACATAT 
GAAATTATTT GGTACAAACT GAGGAGCAAT 
CCATACTTAG TAAAGAAACT GAGGTGCTGG 
TCCTCATTCC TGGACCAGCA GGCAAAACAA 
AGTCAAATAT ACAGCAGAGT GACTATTCTG 
ATCGAACTTC TTCTATCATC CCTGTGGAAA 
GAGAAGGCAC AGACTACATC AATGCCTCCT 
• TCATCATT AC CCAGCACCCT CTCCTTCATA 
ACCATAATGC CCAACTGGTG GTTATGATTC 
TTGTTTACTG GCCAAATAAA GATGAGCCTA 
TGGCTGAAGA ACACAAATGT CT AT CTAATG 
TAGAAGCTAC ACAGGATGAT TATGTACTTG 
CAAATCCAGA TAGCCCCATT AGTAAAACTT 
CTGCCAATAG GGATGGGCCT ATGATTGTTC 
CTTTCTGTGC TCTGACAACC CTTATGCACC 
ACCAGGTAGC CAAGATGATC AATCTGATGA 
ATCAGTTTCT CTACAAAGTG ATCCTCAGCC 
CCACCTCTCT GGACAGTAAT GGTGCAGCAT 
AGTCTTTAGT TTAACACAGA AAGGGGTGGG 
TCCTAAAATT AGGCAGGAAA ATCAGTCTAG 
ACAGTAACTT TCATGACATA GGATTCTGCC 
TTTTTGCAAG ACTTGTAATT TACTTATTAT 
ATTTCTAAGA ATGGAATTGT GGT ATT TTTT 
TTATAGAGGT TAGGAATTCC AAACTACAGA 
CTGTATTTGT AGCAATTATC AGGTTTGCTA 
AATAAAACAC TCTTCCATAT GATATTCAAC 
GAAATAATCT GTTACTTATT GTAAATACTG 
TTATAATTGT AGATTTTTAT ATTTTACTAC 
TTTAGTTTAA TGACGTAGTT CATTAGCTGG 
TGTGTTACCT AAGTCATTAA CTTTGTTTCA 
AAATACCTTC ATTTTGAAAG AAGTTTTTAT 
ATGGTTTTTA TCCAAGGAAT TGCAAAAATA 
AAAAAAAAAA AAAAAAAAAA AAA 



AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 

ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400 

AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 

TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520 

GGAGGAAATG CTTCCAGACT GCACACTTTT 2580 

TATCCACACC TCCAACACCT ATCTTTCCAA 2640 

AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 

AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 

CAGACAGCTC CAACCACCCA GACAACAAGC 2820 

ATGATCATAG CAGGGTTAAO CTAGCACAGC 2880 

ATATCAATGC CAATTATGTT GATGGCTACA 2940 

GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 

TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 

CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 

TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 

AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 

TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 

AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 

CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 

TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 

ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 

ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 

AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 

CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 

GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 

ATATCATGGG CTATTACCAG AGCAATGAAT 3840 

CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 

CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 

TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 

AGGAAAAACT TATAATTCAG GACTTTATCT 4080 

AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 

TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 

AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 

GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380 

TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 

TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 

GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560 

TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 

GCCAAATTTA TATCATTAAC AATGTGTGCC 4680 

GTTTGAACTA AAATGATTGA ATTTTACAGT 4740 

TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800 

AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 

GAAATATAAC TTTTAATACA GTAGCCTGTA 4 920 

ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980 

CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 

TGAGTCAAGT TTT CTAGTTC TGTGTAATTG 5100 

TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 

GCATGTAATT TTAACTTTTG TGGAAAATAG S220 

GAGAATAACA CCTTACCAAA CATTGTTCAA S280 

AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 



Seq ID NOt 579 Protein sequence: 
Protein Accession #: EOS sequence 



1 11 21 

I I I 

MVPKASKITF HWGKCNMSSD GSEHSLEGQK 
LSILFEVGTE ENLDFKAIID GVESVSRFGK 
PCTDTVDWIV FKDTVSISES QLAVFCEVLT 
SYTGKEEIHE AVCSSEPENV QADPENYTSL 
TKHEFLTDGY QDLGAILNNL LPNMSYVLQI 
PEL I GT EE I I KEEEEGKDIE EGAIVNPGRD 
TNRSPTRGSE PSGKGDVPNT SLNSTSQFVT 
NDGSKTVLRS PHMNLSGTAE SLNTVSITEY 
ISENISQGYI FSSENPETIT YDVLIPESAR 
DZTAQPDVGS GRESFLQTNY TEIRVDESEK 
FPTEVTPHAF TPSSRQQDLV STVNWYSQT 
IPLVIVSALT FICLWLVGI LIYWRKCFQT 
IPIKHFPKHV ADLHASSGFT EEFETLKEFY 
IVAYDHSRVK LAQLAEKDGK LTDYXNANYV 
NVEVIVMITN LVEKGRRKCD QYWPADGSEE 
XGSQKGRPSG RWTQYHYTO WPDMGVPEYS 
RTGTYIVLDS MLQQIQHEGT VNI FGFLKHI 
EVLDSHIHAY VNALLIPGPA GKTKLEKQFQ 
PVERSRVGIS SLSGEGTDYI NASYIMGYYQ 
VMIPDGQNMA EDEFVYWPNK: DEPINCESFK 
YVLEVRHFQC PKWPNPDSPI SKTFELISVI 
LMHQLEKENS VDVYQVAKMI NLMRPGVFAD 
GAALPDGNIA ESLESLV 



31 41 51 

I t I 

FPLEMQIYCF DADRFSSFEE AVKGKGKLRA 60 

QAALDPFILL NLLPNSTDKY YIYNGSLTSP 120 

MQQSGYVMLM DYLQNNFREQ QYKFSRQVFS 180 

LVTWERPRW YDTMIEKFAV LYQQLDGEDQ 240 

VAICTNGLYG KYSDQIiIVDM PTDNPELDLF 300 

SATNQIRKKE PQISTTTHYN RIGTKYNEAK 360 

KLATEKDISL TSQTVTELPP HTVEGTSASL 420 

EEESLLTSFK LOTGAEDSSG SSPATSAIPF 480 

NASEDSTSSG SEESLKDPSM EGNVWFPSST 540 

TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 600 

TQPVYNEASN SSHESRIGLA EGLESEKKAV 660 

AHFYLEDSTS PRVISTPPTP IFPISDDVGA 720 

QEVQSCTVDL GITADSSNHP DNKHKNRYIN 780 

DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH 840 

YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 900 

LPVLTFVRKA AYAKRHAVGP VWHCSAGVG 960 

RSQRNYIiVQT .EEQYVFIHDT LVEAILSKET 1020 

LLSQSNIQQS DYSAAIiKQCN REKNRTSSII 1080 

SNEFIITQHP LLHTI KDFWR MIWDHNAQLV 1140 

VTLMAEEHKC LSNEEKLIIQ DFILEATQDD 1200 

KEEAANRDGP MIVHDEKGGV TAGTFCALTT 1260 

IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN 1320 



Seq ID NO: 580 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4632 

X ll 21 31 41 51 



408 



WO 02/086443 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACGGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTT AT CCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 

ACACAGT AT C ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGGCACAG ACTA CAT CAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 40 BO 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 43 BO 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 
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ATAATTGTAG ATTTTTATAT 
TAGTTTAATG ACGTAGTTCA 
TGTTACCTAA GTCATTAACT 
ATACCTTCAT TTTGAAAGAA 
GGTTTTTATC CAAGGAATTG 
AAAAAAAAAA AAAAAAAAAA 



PCT/US02/12476 



TTTACTACTG AGTCAAGTTT 
TTAGCTGGTC TTACTCTACC 
TTGTTTCAGC ATGTAATTTT 
GTTTTTATGA GAATAACACC 
CAAAAATAAA TATAAATATT 
A 



TCTAGTTCTG TGTAATTGTT 
AGTTTTCTGA CATTGTATTG 
AACTTTTGTG GAAAATAGAA 
TTACCAAACA TTGTTCAAAT 
GCCATTAAAA AAAAAAAAAA 



5220 
5280 
5340 
5400 
5460 



Seq ID NO; 581 Protein sequences 
Protein Accession 8: EOS sequence 



1 
I 

MRILKRFLAC 
QSPINIDEDL 
PKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEZIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAPTP 
LVIVSALTP1 
IKHPPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
IjDSH I HAYVN 
SRVAGTILLS 
YIMGYYQSNE 
INCESFKVTL 
PELISVIKEE 
RPGVFADIEQ 



11 

I 

IQLLCVCRLD 
TQVNVNIiKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTV9ISE5QI) 
CSSEPENVQA 
LGAILNNLLP 
BEEGKDZEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGIIiI 
LHASSGPTEE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
QSNIQQSDYS 
FIITQHPLLH 
MAEEHKCIiSN 
AANRDGPMIV 
YQFLYKVILS 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
PETLKEPYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQGIi 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 
KOEHGGVTAG 
LVGTRQEENP 



31 
t 

KLVEEIGWSY 
ENTF IHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 



41 

I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 



KSFSAGPVMS 
PVYNEASNSS 
PYLEDSTSPR 
VQSCTVDLGI 
YNRPKAY I AA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
TLSPRLECRG 
NRTSSIIPVE 
DHNAQLWMI 
LEATQDDYVL 
TFCALTTLMH 
STSLDSNGAA 



ESLKDPSMEG 
QGPSVTDIiEM 
HESRIGLAEG 
VISTPPTPIP 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
TISAHCNIiPL 
RSRVGISSLS 
PDGQNMAEDE 
EVRHFQCPKW 
QLEKENSVDV 
IiPDGNIAESL 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLPPE 
GTKYNEAKTN 
VEGTSASUJD 
PATSAIPF1S 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
PGLTDPPTSA 
GEGTDYINAS 
FVYWPMKDEP 
PNPDSPISKT 
YQVAKMINLM 
ESLV 



Seq ID NO: 582 DNA sequence 

Nucleic Acid Accession #> NM_002851.1 

Coding sequence: 148.. 7092 



1 
I 

CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AAATATCCAA 
CAAGTAAATG 
AACACATTCA 
GTCAGCGGAG 
AAATGCAATA 
GAGATGCAAA 
GGAAAAGGGA 
GATTTCAAAG 
TTAGATCCAT 
AATGGCTCAT 
ACAGTTAGCA 
TCTGGTTATG 
TTCTCTAGAC 
AGTTCAGAAC 
TGGGAAAGAC 
CAGTTGGATG 
GGTGCTATrc 
TGCACTAATG 
AATCCTGAAC 
GAAGAGGGAA 
AACCAAATCA 
ACGAAATACA 
AAGGGTGATG 
ACAGAAAAAG 
GAAGGTACTT 
AACTTGTCGG 
AGTTTATTGA 
GCAACTTCTG 
GAAAACCCAG 
GAAGATTCAA 
GTGTGGTTTC 
AGCTTTCTCC 
TCCTTTTCTG 
CATTATTCTA 
TCCAGACAAC 
GTATACAATG 
ACCCCTTTGT 



11 

I 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG 
GTGTTTGCCG 
AGATTGGCTG 
CATGTAATAG 
TGAATCTTAA 
TTCATAACAC 
GAGTTTCAGA 
TGTCATCTGA 
TCTACTGCTT 
AGTTAAGAGC 
CGATTATTGA 
TCATACTGTT 
TGACATCTCC 
TCTCTGAAAG 
TCATGCTGAT 
AGGTGTTTTC 
CAGAAAATGT 
CTCGAGTCGT 
GAGAGGACCA 
TCAATAATTT 
GCTTATATGG 
TTGATCTTTT 
AAGACATTGA 
GGAAAAAGGA 
ATGAAGCCAA 
TTCCCAATAC 
ATATTTCCTT 
CAGCCTCTTT 
GGACTGCAGA 
CCAGTTTCAA 
CTATCCCATT 
AGACAATAAC 
CTTCATCAGG 
CTAGCTCTAC 
AGACTAATTA 
CAGGCCCAGT 
CCTTTGCCTA 
AGGATTTGGT 
GTGAGACACC 
• TGCTTGACAA 



21 

I 

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCCAAAACAA 
GAAACTTAAA 
TGGGAAAACA 
AATGGTGTTT 
TGGATCAGAG 
TGATGCGGAC 
TTTATCCATT 
TGGAGTCX3AA 
GAACCTTCTG 
TCCCTGCACA 
CCAGTTGGCT 
GGACTACTTA 
CTCATACACT 
TCAGGCTGAC 
TTATGATACC 
AACCAAGCAT 
GCTACCCAAT 
AAAATACAGC 
CCCTGAATTA 
AGAAGGCGCT 
ACCCCAGATT 
GACTAACCGA 
ATCTTTAAAT 
GACTTCTCAG 
AAATGATGGC 
ATCCTTAAAT 
GCTTGATACT 
CATCTCTGAG 
ATATGATGTC 
TTCAGAAGAA 
AGACATAACA 
CACTGAGATA 
GATGTCACAG 
CTTCCCAACT 
CTCCACGGTC 
TCTTCAACCT 
TCAGATCCTC 



31 
I 

TCTATACACT 
CTCTCCACTC 
CGAATOCTAA 
GCTAATGGAT 
GGAGCACTGA 
TCTCCTATCA 
TTTCAGGGTT 
GTGGAAATTA 
AAAGCAAGCA 
CATAGTTTAG 
CQ ATT TT CAA 
TTGTTTGAGG 
AGTGTTAGTC 
CCAAACTCAA 
GACACAGTTG 
GTTTTTTGTG 
CAAAACAATT 
GGAAAGGAAG 
CCAGAGAATT 
ATGATTGAGA 
GAATTTTTGA 
ATGAGTTATG 
GACCAACTGA 
ATTGGAACTG 
ATTGTGAATC 
TCTACCACAA 
TCCCCAACAA 
TCCACTTCCC 
ACTGTGACTG 
TCTAAAACTG 
ACAGTTTCTA 
GGAGCTGAAG 
AACATATCCC 
CTTATACCAG 
TCACTAAAGG 
GCACAGCCCG 
CGTGTTGATG 
GGTCCCTCAG 
GAGGTAACAC 
AACGTGGTAT 
TCCTACAGTA 
AACACTACCC 



41 
I 

GGAGGATTAA 
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAA 
ATATTGATGA 
GGGATAAAAC 
ATCTCACTAA 
AGATAACTTT 
AAGGACAAAA 
GTTTTGAGGA 
TTGGGACAGA 
GTTTTGGGAA 
CTGACAAGTA 
ACTGGATTGT 
AAGTTCTTAC 
TTCGAGAGCA 
AGATTCATGA 
ATACCAGCCT 
AGT TTGCAGT 
CAGATGGCTA 
TTCTTCAGAT 
TTGTCGACAT 
AAGAAATAAT 
CTGGTAGAGA 
CACACTACAA 
GAGGAAGTGA 
AACCAGTCAC 
AACTGCCACC 
TTCTTAGATC 
TAACAGAATA 
ATTCTTCAGG 
AAGGGTATAT 
AATCTGCTAG 
ATCCTTCTAT 
ATGTTGGATC 
AATCTGAGAA 
TTACAGATCT 
CTCATGCTTT 
ACTCGCAGAC 
GTGAAGTCTT 
CTGCTGCTTC 



51 
I 

AACAAACAAA 
AGGAGCCGCA 
CGCTTGCATT 
ACAGAGAAAA 
TTGGGGAAAG 
AGATCTTACA 
ATCATTGGAA 
TGACTACCGT 
TCACTGGGGA 
ATTTCCACTT 
AGCAGTCAAA 
AGAAAATTTG 
GCAGGCTGCT 
TTACATTTAC 
TTTTAAAGAT 
AATGCAACAA 
ACAGTACAAG 
AGCAGTTTGT 
TCTTGTTACA 
TTTGTACCAG 
TCAAGACTTG 
AGTAGCCATA 
GCCTACTGAT 
CAAGGAGGAG 
CAGTGCTACA 
TCGCATAGGG 
ATTCTCTGGA 
TAAATTAGCC 
TCACACTGTG 
TCCACATATG 
TGAGGAGGAG 
CTCCAGTCCC 
ATTTTCCTCC 
AAATGCTTCC 
GGAGGGAAAT 
AGG CAG AG AG 
GACAACCAAG 
GGAAATGCCA 
TACCCCATCC 
AACCCAACCG 
TCCTCTAGTC 
AAGTAGTGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1390 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATQ TGTCATTTQA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3 660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTG CTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CA CAG AG AT G GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGT CAT A GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTG GTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGACAGA 4440 

GATAGTGATG GCTTATCCAT TCATAAGTGT ATGTCATGCT CATCCTATAG AGAATCACAG 4500 

GAAAAGGTAA TGAATGATTC AGACACCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 4560 

ATCTCATACT CACTATCTGA GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 4620 

GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC CATCAGCAAA TGGGCTATCC 46S0 

CAAAAGCACA ATGATGGAAA AGAGGAAAAT GACATTCAGA CTGGTAGTGC TCTGCTTCCT 4740 

CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 4800 

CAAGGTACCT CAGATAGCCT TAATGAGAAT GAGACTTCCA CAGATTTCAG TTTTGCAGAC 4860 

ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGGTG ACTCAGAAAT AACTCCTGGA 4920 

TTCCCACAGT CCCCAACATC ATCTGTTACT AGCGAGAACT CAGAAGTGTT CCACGTTTCA 4980 

GAGGCAGAGG CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 5040 

TCCGAGAAGA AGGCAGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA 5100 

GTGGTTCTTG TGGGTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 5160 

TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 5220 

TCAGATGATG TCGGAGCAAT TCCAATAAAG CACTTTCCAA AGCATGTTGC AGATTTACAT 5280 

GCAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 5340 

AGCTGTACTG TTGACTTAGG TATTACAGCA GACAGCTCCA ACCACCCAGA CAACAAGCAC 5400 

AAGAATCGAT ACATAAATAT CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 5460 

GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC 5520 

AGACCAAAAG CTTATATTGC TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG 5580 

AGAATGATAT GGGAACATAA TGTGGAAGTT ATTGTCATGA TAACAAACCT CGTGGAGAAA 5640 

GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT 5700 

CTGGTCACTC AGAAGAGTGT GCAAGTGCTT GCCTATTATA CTGTGAGGAA TTTTACTCTA 5760 

AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 5820 

CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 5880 

ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 5940 

TGCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 6000 

ATTCAACACG AAGGAACTGT CAACATATTT GGGTTCTTAA AACACATCCG TTCACAAAGA 6060 

AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 6120 

ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGTCATATTC ATGCCTATGT TAATGCACTC 6180 

CTCATTCCTG GACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT CCTGAGCCAG 6240 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 6300 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 6360 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 6420 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 6480 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 6540 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 6600 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 6660 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 6720 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 6780 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 6840 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 6900 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 6960 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 7020 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 7080 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 7140 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 7200 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 7260 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 7320 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 7380 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 7440 

GTATTTGTAG CAATTATCAG GTTTG CT AG A AATATAACTT TTAATACAGT AGCCTGTAAA 7500 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 7560 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 7620 

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 7680 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 7740 
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TOTTACCTAA GTCATTAACT TTGTTTCAOC ATGTAATTTT AACTTTTGTG GAAAATAGAA 7800 

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 7860 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO : 583 Protein sequence 
Protein Accession #: NP_002842.1 

1 11 21 31 41 51 

MRILKRFLAC IQLLCVCRLD WANGYYP.QQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK €0 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

PKASKITFHW GKCNMSSDGS EHS1*EGQKPP LEMQIYCFDA DRFSSPEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGXQA ALOPFILLNI* LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 3 00 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEPS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLR9PH MMLSGTAESIi NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSPSAGPVMS QGPSVTDLEM PHYSTPAYFP 720 

TEVTPHAFTP SSRQQDI»VST VNWYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780 

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAI* SGDGEWSGAS SDSEFLXjPDT DGLTALNISS 1020 

PVSVAEFTYT TSVFGDDNXA LSXSEIIYGN ETELQIPSFN BMVYPSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMPP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVPDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLLKSESSH QWPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 13 20 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD RDSDGLSIHK 1440 

CMSCSSYRES QEKVMNDSDT HENSLMDQNN PISYSLSENS EEDNRVTSVS SDSQTGMDRS 1500 

PGKSPSANGL SQKHNDGKEE NDIQTGSALL PLSPESKAWA VLTSDEE8GS GQGTSDSLNE 15 60 

NETSTDFSFA DTNEKDADGI LAAGDSEITP GFPQSPTSSV TSENSEVFHV SEAEASNSSK 1620 

ESRIGLAEGL ESEKKAVIPL VIVSALTFIC LWLVGILIY WRKCFQTAHF YLEDSTSPRV 1680 

ISTPPTPIFP ISDDVGAIPI KHFPKHVADL HASSGFTEEF ETLKEFYQEV QSCTVDLGIT 1740 

ADSSNHPDNK HKNRYINIVA YDHSRVKLAQ LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 1800 

GPIiKSTAEDP WRMIWEKNVE V1VMITNLVE KGRRKCDQYW PADGSEEYGM FLVTQKSVQV I860 

LAYYTVRNFT LRNTKIKKGS QKGRPSGRW TQYHYTQWPD MGVPEYSLPV LTFVRKAAYA 1920 

KRHAVGPWV HCSAGVGRTG TYIVLDSMLQ QIQHEGTVNI FGFLKHIRSQ RNYLVQTEEQ 1980 

YVFIHDTLVE AILSKETEVL DSHIHAYVNA LLIPGPAGKT KLEKQFQLLS QSNIQQSDYS 2040 

AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH 2100 

TIKDFWRMIW DHNAQL.WMI PDGQNMAEDE FVYWPNKDEP IKCESPKVTL MAEEHKCLSN 2160 

EEKLIIQDFI IiEATQDDYVL EVRHFQCPKW PNPDSPISKT FELISVIXEB AANRDGPMIV 2220 

HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 2280 
LVSTRQEENP STSLDSNGAA LPDGNIAESL ESLV 

Seq ID NO: 584 DNA sequence 

Nucleic Acid Accession ft: NM_005688.1 

Coding sequence: 126.. 4439 

1 11 21 31 41 51 

i I I I I I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTOG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGCGCC 300 

TCTCT CTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 3 60 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGT TAGT GCTGGGCCTC CTCCTGAOGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACT T ACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG OGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 
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A CAG CAT CG A TCTGGAOATC CAAGAGGGTA AACTGGTTGO AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC C T CTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAQAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2S80 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820 

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGOGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGAGGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 

ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA T AAAAT T ATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ZD NO( 585 Protein sequence 
Protein Accession #: NP_005679.1 

1 11 21 31 41 51 

I I I I I I 

MKDIDIGKEY IXPSPGYRSV RERTS TSGTH RDREDSKFRR TRPLECQDAL BTAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLF S CM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRIjE RLWQEELNEV GPDAASLRRV VWIPCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGliLLTEI VRSWSIALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAIIjGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREBERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIK1 EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLiL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVI FM KEG C ITERGTHEEL MNLNGDYATI 780 

FNNX.LLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 
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VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL 
VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT 
KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV 
LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHXTSSIQG 
LDDNQAPFFL PTCAMRWLAV RLDLISIALI TTTGLMIVLM 
GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN 
RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL 
IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW 
SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM 
MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVL.L.SNDSS 

Seq ID NO: 586 DNA sequence 

Nucleic Acid Accession #: NMJJ01327.1 

Coding sequence i 8 9 . . 6 3 1 



PCT/US02/12476 



SYWIRQGSGN 
LRASSRLHDE 
FFCVGMIAGV 
LATIHAYNKG 
HGQIPPAYAG 
KAPSPDWPQE 
FRLVELSGGC 
DALERTHMKE 
DTBTDLLIQE 
RFYAMFAAAE 



1 
I 

AGCAGGGGGC 
CTGAGAGCCG 
GACGGGCGAT 
TGGCGGCCCA 
AAGGGCCTCG 
GCTGAATGGA 
CCTCGCCATG 
GGATGCCCCA 
CATACTGACT 
CTGTCTCCAG 
GGCTCAGCCT 
GCCTCCTCCC 
GTTTGTCGCT 



11 
I 

GCTGTGTGTA 
GGCAGAGGCT 
GCTGATGGCC 
GGAGAGGCGG 
GGGCCGGGAG 
TGCTGCAGAT 
CCTTTCGCGA 
CCGCTTCCCG 
ATCCGACTGA 
CAGCTTTCCC 
CCCTCAGGGC 
CTAGGGAATG 
GGAGGAGGAC 



21 
I 

CCGAGAATAC 
CCGGAGCCAT 
CAGGAGGCCC 
GTGCCACGGG 
GAGGCGCCCC 
GCGGGGCCAG 
CACCCATGGA 
TGCCAGGGGT 
CTGCTGCAGA 
TGTTGATGTG 
AGAGGCGCTA 
GTCCCAGCAC 
GGCTTACATG 



31 
I 

GAGAATACCT 
GCAGGCCGAA 
TGGCATTCCT 
CGGCAGAGGT 
GCGGGGTCCG 
GGGGCCGGAG 
AGCAGAGCTG 
GCTTCTGAAG 
CCACCGCCAA 
GATCACGCAG 
AGCCCAGCCT 
GAGTGGCCAG 
TTTGTTTCTG 



41 

I 

CGTGGGCCCT 
GGCCGGGGCA 
GATGGCCCAO 
CCCCGGGGCG 
CATGGCGGCG 
AGCCGCCTGC 
GCCCGCAGGA 
GAGTTCACTG 
CTGCAGCTCT 
TGCTTTCTGC 
GGCGCCCCTT 
TTCATTGTGG 
TAGAAAATAA 



TTVTRGNETS 
LFRRILRSPM 
FPWFLVAVGP 
QEFIiHRYQEL 
LAISYAVQLT 
GEVTFENAEM 
IKIDGVRISD 
CIAQLPLKLE 
TIREAFADCT 
NKVAVKG 



51 
I 

GACCTTCTCT 
CAGGGGGTTC 
GGGGCAATGC 
CAGGGGCAGC 
CGGCTTCAGG 
TTGAGTTCTA 
GCCTGGCCCA 
TGTCCGGCAA 
CCATCAGCTC 
CCGTGTTTTT 
CCTAGGTCAT 
GGGCCTGATT 
AACTGAGCTA 



CCTCGTGGGC 
GAAGGCCAGG 
CCTGATGGCC 
GGTCCCCGGG 
CCGCATGGCG 
GACAGCOGCC 
ATCAGCTCCT 
GTGTTTTTGG 
TAGGTCATGC 
GCCTGATTGT 
CTGAGCTA 



11 

I 

CCTGACCTTC 
GCACAGGGGG 
CAGGGGGCAA 
GCGCAGGGGC 
GTGCCGCTTC 
TGCTTCAGTT 
GTCTCCAGCA 
CTCAGGCTCC 
CTCCTCCCCT 
TTGTCGCTGG 



21 
I 

TCTCTGAGAG 
TTCGACGGGC 
TGCTGGCGGC 
AGCAAGGGCC 
TGCGCAGGAT 
CCGACTGACT 
GCTTTCCCTG 
CTCAGGGCAG 
AGGGAATGGT 
AGGAGGACGG 



31 
I 

CCGGGCAGAG 
GATG CTGATG 
CCAGGAGAGG 
TCGGGGCCGA 
GGAAGGTGCC 
GCTGCAGACC 
TTGATGTGGA 
AGGCGCTAAG 
CCCAGCACGA 
CTTACATGTT 



41 
I 

GCTCCGGAGC 
GCCCAGGAGG 
CGGGTGCCAC 
GAGGAGGCGC 
CCTGCGGGGC 
ACCGCCAACT 
TCACGCAGTG 
CCCAGCCTGG 
GTGGCCAGTT 
TGTTTCTGTA 



Seq ID NO: 589 Protein sequence 
Protein AccesBion #: Eos sequence 



SI 



ACAGCGGAGC 
AGACAGAGAC 
GCTTCTCGCT 
ATGGGAAGTC 
TCCGCTGCCT 
GCTTTTACCG 
CTCTTAGTGC 
CCAGATGCGA 
ACCAGAGACT 
ACGCGGGCCG 
CAGGTTACTA 
GGCATTCAGC 
TTCATCAAGA 
AATGGTCACA 



11 

I 

GCAGAGTGAG 
TGAGCGGCCC 
CCTCCTGCCC 
CAGGCAGTGT 
CAACTGCAAT 
GCACAGAGAA 
TCGATGTGAC 
CCGATGTCTG 
GCTAGACTCC 
CTGTGTCTGC 
TAATCTGGAT 
CAGCTQCCGC 
TGTTGATGGC 
GCGCCATCAA 



21 
I 

AACCACCAAC 
GGCACCGCCA 
GCAGCCCGGG 
ATCTTTGATC 
GACAACACTG 
AGGGACCGCT 
AACTCTGGAC 
CCAGGCTTCC 
AAGTGTGACT 
AAGCCAGCTG 
GGGGGGAACC 
AGCTCTGCAG 
TGGAAGGCTG 
GATGTGTTTA 



31 

I 

CGAGGCGCCG 
TGCCTGCGCT 
CCACCTCCAG 
GGGAACTTCA 
ATGGCATTCA 
GTTTGCCCTG 
GGTGCAGCTG 
ACATGCTCAC 
GTGACCCAGC 
TTACTGGAGA 
CTGAGGGCTG 
AATACAGTGT 
TCCAACGAAA 
GCTCAGCCCA 



41 
I 

GGCAGCGACC 
CTGGCTGGGC 
GAGGGAAGTC 
CAGACAAACT 
CTGCGAGAAG 
CAATTGTAAC 
TAAACCAGGT 
GGATGCGGGG 
TGGCATCGCA 
ACGCTGTGAT 
TACCCAGTGT 
CCATAAGATC 
TGGGTCTCCT 
ACGACTAGAC 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 587 Protein sequence 
Protein Accession #: NP_001318.1 

1 11 21 31 41 51 

i I I I I I 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMP FATPM EAELARRSLA QDAPPLPVPG 
VLLKEFTVSG NILTIRLTAA DRRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 

Seq ID NO: 588 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 52.. 459 



51 
1 

CATGCAGGCC 
CCCTGGCATT 
GGGCGGCAGA 
CCCGCGGGGT 
CAGGAGGCCG 
GCAGCTCTCC 
CTTTCTGCCC 
CGCCCCTTCC 
CATTGTGGGG 
GAAAATAAAG 



1 11 21 31 41 

I I I I I I 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QfcSISSCLQQ LSLLMWITQC 
FLPVFLAQAP SGQRR 

Seq ID NO: 590 DNA sequence 
Nucleic Acid Accession ft: NM_005S62.1 
Coding sequence: 90.. 3671 



51 
I 

CCTGCAGCGG 
TGCTGCCTCT 
TGTGATTGCA 
GGTAATGGAT 
TGCAAGAATG 
TCCAAAGGTT 
GTGACAGGAG 
TGCACCCAAG 
GGGCCCTGTG 
AGGTGTCGAT 
TTCTGCTATG 
ACCTCTACCT 
GCAAAGCTCC 
CCTGTCTATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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TTGTGG CTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AQCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 

* AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAG AT CCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 

AGTACCAGAA CCGAGTTOGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGAOGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGA CT CTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GA CAG CAG AT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GG AG AG CT G G AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3 600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAG AT CTCAG 3720 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAG CAG TGTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

AT CT T ATT TT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seq ID NO i 591 Protein sequence 
Protein Accession #: NP_005553.1 

1 11 21 31 41 51 

I I I 1 1 I 

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIPD RELHRQTGNG FRCLNCNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGP 120 

HMLTDAGCTQ DQRLLDSKCD CD PAG I AG PC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180 

PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPS AKD VILE GAGUUTAPL 300 

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYPEYRRLL RNLTALRIRA TYGEYSTGYI 360 

DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPP GTCIPCNCQG 420 

GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEW 480 
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CNNCPPGVTG ARCELCADGY FGDPFGBHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVXIQM DQPMQQLQRM EALISKAQGG DGWPDTELE GRMQQAEQAL 660 

QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

RLITQMQLSL AESEASLGNT NIPASDHYVG FNGPKSLAQE ATRLAESHVE SASNMEQLTR 780 

ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATQAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTIiVTRHMD EPKRTQKNLG 900 

NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960 

QVDNRKAEAE EAMKRLSYI3 QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

IEQEIGSLNL EANVTADGAL AMEXGLASLX SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080 

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 

Seg ID NO: 592 DNA sequence 

Nucleic Acid Accession ft: AP101051.1 

Coding sequence: 221.856 

1 11 21 31 41 51 

I I I I I I 

GAGCAACCTC AGCTTCTAGT ATCCAGACTC CAGCGCCGCC CCGGGCGCGG ACCCCAACCC 60 

CGACCCAGAG CTTCTCCAGC GGCGGCGCAG CGAGCAGGGC TCCCCGCCTT AACTTCCTCC 120 

GCGGGGCCCA GCCACCTTCG GGAGTCCGGG TTGCCCACCT GCAAACTCTC CGCCTTCTGC 180 

ACCTGCCACC CCTGAGCCAG CGCGGGCGCC CGAGCGAGTC ATGGCCAACG CGGGGCTGCA 240 

GCTGTTGGGC TTCATTCTCG CCTTCCTGGG ATGGATCGGC GCCATCGTCA GCACTGCCCT 300 

GCCCCAGTGG AGGATTTACT CCTATGCCGG CGACAACATC GTGACCGCCC AGGCCATGTA 360 

CGAGGGGCTG TGGATGTCCT GCGTGTCGCA GAGCACCGGG CAGATCCAGT GCAAAGTCTT 420 

TGACTCCTTG CTGAATCTGA GCAGCACATT GCAAGCAACC CGTGCCTTGA TGGTGGTTGG 480 

CATCCTCCTG GGAGTGATAG CAATCTTTGT GGCCACCGTT GGCATGAAGT GTATGAAGTG 540 

CTTGGAAGAC GATGAGGTGC AGAAGATGAG GATGGCTGTC ATTGGGGGTG CGATATTTCT 600 

TCTTGCAGGT CTGGCTATTT TAGTTGCCAC AGCATGGTAT GGCAATAGAA TCGTTCAAGA 660 

ATTCTATGAC CCTATGACCC CAGTCAATGC CAGGTACGAA TTTGGTCAGG CTCTCTTCAC 720 

TGGCTGGGCT GCTGCTTCTC TCTGCCTTCT GGGAGGTGCC CTACTTTGCT GTTCCTGTCC 780 

CCGAAAAACA ACCTCTTACC CAACACCAAG GCCCTATCCA AAACCTGCAC CTTCCAGCGG 840 

GAAAGACTAC GTGTGACACA GAGGCAAAAG GAGAAAATCA TGTTGAAACA AACCGAAAAT 900 

GGACATTGAG ATACTATCAT TAACATTAGG ACCTTAGAAT TTTGGGTATT GTAATCTGAA 960 

GTATGGTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT ACTCAGTGCT 1020 

AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGGAAG ATTTTACCAT 1080 

TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GGGAAGGGGT GCTCCTTAAA 1140 

TATATATAGA TATGTATATA TACATGTTTT TCTATTAAAA ATAGACAGTA AAATACTATT 1200 

CTCATTATGT TGATACTAGC ATACTTAAAA TATCTCTAAA ATAGGTAAAT GTATTTAATT 1260 

CCATATTGAT GAAGATGTTT ATTGGTATAT TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320 

CAGTCAAATA TCATTTACTC TTCTTCATTA GCTTTGGGTG CCTTTGCCAC AAGACCTAGC 1380 

CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCGT GCCCTTTTCA TATACTTATT 1440 

TTATTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT AAGCCCTTAT TTGTTTTGTG 1500 

TTTCATTGGT CTCTATCTCC TGAATCTAAC ACATTTCATA GCCTACATTT TAGTTTCTAA 1560 

AGCCAAGAAG AATTTATTAC AAATCAGAAC TTTGGAGGCA AATCTTTCTG CATGACCAAA 1620 

GTGATAAATT CCTGTTGACC TTCCCACACA ATCCCTGTAC TCTGACCCAT AGCACTCTTG 1680 

TTTGCTTTGA AAATATTTGT CCAATTGAGT AGCTGCATGC TGTTCCCCCA GGTGTTGTAA 1740 

CACAACTTTA TTGATTGAAT TTTTAAGCTA CTTATTCATA GTTTTATATC CCCCTAAACT 1800 

ACCTTTTTGT TCCCCATTCC TTAATTGTAT TGTTTTCCCA AGTGTAATTA TCATGCGTTT 1860 

TATATCTTCC TAATAAGGTG TGGTCTGTTT GTCTGAACAA AGTGCTAGAC TTTCTGGAGT 1920 

GATAATCTGG TGACAAATAT TCTCTCTGTA GCTGTAAGCA AGTCACTTAA TCTTTCTACC 1980 

TCTTTTTTCT ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAG AGGTAGTGTG 2040 

AATATTAATT AGTTTATATT ACTCTCATTC TTTGAACATG AACTATGCCT ATGTAGTGTC 2100 

TTTATTTGCT CAGCTGGCTG AGACACTGAA GAAGTCACTG AACAAAACCT ACACACGTAC 2160 

CTTCATGTGA TTCACTGCCT TCCTCTCTCT ACCAGTCTAT TTCCACTGAA CAAAACCTAC 2220 

ACACATACCT TCATGTGGTT CAGTGCCTTC CTCTCTCTAC CAGTCTATTT CCACTGAACA 2280 

AAACCTACGC ACATACCTTC ATGTGGCTCA GTGCCTTCCT CTCTCTACCA GTCTATTTCC 2340 

ATTCTTTCAG CTGTGTCTGA CATGTTTGTG CTCTGTTCCA TTTTAACAAC TGCTCTTACT 2400 

TTTCCAGTCT GTACAGAATG CTATTTCACT TGAGCAAGAT GATGTATGGA AAGGGTGTTG 2460 

GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 

CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGAAT ACATGTAAGT 2640 

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 

AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880 

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT 2940 

TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000 

TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 

CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT- GGACCTAATA AATTTTAGTG 3120 

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 

ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 

ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300 

TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360 

TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 
AAAAGGAAAA AAAAAAAAAA AAA 

Seq ID NO: 593 Protein sequence 
Protein Accession #: AAD16433.1 

1 11 21 31 41 SI 
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MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 

QIQCKVPDSL LNLSSTLQAT RALMWGIItL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120 

IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 180 
LLCCSCPRKT TSYPTPRPYP KPAP.SSGKDY V 
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Seq ID NO: 594 DNA sequence 

Nucleic Acid Accession #> NM_006180.l 

Coding sequence: 352.. 2820 

1 11 21 31 41 51 

I 1 I I 1 I 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020 

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 12 60 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 18 00 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT I860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGT CCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CX3TCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCOGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2860 

CAT CTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 33 60 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 

Seq ID NOi 595 Protein sequence 
Protein Accession 8: NP_006171.1 

1 11 21 31 41 51 

I I I I I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE I INEDDVEAY VGLRNLT I VD SGLKFVAHKA FLKNSNLQHI 120 

NPTRNKLTSli SRKHFRHLDL. SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNI SSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

IilAKNEYGKD EKQISAHFKG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQLKPDTFV QHIKRHNIVL 540 

KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLOHEHI 600 

VKPYGVCVEG DPL I MVFE YM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720 
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PPESIMYRKF TTESDVWSLO WLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780 
EVYELMLGCW QRHPHMRKNI KGIHTLLQNL AKASPVYLDI LG 

Seq ID NO: S96 DNA sequence 
Nucleic Acid Accession #1 AF410899 
Coding sequence: 483 . .2999 

1 11 21 31 41 51 

I I I I I I 

GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT CGCGCTCTAC QCGCTCAGTC CCCGGCGGTA 60 
GCAGGAGCCT GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAGG 120 
TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC QCCCCAGAGA GTCCCGGACG 180 
CCGCCGGTCG GTGCCCGGCG CGCCGGGCCA TGCAGCGACG GCCGCCGCGG AGCTCCGAGC 240 
AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCCGC 300 
CTGCCGGAAC ACTCTTCGCT COGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 360 
CCGCAACAAG CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 
GTGGGGGAAA GCGGCCGGTG CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG 480 
GGATGTCGT C CTGGATAAGG TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT 540 
GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT TCGCCTGTCC CACGTCCTGC AAATGCAGTG 600 
CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTTCCG AGATTGGAGC 660 
CTAACAGTGT AGATCCTGAG AACATCACCG AAATTTTCAT CGCAAACCAG AAAAGGTTAG 720 
AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 
ATTCTGGATT AAAATTTGTG GCTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 
TCAATTTTAC CCGAAACAAA CTGACGAGTT TGTCTAGGAA ACATTTCCGT CACCTTGACT 900 
TGTCTGAACT GATCCTGGTG GGCAATCCAT • TTACATGCTC CTGTGACATT ATGTGGATCA 960 

AGACTCTCCA AGAGGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020 

GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACCCAA TTGTGGTTTG CCATCTGCAA 1080 

ATCTGGCCGC ACCTAACCTC ACTGTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 

TGGCAGGTOA TCCGGTTCCT AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200 

TGAATGAAAC AAGCCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 

GTGGGAAGCA GATCTCTTGT GTGGCGGAAA ATCTTGTAGG AGAAGATCAA GATTCTGTCA 1320 

ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380 

ACTGGTGCAT TCCATTCACT GTGAAAGGCA ACCCCAAACC AGCGCTTCAG TGGTTCTATA 1440 

ACGGGGCAAT ATTGAATGAG TCCAAATACA TCTGTACTAA AATACATGTT ACCAATCACA 1500 

CGGAGTACCA CGGCTGCCTC CAGCTGGATA ATCCCACTCA CATGAACAAT GGGGACTACA 1560 

CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG 1620 

GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATTATCC TGATGTAATT TATGAAGATT 1680 

ATGGAACTGC AGCGAATGAC ATCGGGGACA CCACGAACAG AAGTAATGAA ATCCCTTCCA 1740 

CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTCGGT CTATGCTGTG GTGGTGATTG 1800 

CGTCTGTGGT GGGATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 

CCAAGTTTGG CATGAAAGAT TTCTCATGGT TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920 

GTGTTGGCCC AGCCTCCGTT ATCAGCAATG ATGATG ACT C TGCCAGCCCA CTCCATCACA 1980 

TCTCCAATGG GAGTAACACT CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2040 

GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT TGGCATCACC AACAGTCAGC 2100 

TCAAGCCAGA CACATTTGTT CAGCACATCA AGCGACATAA CATTGTTCTG AAAAGGGAGC 2160 

TAGGCGAAGG AGCCTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 2220 

AGGACAAGAT CTTGGTGGCA GTGAAGACCC TGAAGGATGC CAGTGACAAT GCACGCAAGG 2280 

ACTTCCACCG TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2340 

ATGGCGTCTG CGTGGAGGGC GACCCCCTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 

ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCGT GCTGATGGCT GAGGGCAACC 2460 

CGCCCACGGA ACTGACGCAG TCGCAGATGC TGCATATAGC CCAGCAGATC GCCGCGGGCA 2520 

TGGTCTACCT GGCGTCCCAG CACTTCGTGC ACCGCGATTT GGCCACCAGG AACTGCCTGG 2580 

TCGGGGAGAA CTTGCTGGTG AAAATCGGGG ACTTTGGGAT GTCCCGGGAC GTGTACAGCA 2640 

CTGACTACTA CAGGGTCGGT GGCCACACAA TGCTGCCCAT TCGCTGGATG CCTCCAGAGA 2700 

GCATCATGTA CAGGAAATTC ACGACGGAAA GCGACGTCTG GAGCCTGGGG GTCGTGTTGT 2760 

GGGAGATTTT CACCTATGGC AAACAGCCCT GGTACCAGCT GTCAAACAAT GAGGTGATAG 2820 

AGTGTATCAC TCAGGGCCGA GTCCTGCAGC GACCCCGCAC GTGCCCCCAG GAGGTGTATG 2880 

AGCTGATGCT GGGGTGCTGG CAGOGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC 2940 

ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTCCGGTCTA CCTGGACATT CTAGGCTAGG 3000 

GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAC 3060 

ATCTTTTAAC TGCCGCTGGA GGCCACCAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 3120 

TCAAAGACTC CGAGAAGCTC TCGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180 

TATTGACTTC TTTTTGGCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240 

T TCT T T TTTT AAATTTTCTT TTTCTTCTTT TTTTTCGTCT TCCCTGCTTC ACGATTCTTA 3300 

CCCTTTCTTT TQAATCAATC TGGCTTCTGC ATTACTATTA ACT CTGCAT A GACAAAGGCC 33 60 

TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT 3420 

GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 

AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3 540 

TTATTATTAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600 

ACTTGTGTTC AATCTGTGAA GCCTTTATCT ATGGGAGATT AAAACCAGAG AGAAAGAAGA 3660 

TTTATTATGA ACCGCAATAT GGGAGGAACA AAGACAACCA CTGGGATCAG CTGGTGTCAG 3720 

TCCCTACTTA GGAAATACTC AGCAACTGTT AGCTGGGAAG AATGTATTCG GCACCTTCCC 3780 

CTGAGGACCT TTCTGAGGAG TAAAAAGACT ACTGGCCTCT GTGCCATGGA TGATTCTTTT 3840 

CCCATCACCA GAAATGATAG CGTGCAGTAG AGAGCAAAGA TGGCTTCCGT GAGACACAAG 3 900 

ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3 960 

TTGTTTCTCA AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4020 
ATGTCCAGAG CTCATTTCGG GGTCAGGTGG GAAAGCC 

Seq ID NO: 597 Protein sequence 
Protein Accession #» AAL67965.1 

1 11 21 31 41 51 

t I I I I 1 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 
NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 
NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 
SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 
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WO 02/086443 

NETSHTQGSL RITNISSDDS GKQISCVAEN IiVOEOQOSVN 
WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT 
LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY 
DVTDKTGRSH LSVYAWVIA SWGPCLLVM LFLLKLARHS 
VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVI IG 
KPDTFVQHIK RHNIVLKREL GEGAFGKVPL AECYNLCPEQ 
FHREAELLTN LQHEHIVKFY GVCVEGDPLI MVFEYMKHGD 
PTELTQSQML HIAQQIAAGM VYLASQHPVH RDLATRNCLV 
DYYRVGGHTM LPIRWMPPES IMYRKFTTES DVWSLGWLW 
CITQGRVLQR PRTCPQEVYE LMLGCWQREP HMRKMIKGIH 

Seq ID NO: 596 DMA sequence 
Nucleic Acid Accession U> AB052906 
Coding sequence: 74.. 814 



PCT/US02/12476 



LTVHFAPTIT 
EYHGCLQLDN 
GTAANDIGDT 
KFGMKDPSWF 
MTKIPVIENP 
DKILVAVKTL 
LNKFLRAHGP 
GENLLVKIGD 
EIFTYGKQPW 
TLLGNLAKAS 



FLBSPTSDHH 
PTHMNNGDYT 
TNRSNEIPST 
GFGKVKSRQG 
QYFGITNSQL 
KDASDNARKD 
DAVLMAEGNP 
FGMSRDVYST 
YQLSNNEVIE 
PVYLDILG 



1 
I 

AAAACCTTGA 
CTCTGGGTCC 
GCTCCTGCTG 
CATCACCGTC 
GGATGAAAA3 
CCTGGGGAAG 
GGTGGTGGAC 
GGAACCCCTC 
TGGATCTTGG 
AATGTGGACA 
GGTTGTGGCC 
CTTCTTGATG 
CTCAGGCACA 
CATCCTCCCC 
AAGCTGATAC 
CCAGCTGCCC 
TGGACCCAAT 
TACCTAACAT 
TTCTGGCTGA 
GTACTTCTTT 
TAGACTTCAG 
ATAAGAAAAA 
TTTAAATAAA 



11 
I 

GGTGATTCAT 
TTAATGGCAG 
TCCGGCTGGT 
ATCCCTAAGT 
ACTTTTCTTC 
AAACTAAATG 
ATACTTACAG 
ACCCTGCAGG 
CAGTTCAGTT 
ACGGTTCATC 
ATGTCCTTCC 
GGCATGGACA 
ACCCAACTCA 
TGCTTCATCC 
CAAAAGGCTC 
ACGACCTACG 
AGCTCATTCA 
ATTATGCAAT 
CTAAACAAGA 
GAATGATGAT 
ACCTCTGGGG 
ATTTATATTA 
GAGTTCTATT 



21 
I 

CTTCCAGGCT 
CAGCCGCCGC 
CCCGGGCTGG 
TCAGACCTGG 
ACTATGACTG 
TCACAACGGC 
AGCAACTGCG 
CCAGGATGTC 
TCGATGGGCA 
CTGGAGCCAG 
ATTACTTCTC 
GCACCCTGGA 
GGGCCACAGC 
TCCCTGGCAT 
CTGTGAGCAC 
GTGTATGTCC 
CTGCCTTGAT 
TTTCTCTTGG 
TATATCATTT 
CTCTTTCTTG 
ATTCTTTCCG 
ATGATTGTTT 
TCCCAAAAAA 



31 
I 

CTCCTTCCAT 
TACCAAGATC 
GCGAGCCGAC 
ACCACGGTGG 
TGGCAACAAG 
CTGGAAAGCA 
TGACATTCAG 
TTGTGAGCAG 
GATCTTCCTC 
AAAGATGAAA 
AATGGGAGAC 
GCCAAGTGCA 
CACCACCCTC 
CTGAGGAGAG 
GGTCTTGATC 
AGTGGCCTCC 
TCCTTTTGCC 
TGCTACCTGA 

CAAATGATAT 
TGTCCTGAAA 
CCTTTAGTAA 
AAAAAAAAAA 



41 
1 

CAAGTCTCTC 
CTTCTGTGCC 
CCTCACTCTC 
TGTGCGGTTC 
ACAGTCACAC 
CAGAACCCAG 
CTGGAGAATT 
AAAGCTGAAG 
CTCTTTGACT 
GAAAAGTGGG 
TGTATAGGAT 
GGAGCACCAC 
ATCCTTTGCT 
TCCTTTAGAG 
AAACTCGCCC 
AGCAGAT CAT 
AACAATTTTA 
TGGAATTCCT 
CTTTTTGTTT 
TGTCAGTAAA 
GAGAATTTTT 
TTTATTGTTC 
AA 



51 
I 

CTCCCTAGCG 
TCCCGCTTCT 
TTTGCTATGA 
AAGGCCAGGT 
CTGTCAGTCC 
TACTGAGAGA 
ACACACCCAA 
GACACAGCAG 
CAGAGAAGAG 
AGAATGACAA 
GGCTTGAGGA 
TCGCCATGTC 
GCCTCCTCAT 
TGACAGGTTA 
TTCTGTCTGG 
GATGACATCA 
CCAGCAGTTA 
GCACTTAAAG 
GGAAAATCAA 
ATAATCACGT 
AAATTATTTA 
TGTACTGATA 



Seq ID NO: 599 Protein sequence 
Protein Accession #r BAB61048.1 

1 n 21 31 41 51 

I I I I I I 

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 
PIiHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NFVLREWDI LTEQLRDIQL ENYTPKEPLT 
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKWAM 
SFHYFSMGDC IGWIiEDFI*MG MDSTLEP SAG APLAMSSGTT QLRATATTLI LCCLLIILPC 
FZLPGZ 

Seq ID NO: 600 DNA sequence 

Nucleic Acid Accession At nm_001898.i 

Coding sequence: 57.. 482 



1 

I 

GGCTCTCACC 
CCCAGTATCT 
GCCCCAAGGA 
AGTGGGTACA 
ACTACTACAG 
ATTACTTCTT 
ACACCTGTGC 
TCTACGAAGT 
AGGGATCTGT 
CCACCCCTGG 
GACAGACAGA 
CTTCCTTCTT 
AAACAGTAGC 



11 

I 

CTCCTCTCCT 
GAGTACCCTG 
GGAGGATAGG 
GCGTGCCCTT 
ACGTCCGCTG 
CGACGTAGAG 
CTTCCATGAA 
TCCCTGGGAG 
GCCAGGCCAT 

GAAGGCTGCA 
GCTTCTAATA 
ATCGCC 



21 
I 

GCAGCTCCAG 
CTGCTCCTGC 
ATAATCCCGG 
CACTTCGCCA 
CGGGTACTAA 
GTGGGCCGCA 
CAGCCAGAAC 
AACAGAAGGT 
TCGCACCAGC 
CCCACCCTGC 
GGAGTCCTTT 
GCCCTGGTAC 



31 
I 

CTTTGTGCTC 
TGGCCACCCT 
GTGGCATCTA 
TCAGCGAGTA 
GAGCCAGGCA 
CCATATGTAC 
TGCAGAAGAA 
CCCTGGTGAA 
CACCACCCAC 
GGGAGGCCTC 
GTTGCTCAGC 
ATGGTACACA 



41 
I 

TGCCTCTGAG 
AGCTGTGGCC 
TAACGCAGAC 
TAACAAGGCC 
ACAGACCGTT 
CAAGTCCCAG 
ACAGTTGTGC 
ATCCAGGTGT 
TCCCACCCCC 
CCCATGTGCC 
AGGGCGCTCT 
CCCCCCCACC 



300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



60 
120 
180 
240 



51 
I 

GAGACCATGG 
CTGGCCTGGA 
CTCAATGATG 
ACCAAAGATG 
GGGGGGGTGA 
CCCAACTTGG 
TCTTTCGAGA 
CAAGAATCCT 
TGTAGTGCTC 
TGCGCCAAGA 
GCCCTCCCTC 
TCCTGCAATT 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 601 Protein sequence 
Protein Accession #> NPJJ01B89.1 

1 11 21 31 41 51 

I I 1 I I I 

MAQYLSTLLL LLATLAVAIiA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK 

DDYYRRPLRV LRARQQTVGG VNYFFDVBVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 

EIYBVPWENR RSLVKSRCQE S 

Seq ID NO: 602 DNA sequence 

Nucleic Acid Accession #: NM_003976.2 

Coding sequence: 299.961 



60 
120 



11 

I 



21 
I 



31 
I 



41 
I 



51 
1 



419 



WO 02/086443 

CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60 

CATGGAGTTG TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120 

CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180 

GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240 

CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300 

GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360 

TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420 

GGGCTCCGCG CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480 

CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540 

GCCGCCGCAG CCTTCTCGGC CCGCGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600 

QGGGGQCCGC QCGGCQCGGQ CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660 

GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720 

CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780 

CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG OGACCGCCCC CGGGCTCCCG 840 

GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900 

CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960 

AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020 

CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080 

AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140 

CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200 

GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260 

CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG CCCTGTAGGG 1320 

ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 1380 
CCTGTACTCA CTCATGGGAG CTGGCCCC 

Seq ID NO: 603 Protein sequence 

Protein Acceesion #: NP_003967.l . . 

1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REQPPPVLAS 60 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGS RARAAGA 120 

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 604 DNA sequence 
Nucleic Acid Accession #: NM_O570 91.1 
Coding sequence: 783.. 1445 

1 11 21 31 41 51 

I I I I 1 I 

ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60 

GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 120 

TCGCTCCCCQ CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 180 

CGCGTGTCTA CAAACTCAAC TCCCGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240 

CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 300 

CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCCG AAAGGTGGGG 360 

CGGGGCAGGG GCGCTCCCAG CCCCACCCCG GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420 

CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTTGG GCCCGGCCCC 480 

CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA 540 

CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600 

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660 

GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720 

TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780 

AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900 

CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960 

CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020 

GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080 

CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140 

OGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200 

GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCOGCOGC GCGCGCTCTC 1260 

CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 

CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380 

ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440 

GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1S00 

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC 1560 

TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620 

CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 

CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 

GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800 

AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860 
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 605 Protein sequence 
Protein Accession #i NP_003967.1 

1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PG 3 RARAAGA 120 

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 



Seq ID NO: 606 DNA sequence 

Nucleic Acid Acceesion #i NM_057160.1 



420 



WO 02/086443 

Coding sequence? 1..714 



PCT/US02/12476 



1 11 21 31 41 51 

« I I I I I I 

J ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60 

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180 

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

CACCTGCCGG GGGGACG CAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300 

10 CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GGACCCCCAT CTGCTCTTCC CCGCGGGGGC 3 60 

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420 

CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540 

AG CCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

15 AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660 

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840 

TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900 

20 AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960 

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 
TCACTCATGG GAGCTGGCCC C 



25 



35 



65 



Seg ID NO: 607 Protein sequence 
Protein Accession »i np 476S01.1 



1 11 21 31 41 51 

30 | | | | I 1 

MPGLISARGQ PLLBVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60 
APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120 
RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180 
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSPMDVNS TWRTVDRLSA TACGCLG 



Seq ID NO : 608 DNA sequence 
Nucleic Acid Accession ft: NM_0570 90. 
Coding sequence: 29.. 715 



40 1 11 21 31 41 51 

I I I I I I 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120 

GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180 

45 CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240 

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 3 00 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGGC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360 

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420 

CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480 

50 GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540 

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600 

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 

CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720 

CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780 

55 GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840 

CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900 

CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020 

TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080 

60 TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140 
CTCACTCATG GGAGCTGGCC CC 



Seq ID NO: 609 Protein sequence 
Protein Accession #i NP 476431.1 



1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAAItAL LSSVAEASLG SAPRSPAPRE 60 
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120 
70 SRARAAGARG CRLRSQLVPV RAIX5LGHRSD EhVRFRFCSG SCRRARSPHD LSUVSLLGAG 180 
AIiRPPPGSRP VSQPCCRPTR YEAVSFMDVN STWRTVDRLS ATACGCLG 

Seq ID NO: 610 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
75 Coding sequence: 1..1746 

1 11 21 31 41 51 

1 I I I I I 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 

80 GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120 

GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 

GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 

85 TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420 

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 
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GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 
GTCCTCCGGC TGTATGAGAA CAGGCTCACO GATATCCCCA TGGGCACTTT TGATGGGCTT 660 
GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 
TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 
CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 
CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 
TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 

GTCCTGATTC TTAGCOGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAAOGGGCTA 1020 

ACGGAGCTTC GGQAGCTQTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 

TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 

CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 

CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260 

TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320 

AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 

GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 

TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 

ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620 

ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 

TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 

TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 

TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860 

CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 1920 

CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980 

GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040 

ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 

AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160 

CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220 

ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 

TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340 

CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400 

AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 

CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520 

TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 

AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 

CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 

GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 

CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820 

GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 

TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940 

TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000 

AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060 

GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120 

TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 

ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 

GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300 

TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360 

GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 

TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 

CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 

CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600 

AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3 660 

TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3 720 

TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780 

CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3 840 

CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGG ACAAAAAGTG TCGAGAGGGC 3900 

TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3 960 

AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 40 20 

ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 

CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 

GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCQ 4200 

GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 

GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 

GGTATTCCTG G CAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380 

GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 

CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 

GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 

TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620 

GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 

CCTCTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740 

GTGTTGGAGA AGAAACAACA AAAGCCAATT AGAACCACTA TTTTTAAAAA GTGCTTACTG 4800 

TGCACAGATA CTCTTCAAGC ACTGGACGTG GATTCTCTCT CTAGCCCTCA GCACCCCTGC 4860 

GGTAGGAGTG CCGCCTCTAC CCACTTGTGA TGGGGTACAG AGGCACTTGC TCTTCTGCAT 4920 

GGTGTTCAAT AGGCTGGGAG TTTTATTTAT CTCTTCAAAC TTTGTACAAG AGCTCATGGC 4980 

TTGTCTTGGG CTTTCGTCAT TAAACCAAAG GAAATGGAAG CCATTCCCCT GTTGCTCTCC 5040 

TTAGTCTTGG TCATCAGAAC CTCACTTGGT ACCATATAGA TCAAAAGCTT TGTAACCACA 5100 

GGAAAAAATA AACTCTTCCA TCCCTTAAAG AATAGAATAG TTTGTCCCTC TCATGGGAAT 5160 

TGGGCTGTAT GTATATTGTT CTTCCTCCTT AGAATTTAGA GATACAAGAG TTCTACTTAG 5220 

AACTTTTCAT GGACACAATT TCCACAACCT TTCAGATGCT GATGTAGAGC TATTGGGAAA 5280 

GAACTTCCAA ACT CAGG AAG TTTGCAGAGA GCAGACAGCT AGAGATAACT CGGGACCCAG 5340 

AGTTGGTCGA CAGATGTTAG ATGTATCCTA GCTTTTAGCC ATAAACCACT CAAAGATTCA 5400 

GCCCCCAGAT CCCACAGTCA GAACTGAATC TGCGTTGTTG GGAAGCCAGC AGTGGCCTTG 5460 

GGAAGGAAGC CATGGCTGTG GTTCAGAGAG GGTGGGCTGG CAAGCCACTT CCGGGGAAAA 5520 

CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT AAGGAGAGAT TGTTCTCACC AACCCGCTGC 5580 

CTTCATGCTG CCTTCAAAGC TAGATCATGT TTGCCTTGCT TAGAGAATTA CTGCAAATCA 5640 

GCCCCAGTGC TTGGGGATGC ATTTACAGAT TTCTAGGCCC TCAGGGTTTT GTAGAGTGTG 5700 

AGCCCTGGTG GGCAGGGTTG GQGGGTCTGT CTTCTGCTGG ATGCTGCTTG TAATCCATTT 5760 
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GGTGTACAOA ATCAACAATA AATAATATAC ATGTAT 



PCT/US02/12476 
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Seq ID NO i 611 Protein sequence 
Protein Accession »t BAB64S87.1 



1 
I 

MPLKHYLLLL 
LNTHITELNE 
FQGLDSLESL 
GKNSLTHISP 
FHNNHNLQRL 
YDNHISSLPD 
FRMLANLQNI 
YDNPWRCDSD 
VPSYPETPWY 
IAAIVIQIVA 



11 

I 

VGCQAWOAGL 
SPFLNISALI 
IiLSSNQLLQI 
RVFQHLGNLQ 
YLSNNHISQL 
NVFSNLRQLQ 
SLQNNRLRQL 
ILPLRNWtiLL 
PDTPSYPDTT 
LACSLAACVG 



21 
I 

AYHGCPSECT 
ALRIEKNELS 
QPAHPSQCSN 
VLRIiYENRLT 
PPSIFMQLPQ 
VLILSRNQIS 
PGNIPANVNG 
NQPRLGTDTV 
SVSSTTELTS 
CCCCKKRSQA 



31 
i 

CSRASQVECT 
RITPGAFRNL 
LKELQLHGNH 
DIPMGTFDGIi 
LNRLTLFGNS 
FISPGAFNGli 
LMAIQLQNNQ 
PVCPSPANVR 
PVEDYTDLTT 
VLMQMKAPNE 



41 
I 

GARIVAVPTP 
GSLRYLSLAN 
LEY I PDGAFD 
VNLQELALQQ 
LKELSLGIFG 
TEIiRELSLHT 
LENLPLGIFD 
GQSLIIINVN 
IQVTDDRSVW 
C 



Seq ID NO: 612 DNA sequence 
Nucleic Acid Accession #* XM_098151 
Coding sequences 1..447 



1 

I 

ATGATGCATT 
AGTGGGGTAC 
. TCTGGAGTGG 
CTATTTTTAA 
TCTCTAGCCC 
CAGAGGCACT 
AACTTTGTAC 
AAGCCATTCC 



11 
I 

TGCTCAATTC 
AGAGTTCAGT 
GAGCTGGGAG 
AAAGTGCTTA 
TCAGCACCCC 
TGCTCTTCTG 
AAGAGCTCAT 
CCTGTTGCTC 



21 
I 

TCAGGGCTGG 
TTTCCTCTCT 
TCAGTGTTGG 
CTGTGCACAG 
TGCGGTAGGA 
CATGGTGTTC 
GGCTTGTCTT 
TCCTTAG 



31 
I 

AATGAGCCGG 
GTTTACAGCT 
AGAAGAAACA 
ATACTCTTCA 
GTGCCGCCTC 
AATAGGCTGG 
GGGCTTTCGT 



41 

I 

CTGGTCCCCC 
CCTTGACAGT 
ACAAAAGCCA 
AGCACTGGAC 
TACCCACTTG 
GAGTTTTATT 
CATTAAACCA 



51 

[ 

LPWNAMSLQI 
NKLQVLPIGL 
HLVGLTKLNL 
NQIGLLSPGL 
PMPNLRELWL 
NALQDLDGNV 
HLGKLCELRL 
VAVPSVHVPE 
GMTQAQSGLA 



SI 
I 

AGAAAGCTGG 
CCCACGCCCA 
ATTAGAACCA 
GTGGATTCTC 
TGATGGGGTA 
TATCTCTTCA 
AAGGAAATGG 



Seq ID NO s 613 Protein sequence 
Protein Acceoeion H s XP_0981S1 



1 11 21 31 41 51 

i I I I l I 

MMHLLNSQGW NEPAGPPESW SGVQSSVPLS VYSSLTVPRP SGVGAGSQCW RRNNKSQLEP 
L FLKSAYCAQ ILFKHWTWIL SLALSTPAVG VPPLPTCDGV QRHLLFCMVF NRLGVIiFISS 
NFVQELMACL GLSSLNQRKW KPFPCCSP 

Seq ID NOt 614 DNA sequence 

Nucleic Acid Accession #: NM_002658.1 

Coding sequence: 77.. 1372 



GTCCCCGCAG 
CCCCGACCTC 
GAGCGACTCC 
TGGAGGAACA 
GAAATTCGGA 
TCACTTTTAC 
CTCTGCCACT 
CCTGGGGAAA 
GCAGGTGGGC 
AAAGCCCTCC 
CCGCTTTAAG 
CATCTACAGG 



CATCGTCTAC 
GGTGGAAAAC 
CATTGCCTTG 
ACAGACCATC 
CACTGGCTTT 
TGTTGTGAAG 
CACCACCAAA 
CTCAGGGGGA 
CTGGGGCCGT 
CTTACCCTGG 
AGGGAGGAAA 
TCCATCAGCT 
CACCACCAGG 
CAGACCCTCT 
TGTCTTTTTC 
GGCTCGAAGG 
AATGAATAAT 
AATGTGGGAG 
ATTCCATGAA 
GCTGTGAGTG 
AAACTGTGTG 
CTGGGGCCTC 
ACCTGTGACC 
ATCCCTTCCT 
ACACTGAATA 
ATCAATAAAA 



11 

I 

CGCCGTCGCG 
GCCACCATGA 
AAAGGCAGCA 
TGTGTGTCCA 
GGGCAGCACT 
CGAGGAAAGG 
GTCCTTCAGC 
CATAATTACT 
CTAAAGCCGC 
TCTCCTCCAG 
ATTATTGGGG 
AGGCACCGGG 
GTGATCAGCG 
CTGGGTCGCT 
CTCATCCTAC 
CTGAAGATCC 
TGCCTGCCCT 
GGAAAAGAGA 
CTGATTTCCC 
ATGCTATGTG 
CCCCTCGTCT 
GGATGTGCCC 
ATCCGCAGTC 
CGGGCACCAC 
GTAAGAAGAG 
GTGAACGACA 
GGCCAGGATG 
TGGACTGAAG 
GAGAGCCAGC 
TTCCCAATTA 
CAGCGGTTTG 
TGTATCAGGA 
TAAGTGTGAG 
GACTGTGATG 
TTGGGTCCCC 
AGCACTGTCT 
TTTAGCCTAG 
TTTATATTTC 
TGTGATTTTT 



21 
I 

CCCTCCTGCC 
GAGCCCTGCT 
ATGAACTTCA 
ACAAGTACTT 
GTGAAATAGA 
CCAGCACTGA 
AAACGTACCA 
GCAGGAACCC 
TTGTCCAAGA 
AAGAATTAAA 
GAGAATTCAC 
GGGGCTCTGT 
CCACACACTG 
CAAGGCTTAA 
ACAAGGACTA 
GTTCCAAGGA 
CGATGTATAA 
ATTCTACCGA 
ACCGGGAGTG 
CTGCTGACCC 
GTTCCCTCCA 
TGAAGGACAA 
ACACCAAGGA 
CCGCTTTCTT 
ACTGGGAAGA 
ATAGCTTTAC 
GAGGGGTGGT 
CCTGCAGGAG 
TCCCCCGACC 
GGAAGTGTAA 
GGGAGCAGAG 
AATATATATG 
TAAGAGCTGG 
CCACACAGAG 
CACGTGACAG 
CAGTTTCACT 
TTCATCCAAT 
ACTATTTTTA 
CTGA 



31 
I 

GCAGGCCACC 
GGCGCGCCTG 
TCAAGTTCCA 
CTCCAACATT 
TAAGTCAAAA 
CACCATGGGC 
TGCCCACAGA 
AGACAACCGG 
GTGCATGGTG 
ATTTCAGTGT 
CACCATCGAG 
CACCTACGTG 
CTTCATTGAT 
CTCCAACACG 
CAGCGCTGAC 
GGGCAGGTGT 
CGATCCCCAG 
CTATCTCTAT 
TCAGCAGCCC 
CCAATGGAAA 
AGGCCGCATG 
GCCAGGCGTC 
AGAGAATGGC 
GCTGGTTGTC 
TAGGCTCTGC 
CCTCACGGAT 
CCTGACTCAA 
TTAAAAAGGG 
GGTGGGCATT 
GCAGCTGAGG 
ACACTAACGA 
TGTGTGTATG 
TGTCTGATTG 
TGGTCTTTCT 
TGCCTGGGAA 
TTCACATAGA 
CCTCACTGGG 
TTTATATTTT 



41 

! 

GAGGCCGCCG 
CTTCTCTGCG 
TCGAACTGTG 
CACTGGTGCA 
ACCTGCTATG 
CGGCCCTGCC 
TCTGATGCTC 
AGGCGACCCT 
CATGACTGCG 
GGCCAAAAGA 
AACCAGCCCT 
TGTGGAGGCA 
TACCCAAAGA 
CAAGGGGAGA 
ACGCTTGCTC 
GCGCAGCCAT 
TTTGGCACAA 
CCGGAGCAGC 
CACTACTACG 
ACAGATTCCT 
ACTTTGACTG 
TACACGAGAG 
CTGGCCCTCT 
ATTTTTGCAG 
ACAGATGGAT 
AGGCCTGGGT 
CATGTTACTG 
CAGGGCATCT 
TGTGAGGCCC 
TCTCTTGAGG 
CTTCAGGGCA 
TTTGCACACT 
TTAAGTCTAA 
GGAGAGGTTA 
TGTACTTATT 
TGTCCCTTTC 
TGGGGTGAGG 
TGTAATTTTA 



ACCAGCAACT 
CCTGTGCATG 
ATGGTTGAGA 
GAGCTTAGCC 
GGGCTCTGAT 
TGTTGTGTGG 
ATATTTCCTT 
TAGGTCACTC 
CTGCAGCATG 
TTGGCCAGTT 
ACCACTCCTT 
AATAAAAGTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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Seq ID NO: 615 Protein sequence 
Protein Accession #: NP_002649.1 

1 11 21 31 41 51 

I I I I I I 

MRALLARLLL CVLWSDSKG SNEJJHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60 

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKP9SP PEELKFQCGQ KTLRPRFKII 180 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240 

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300 

PSMYNDPQFQ TSCEITGFGK ENSTDYLYPE QLKMTWKLI SHRECQQPHY YGSEVTTKML 360 

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 
SHTKEENGLA L 

Seq ID NO: 616 DNA sequence 

Nucleic Acid Accession #: NM_024422.1 

Coding sequence* 202.. 2907 

1 11 21 31 41 51 

I I I 1 I 1 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAOChQCGGQ CCCAGACTGC 120 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 3 60 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 460 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCT CGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACT CGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 90 0 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 102 0 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACAT TT ACT C GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAG CAT ATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT T AGT CTGG AG 2040 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC AT AGCATTG C TCTTTTG CAT CCTGTTTACG 2340 

CTGGTCTGTG GGG CTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520 

TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580 

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAA AAGTGTATCT GTGTAATCAA GATGAAAATC ACAAGCATGC CCAAGACTAT 2760 

GTCCTGACAT ATAACTATGA AGGAAGAGGA TCGGTGGCTG GGTCTGTAGG TTGTTGCAGT 2820 

GAACGACAAG AAGAAGATGG GCTTGAATTT TTGGATAATT TGGAGCCCAA ATTTAGGACA 2880 

CTAGCAGAAG CATGCATGAA GAGATGAGTG TGTTCTAATA AGTCTCTGAA AGCCAGTGGC 2940 

TTTATGACTT TTAAAAAAAA TTACAAACCA AGAATTTTTT AAAGCAGAAG ATGCTATTTG 3000 

TGGGGGTTTT TCTCTCATTA TTTGGATGGA ATCTCTTTGG TCAAATGCAC ATTTACAGAG 3060 

AGACACTATA AACAAGTACA CAAATTTTTC AATTTTTACA TATTTTTAAA TTACTTATCT 3120 

TCTATCCAAG GAGGTCTACA GAGAAATTAA AGTCTGCCTT ATTTGTTACA TTTGGGTATA 3180 

ATGACAACAG CCAATTTATA GTGCAATAAA ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240 

TATTTGAAGC ACAACCTAAT GGAAAATTGT AGAGACCTTG CTTTAACATT ATCTCCAGTT 3300 

AATTAAGTGT TCATGTGGTG CTTGGAAACT GTTGTTTTCC TGAACATCTA AAGTGTGTAG 3360 

ACTGCATTCT TGCTATTATT TTATTCTTGT AATGTGACCT TTTCACTGTG CAAAGGGAGA 3420 
TTTCTAGCCA GGCATTGACT AT TACAATTT CATT 

Beq ID NOt 617 Protein sequence 
Protein Accession 8: NP_077740.l 

1 11 21 31 41 51 

| I I I I 1 

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60 

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL 120 
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KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPI»FLQQVQ SDTAQNYTIY YSIRGPGVDQ 180 

EPRNLFYVER DTGNIiYCTRP VDREQYESFE I I AFATTPDG YTPELPLPLI IKIEDEMDNY 240 

PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTFTRTSY 360 

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCW 420 

KPLNYEEKQQ MILQIGWNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480 

KENAEVGTTS NGYKAYD PET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 540 

KNGIYNITVL ASDQGGRTCT GTLGI ILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600 

EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYWPI TVRDRLGMSS 660 

VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720 

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEKVYL 840 

CNQDENHKHA QDYVLTYNYE GRGSVAGSVG CCSERQEEDG LEFLDNLEPK FRTLAEACMK 900 
R 

Seq ID NO i 618 DNA sequence 

Nucleic Acid Accession #: NMJ>04949.1 

Coding sequence: 202.. 2745 

1 11 21 31 41 51 

I' I I I I I 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

OACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

AC CAT TTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA ,2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520 

TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580 

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAG AATCCATTAG AGGACACACT CTGATTAAAA ATTAAACAAT GAAAGAAAGT 2760 

GTATCTGTGT AATCAAGATG AAAATCACAA GCATGCCCAA GACTATGTCC TGACATATAA 2820 

CTATGAAGGA AGAGGATCGG TGGCTGGGTC TGTAGGTTGT TGCAGTGAAC GACAAGAAGA 2880 

AGATGGGCTT GAATTTTTGG ATAATTTGGA GCCCAAATTT AGGACACTAG CAGAAGCATG 2940 

CATGAAGAGA TGAGTGTGTT CTAATAAGTC TCTGAAAGCC AGTGGCTTTA TGACTTTTAA 3000 

AAAAAATTAC AAACCAAGAA TTTTTTAAAG CAGAAGATGC TATTTGTGGG GGTTTTTCTC 3060 

TCATTATTTG GATGGAATCT CTTTGGTCAA ATGCACATTT ACAGAGAGAC ACTATAAACA 3120 

AGTACACAAA TTTTTCAATT TTTACATATT TTTAAATTAC TTATCTTCTA TCCAAGGAGG 3180 

TCTACAGAGA AATTAAAGTC TGCCTTATTT GTTACATTTG GGTATAATGA CAACAGCCAA 3240 

TTTATAGTGC AATAAAATGT AATTAATTCA AGTCCTTATT ATAGACTATT TGAAGCACAA 3300 

CCTAATGGAA AATTGTAGAG ACCTTGCTTT AACATTATCT CCAGTTAATT AAGTGTTCAT 3360 

GTGGTGCTTG GAAACTGTTG TTTTCCTGAA CATCTAAAGT GTGTAGACTG CATTCTTGCT 3420 

ATTATTTTAT TCTTGTAATG TGACCTTTTC ACTGTGCAAA GGGAGATTTC TAGCCAGGCA 3480 
TTGACTATTA CAATTTCATT 

Seq ID NOt 619 Protein sequence 
Protein Accession ft: NP_004940.1 

1 11 21 31 41 51 

I I I I I I 

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEXLVG RVNLKECFTA 60 
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ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSPTILLSN TENQEKKKIF VFLEHQTKVL 120 

KKRHTKBKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180 

EPRNLFYVER DTGNLYCTRP VDREQYESPE I I APATTPDG YTPELPLPLI IKIEDENDNY 240 

PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

5 TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINXDDVND HLPTFTRTSY 360 

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCW 420 

KPLNYEEKQQ MILQIGWNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480 

KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 540 

KNGIYNITVL ASDO.GGRTCT GTLGI ILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600 

1 0 EPIHGPPFDF SLBSSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYWPI TVRDRLGMSS 660 

VTSLOVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720 

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEESIR 840 
GHTLIKN 



15 



40 



70 



Seq ID NOt 620 DNA sequence 

Nucleic Acid Accession ft: NM_032545.1 

Coding sequence : 46 . . 718 



20 1 11 21 31 41 51 

I I I I I I 

AAACTGATCT TCAATGCACT AAGAGAAGGA GACTCTCAAA CCAAAAATGA CCTGGAGGCA 60 

CCATGTCAGG CTTCTGTTTA CGGTCAGTTT GGCATTACAG ATCATCAATT TGGGAAACAG 120 

CTATCAAAGA GAGAAACATA ACGGCGGTAG AGAGGAAGTC ACCAAGGTTG CCACTCAGAA 180 

25 GCACCGACAG TCACCGCTCA ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC 240 

CGAGGGCTGG GGGCCGGAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 300 

CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 360 

CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 420 

CCTGGAGCAC GGAGCCTGGA CCCTCCGCQC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 480 

30 CCTGCACTGC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC 540 

CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 600 

ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC TGGTCCCTTC 660 

CGTCCTCCAG CQGQhGCGGC GCCCCTGCGG AAGGCCGGGA CTTGGGfCATC GCCTTTAATT 720 

TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 780 

35 TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 840 
AAAAAA 



Seq ID NO: 621 Protein sequence 
Protein Accession ft: NP 115934.1 



11 21 31 41 SI 

I I I I I I 

MTWRHHVRLL FTVSLALQII NLGNSYQREK HNGGREEVTK VATQKHRQSP LNWTSSHFGB 60 
VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR 120 
45 SECGALEHGA WTLRACHLCR CIFGAXjHCLP LQTPDRCDPK DFLASHAHGP SAGGAPSLLL 180 

LLPCALLHRL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL 

Seq ID HO: 622 DNA sequence 
Nucleic Acid Accession ft: FGENESH predicted 
50 Coding sequence: 1..390 

1 11 21 31 41 51 

I I I I I I 

ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 60 

55 TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 120 

GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 180 

CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC 240 

TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 300 

TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 360 
60 AGTAAATGTG AAAGAGGCCG GATATGTTAG 

Seq ID NO: 623 Protein sequence 
Protein Accession ft: FGENESH predicted 

65 1 11 21 31 41 51 

I I I I I I 

MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 60 
QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FWKLKVQGV NSQCHSSPIS 120 
SKCERGRIC 



Seq ID NO: 624 DNA sequence 
Nucleic Acid Accession ft: M18728.1 
Coding sequence: 51. .1085 



75 1 11 21 31 41 51 

I 1 I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

80 ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTQTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

85 TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 
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GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAOAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA B40 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 
TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT .1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCT7GGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACT CAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAG CACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CT AT CACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NOt 625 Protein sequence 
Protein Accession #i AAAS9907.1 

1 11 21 31 41 51 

I I I I I I 

MGPPSAPPCR LHVFWKEVLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLAHNLPQ 60 

NRIGYSWYKG ERVDGNSt.IV GYVIGTQQAT PGPAYSGRET IYPNASLLIQ NVTQNDTGFY 120 

TLQVIKSDLV NEEATGQFHV YPELPKPSIS SNNSNPVEDK DAVAFTCEPE VQNTTYLWWV 180 

NGQSLPVSPR LQLSNGNMTL TLLSVKRNDA GSYECEIQNP ASANRSDPVT LNVLYGPDVP 240 

TISPSKANYR PGENLNLSCH AASNPPAQYS WFINGTFQQS TQEItFIPNIT VNNSGSYMCQ 300 
AHNSATGLNR TTVTMITVSG SAFVLSAVAT VGITIGVLAR VALI 

Seq ID NOt 626 DNA sequence 
Nucleic Acid Accession «: M18728.1 
Coding sequence: 13 55.. 165 7 

1 11 21 31 41 51 

I 1 I I 1 I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TOGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCQT GGAGGACAAG GATGCTGTGG 540 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAOAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 13 80 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 
TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG. 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 
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TCTCACCTAG GTGAGCGCAT TGAGCCAGTQ GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 627 Protein sequence 
Protein Accession #: AAA59908.1 

1 11 21 31 41 SI 

I I I I I I 

MDSFSQDVKT RLLIMIRLLP PPNLSLLMPA SPAWQDDAVI SISQEVASEG NLTECQIYLV 60 
NPNVLHKIRD PLVHPVTDIS SIFNTAVCSN VQWSFSELDP 

Seq ID NO : 628 DNA sequence 
Nucleic Acid Accession ft: M18728.1 
Coding sequence: 2370.. 2501 

1 11 21 31 41 51 

I 1 II I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 30 0 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 3 60 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 42 0 

TCATAAAGTC AQATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAG AG C 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT T AGCAG CATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 16 B0 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 629 Protein sequence 
Protein Accession #: AAA59909.1 

1 11 21 31 41 51 

1 I I 1 I I 

MLTNVFISW LFPCSNLTKP TVLVLYCPGG AITVLVEWCC PNS 

Seq ID NO: 630 DNA sequence 

Nucleic Acid Accession #: NM_01663 9.1 

Coding sequence: 40.. 429 

1 11 21 31 41 51 

I I I I i I 

GCGGCGGGCG CAGACAGCGG CGGGCGCAGG ACGTGCACTA TGGCTCGGGG CTCGCTGCGC 60 

CGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGCGT TGCTGCGCTC CGTGGCCGGG 120 

GAGCAAGCGC CAGGCACCGC CCCCTGCTCC CGCGGCAGCT CCTGGAGCGC GGACCTGGAC 180 

AAGTGCATGG ACTGCGCGTC TTGCAGGGCG CGACCGCACA GCGACTTCTG CCTGGGCTGC 240 

GCTGCAGCAC CTCCTGCCCC CTTCCGGCTG CTTTGGCCCA TCCTTGGGGG CGCTCTGAGC 300 

CTGACCTTCG TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA 360 

GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGCGGAGAGG GCTGCCCAGC TGTGGCGCTG 420 
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ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTCGCCC ACT CAT CAT T CATTCATCCA 480 

TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG 540 

GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGG GAACCTTCCA 600 

AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GG AG CCTCAC GCTGGCTCAC 660 

ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT 720 

CCTTCCTTAO QACCTQGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC 700 

TCACTCAGAT GTCCTGAAAT TCCACCACGG GGGTCACCCT GGGGGGTTAG GGACCTATTT 840 

TTAACACTAG GGGCTGGCCC ACTAGGAGGG CTGGCCCTAA GATACAGACC CCCCCAACTC 900 

CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG GAGGGGAGGG AGAATTTATT 960 
AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 

Seq ID NO: 631 Protein sequence 
Protein Accession #: NP_057723.1 

1 11 21 31 41 51 

I 1 I I I I 

MARGSLRRLL RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60 

SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEETGGE 120 
GCPAVALIQ 

Seq ID NO t 632 dna sequence 

Nucleic Acid Accession #: NM_003 816.1 

Coding sequence: 79.. 2538 

1 11 21 31 41 51 

I I I I I I 

CGGCAGGGTT GGAAAATGAT GGAAGAGGCG GAGGTGGAGG CGACCGAGTG CTGAGAGGAA 60 

CCTGCGGAAT CGGCCGAGAT GGGGTCTGGC GCGCGCTTTC CCTCGGGGAC CCTTCGTGTC 120 

CGGTGGTTGC TGTTGCTTGG CCTGGTGGGC CCAGTCCTCG GTGCGGCGCG GCCAGGCTTT 180 

CAACAGACCT CACATCTTTC TTCTTATGAA ATTATAACTC CTTGGAGATT AACTAGAGAA 240 

AGAAGAGAAG CCCCTAGGCC CTATTCAAAA CAAGTATCTT ATGTTATTCA GGCTGAAGGA 300 

AAAGAG CAT A TTATTCACTT GGAAAGGAAC AAAGACCTTT TGCCTGAAGA TTTTGTGGTT 360 

TATACTTACA ACAAGGAAGG GACTTTAATC ACTGACCATC CCAATATACA GAATCATTGT 420 

CATTATCGGG GCTATGTGGA GGGAGTTCAT AATTCAT CCA TTGCTCTTAG CGACTGTTTT 480 

GGACTCAGAG GATTGCTGCA TTTAGAGAAT GCGAGTTATG GGATTGAACC CCTGCAGAAC 540 

AGCTCTCATT TTGAGCACAT CATTTATCGA ATGGATGATG TCTACAAAGA GCCTCTGAAA 600 

TGTGGAGTTT CCAACAAGGA TATAGAGAAA GAAACTGCAA AGGATGAAGA GGAAGAGCCT 660 

CCCAGCATGA CTCAGCTACT TCGAAGAAGA AGAGCTGTCT TGCCACAGAC CCGGTATGTG 720 

GAGCTGTTCA TTGTCGTAGA CAAGGAAAGG TATGACATGA TGGGAAGAAA TCAGACTGCT 780 

GTGAGAGAAG AGATGATTCT CCTGGCAAAC TACTTGGATA GTATGTATAT TATGTTAAAT 840 

ATTCGAATTG TGCTAGTTGG ACTGGAGATT TGGACCAATG GAAACCTGAT CAACATAGTT 900 

GGGGGTGCTG GTGATGTGCT GGGGAACTTC GTGCAGTGGC GGGAAAAGTT TCTTATCACA 960 

CGTCGGAGAC ATGACAGTGC ACAGCTAGTT CTAAAGAAAG GTTTTGGTGG AACTGCAGGA 1020 

ATGGCATTTG TGGGAACAGT GTGTTCAAGG AGCCACGCAG GCGGGATTAA TGTGTTTGGA 1080 

CAAATCACTG TGGAGACATT TGCTTCCATT GTTGCTCATG AATTGGGTCA TAATCTTGGA 1140 

ATGAATCACG ATGATGGGAG AGATTGTTCC TGTGGAGCAA AGAGCTGCAT CATGAATT CA 1200 

GGAGCATCGG GTTCCAGAAA CTTTAGCAGT TGCAGTGCAG AGGACTTTGA GAAGTTAACT 1260 

TTAAATAAAG GAGGAAACTG CCTTCTTAAT ATTCCAAAGC CTGATGAAGC CTATAGTGCT 1320 

CCCTCCTGTG GTAATAAGTT GGTGGACGCT GGGGAAGAGT GTGACTGTGG TACTCCAAAG 1380 

GAATGTGAAT TGGACCCTTG CTGCGAAGGA AGTACCTGTA AGCTTAAATC ATTTGCTGAG 1440 

TGTGCATATG GTGACTGTTG TAAAGACTGT CGGTTCCTTC CAGGAGGTAC TTTATGCCGA 1500 

GGAAAAACCA GTGAGTGTGA TGTTCCAGAG TACTGCAATG GTTCTTCTCA GTTCTGTCAG 1560 

CCAGATGTTT TTATTCAGAA TGGATATCCT TGCCAGAATA ACAAAGCCTA TTG CTACAAC 1620 

GGCATGTGCC AGTATTATGA TGCTCAATGT CAAGTCATCT TTGGCTCAAA AGCCAAGGCT 1680 

GCCCCCAAAG ATTGTTTCAT TGAAGTGAAT TCTAAAGGTG ACAGATTTGG CAATTGTGGT 1740 

TTCTCTGGCA ATGAATACAA GAAGTGTGCC ACTGGGAATG CTTTGTGTGG AAAGCTTCAG 1800 

TGTGAGAATG TACAAGAGAT ACCTGTATTT GGAATTGTGC CTGCTATTAT TCAAACGCCT 1860 

AGTCGAGGCA CCAAATGTTG GGGTGTGGAT TTCCAGCTAG GATCAGATGT TCCAGATCCT 1920 

GGGATGGTTA ACGAAGGCAC AAAATGTGGT GCTGGAAAGA TCTGTAGAAA CTTCCAGTGT 1980 

GTAGATGCTT CTGTTCTGAA TTATGACTGT GATGTTCAGA AAAAGTGTCA TGGACATGGG 2040 

GTATGTAATA GCAATAAGAA TTGTCACTGT GAAAATGGCT GGGCTCGCCC AAATTGTGAG 2100 

ACTAAAGGAT ACGGAGGAAG TGTGGACAGT GGACCTACAT ACAATGAAAT GAATACTGCA 2160 

TTGAGGGACG GACTTCTGGT CTTCTTCTTC CTAATTGTTC CCCTTATTGT CTGTGCTATT 2220 

TTTATCTTCA TCAAGAGGGA TCAACTGTGG AGAAGCTACT TCAGAAAGAA GAGATCACAA 2280 

ACATATGAGT CAGATGGCAA AAATCAAGCA AACCCTTCTA GACAGCOGGG GAGTGTTCCT 2340 

CGACATGTTT CTCCAGTGAC ACCTCCCAGA GAAGTTCCTA TATATGCAAA CAGATTTGCA 2400 

GTACCAACCT ATGCAGCCAA GCAACCTCAG CAGTTCCCAT CAAGGCCACC TCCACCACAA 2460 

CCGAAAGTAT CATCTCAGGG AAACTTAATT CCTGCCCGTC CTGCTCCTGC ACCTCCTTTA 2520 

TATAGTTCCC TCACTTGATT TTTTTAACCT TCTTTTTGCA AATGT CTTCA GGGAACTGAG 2580 

CTAATACTTT TTTTTTTTCT TGATGTTTTC TTGAAAAGCC TTTCTGTTGC AACTATGAAT 2640 

GAAAACAAAA CACCACAAAA CAGACTTCAC TAACACAGAA AAACAGAAAC TGAGTGTGAG 2700 

AGTTGTGAAA TACAAGGAAA TG CAGTAAAG CCAGGGAATT TACAATAACA TTTCCGTTTC 2760 

CATCATTGAA TAAGTCTTAT TCAGTCATCG GTGAGGTTAA TGCACTAATC ATGGATTTTT 2820 

TGAACATGTT ATTGCAGTGA TTCTCAAATT AACTGTATTG GTGTAAGATT TTTGTCATTA 2880 

AGTGTTTAAG TGTTATTCTG AATTTTCTAC CTTAGTTATC ATTAATGTAG TTCCTCATTG 2940 

AACATGTGAT AATCTAATAC CTGTGAAAAC TGACTAATCA GCTGCCAATA ATATCTAATA 3000 

TTTTTCATCA TGCACGAATT AATAATCATC ATACTCTAGA ATCTTGTCTG TCACTCACTA 3060 

CATGAATAAG CAAATATTGT CTTCAAAAGA ATGCACAAGA ACCACAATTA AGATGTCATA 3120 

TTATTTTGAA AGTACAAAAT ATACTAAAAG AGTGTGTGTG TATTCACGCA GTTACTCGCT 3180 

TCCATTTTTA TGACCTTTCA ACTATAGGTA ATAACTCTTA GAGAAATTAA TTTAATATTA 3240 

GAATTTCTAT TATGAATCAT GTGAAAGCAT GACATTCGTT CACAATAGCA CTATTTTAAA 3300 

TAAATTATAA GCTTTAAGGT ACGAAGTATT TAATAGATCT AATCAAATAT GTTGATTCAT 3360 

GGCTATAATA AAGCAGGAGC AATTATAAAA TCTTCAATCA ATTGAACTTT TACAAAACCA 3420 

CTTGAGAATT TCATGAGCAC TTTAAAATCT GAACTTTCAA AGCTTGCTAT TAAATCATTT 3480 

AGAATGTTTA CATTTACTAA GGTGTGCTGG GTCATGTAAA ATATTAGACA CTAATATTTT 3540 

CATAGAAATT AGGCTGGAGA AAGAAGGAAG AAATGGTTTT CTTAAATACC TACAAAAAAG 3600 

TTACTGTGGT ATCTATGAGT TATCATCTTA GCTGTGTTAA AAATGAATTT TTACTATGGC 3660 
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AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCATAAT 
AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGGTT 
TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 
AAAAAAAAAA AAAAAAAAAA AAAAA 

Seq ID NO: 633 Protein sequence 
Protein Accession ft: NP_003807.l 



PCT/US02/12476 



3720 
37B0 
3840 



MGSGARPPSG 
PYSKQVSYVI 
EGVHNSSIAL 
DIEKETAKDE 
LLANYLDSMY 
AQLVLKKGFG 
RDCSCGAKSC 
LVDAGEECDC 
DVPEYCNGSS 
IEVNSKGDRF 
WGVDFQLGSD 
NCHCENGWAP 
DQLWRSYFRK 
KQPQQPPSRP 



11 
1 

TLRVRWLLLL 
QAEGKEHIIH 
SDCFGLRGLL 
EEEPPSMTQL 
IMLNIRIVLV 
GTAGMAFVGT 
IMNSGASGSR 
GTPKECELDP 
QFCQPDVFIQ 
GNCGFSGNEY 
VPDPGMVNEG 
PNCETKGYGG 
KRSQTYESDG 
PPPQPKVSSQ 



21 
I 

GLVGPVLGAA 
LERNKDLLPE 
HLENASYGIE 
IiRRRRAVLPQ 
GLEIWTNGNL 
VCSRSHAGGI 
NFSSCSAEDP 
CCEGSTCKLK 
NGYPCQNNKA 
KKCATGNALC 
TKCGAGKICR 
SVDSGPTYNE 
KNQANPSRQP 
GNLIPARPAP 



31 

I 

RPGPQQTSHL 
DFWYTYNKE 
PLQNSSHFEH 
TRYVELFIW 
XNIVGGAGDV 
NVFGQITVET 
EKLTLNKGGN 
SFAECAYGDC 
YCYNGMCQYY 
GKLQCENVQE 
NFQCVDASVL 
MNTALRDGLL 
GSVPRHVSPV 
APPLY SSLT 



Seq ID NO: 634 DNA sequence 

Nucleic Acid Accession ft: NMJJ02091.1 

Coding sequence: 56.. 503 



AGTCTCTGCT 
CGGCAGTGAG 
AGCGGTCCCG 
CCACTGGGCG 
TGAGAGAGGG 
GAATTTGCTG 
GGCCTTGGGC 
AGGTTCAAAA 
CCCCCAGCTG 
TAAGAGACTG 
AAATATTTGA 
CTTCTGGTTT 
TTTTTATATC 
TAAAAGCTTA 



11 
I 

CTTCCCAGCC 
CTCCCGCTGG 
CTGCCTGCGG 
GTGGGGCACT 
AGCCTGAAGC 
GGTCTCATAG 
AATCAGCAGC 
GGCAAAGTTG 
AACCAGCAAT 
AGTTCTGCAA 
CTATTCTGTA 
AAACTTGTTT 
TAGGCTACCT 
AACACAT 



21 
I 

TCTCCGGCGC 
TCCTGCTGGC 
GCGGAGGGAC 
TAATGGGGAA 
AGCAGCTGAG 
AAGCAAAGGA 
CTTCGTGGGA 
GTAGACTCTC 
GATAATGATG 
GCATCAGTTC 
TCTTTCATCC 
GCTGTGAACA 
GTTGGTTAGA 



31 
I 

GCTCCAAGGG 
GCTGGTCCTC 
CGTGCTGACC 
AAAG AG CACA 
AGAGTACATC 
GAACAGAAAC 
TTCAGAGGAT 
TGCTCCAGGT 

TACGGATCAT 
TTGACTAAAT 
ATTGTCGAAA 
TTCAAGGCCC 



41 

i 

SSYEIITPWR 
GTLITDHPNI 
IIYRMDDVYK 
DKERYDMMGR 
LGNFVQWREK 
FA9IVAHELG 
CLLNIPKPDE 
CKDCRFLPGG 
DAQCQVIFGS 
IPVFGIVPAI 
NYDCDVQKKC 
VFFFLIVPLI 
TPPREVPIYA 



41 

1 

CTTCCCGTCG 
TGCCTAGCGC 
AAGATGTACC 
GGGGAGTCTT 
AGGTG GGAAG 
CACCAGCCAC 
AGCAGCAACT 
TCTCAACGTG 
AAAGAGAAAA 
CAACAAGATT 
TCGTGATTTT 
AGAGTCTTCC 
CGAGCTGTTA 



51 
I 

LTRERREAPR 
QNHCHYRGYV 
EPLKCGVSNK 
NQTAVREEMI 
FLITRRRHDS 
HNLGMNHDDG 
AYSAPSCGNK 
TLCRGKTSEC 
KAKAAPKDCF 
IQTPSRGTKC 
BGHGVCNSNK 
VCAIFIFIKR 
NRFAVPTYAA 



51 
I 

GGACCATGCG 
CCCGGGGGCG 
CGCGCGGCAA 
CTTCTGTTTC 
AAGCTGCAAG 
CTCAACCCAA 
TCAAAGATGT 
AAGGAAGGAA 
ACAAAACCCC 
TCCTTGTGCA 
CAAGCAGCAT 
AATTAATGCT 
CCATTCACAA 



Seq ID NO: 635 Protein sequence 
Protein Accession #» NP_002082.l 



11 



21 



51 



31 41 

I I I 

MRGSELPLVIi LALVLCIjAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGESSS 

VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDS8NFK 
DVGSKGKVGR LSAPGSQREG RNPQLNQQ 

Seq ID NO: 636 DNA sequence 

Nucleic Acid Accession #i NM_016522.1 

Coding sequence: 2 6 5.. 12 9 9 



GCGGAAGCAG 
CTGGCAAAAG 
TTTTCTCCTC 
CCGCACCCCA 
TCGGGGAAGT 
TGCCTCGTGG 
AGCGGAGATG 
GCCACCCTCA 
ACCATCCTCT 
AACACCCAAA 
TACACCTGCT 
CAAGTATCTC 
ATTAGCCTCA 
TCTCCCAAAG 
CGGGAACAGT 
CGGAGAGTAA 
GTCCCCGTGG 
TTCCAGTGGT 
AACAGACCTT 
TACACTTGCG 
CCAGGCGCCO 
CTGCCTCTTC 
CGGGAAAGGC 
CCAATCAGAT 
GGGAGGGGAA 
CCTTGCAGAT 



11 
I 

CGAGGAGGGA 
CCQAGGCTQQ 
CCCGCGCCTC 
CCCACTTCCT 
TGTGGCTGTC 
TCGTGTCTCT 
CCACCTTCCC 
GGTGCACTAT 
ATGCTGGGAA 
CGCAGTACAG 
CGGTGCAGAC 
CCAAAATTGT 
CCTGCATAGC 
CGGTTGGCTT 
CAGGGGACTA 
AGGTCACOGT 
GACAAAAGGG 
ACAAGGATGA 
TCCTCTCAAA 
TGGCCTCCAA 
TCAGCGAGGT 
TGGTCTTGCA 
TGCCGCCACC 
ATATACAAAT 
CAAAGAATAC 
ATTTAGGTAC 



21 
I 

GCCCCCTTTG 
ATTTGGGGGA 
CCGGTCGCCG 
GTGCTCGCCC 
QAGAATGGGG 
CAGGCTGCTG 
CAAAGCTATG 
TGACAACCGG 
TGACAAGTGG 
CATCGAGATC 
AGACAACCAC 
AGAGATTTCT 
AACTGGTAGA 
TGTGAGTGAA 
CGAGTGCAGT 
GAACTATCCA 
GACACTGCAG 
CAAAAGACTG 
ACT CATC TTC 
CAAGCTGGGC 
GAGCAACGGC 
CCTGCTTCTC 
ACCACCACCA 
GAAATTAGAA 
TTTGGGGGGA 
AATGGAGTTT 



31 

1 

GCCGTCCTCC 
GGAATATTAG 
CGGGTTCACC 
GGGGGGCGTG 
GTCTGTGGGT 
TTCCTTGTAC 
GACAACGTGA 
GTCACCCGGG 
TGCCTGGATC 
CAGAACGTGG 
CCAAAGACCT 
TCAGATATCT 
CCAGAGCCTA 
GACGAATACT 
GCCTCCAATG 
CCATACATTT 
TGTGAAGCCT 
ATTGAAGGAA 
TTCAATGTCT 
CACACCAATG 
ACGTCGAGGA 
AAATTTTGAT 
ACACAACAGC 
GAAACACAGC 
AAAGAGTTTT 
TCTTTTCCCA 



41 

! 

GTGGAACCGG 
ACTCGGAGGA 
GCTCAGTCCC 
TGCCGTGCGG 
ACCTGTTCCT 
CCACAGGAGT 
CGGTCCGGCA 
TGGCCTGGCT 
CTCGCGTGGT 
ATGTGTATGA 
CTAGGGTCCA 
CCATTAATGA 
CGGTTACTTG 
TGGAAATTCA 
ACGTGGCCGC 
CAGAAGCCAA 
CAGCAGTCCC 
AGAAAGGGGT 
CTGAACATGA 
CCAGCATCAT 
GGGCAGGCTG 
GTGAGTGCCA 
AATGGCAACA 
CTCATGGGAC 
AAAAAAGAAA 
AACGGGAAGA 



51 
I 

TTTTCCGAGG 
GTCTGCGCGC 
CGCGCTCGCT 
CTGCCGGAGT 
GCCCTGGAAG 
GCCCGTGCGC 
GGGGGAGAGC 
AAACOGCAGC 
CCTTCTGAGC 
CGAGGGCCCT 
CCTCATTGTG 
AGGGAACAAT 
GAGA CA CAT C 
GGGCATCACC 
GCCCGTGGTA 
GGGTACAGGT 
CTCAGCAGAA 
GAAAGTGGAA 
CTATGGGAAC 
GCTATTTGGT 
CGTCTGGCTG 
CTTCCCCACC 
CCGACAGCAA 
AGAAATTTGA 
TTGAAAATTG 
ACACAGCACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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CCCGGCTTGG ACCCACTGCA AGCTGCATCG TGCAACCTCT TTGGTGCCAG TGTGGGCAAG 
GGCTCAGCCT CTCTGCCCAC AGACTGCCCC CACGTGGAAC ATTCTGGAGC TGGCCATCCC 
AAATTCAATC AGTCCATAGA GACGAACAGA ATGAGACCTT CCGGCCCAAG CGTGGCGCTT 
CCGGCCCAAG CGTGGCGCTG CGGGCACTTT GGTAGACTGT GCCACCACGG CGTGTGTTGT 
GAAACGTGAA ATAAAAAGAG CAAAAAAAAA AAAAAAAAA 

Seq ID NO: 637 Protein sequence 
Protein Accession #: NP_057606.1 ' 



PCT/US02/12476 



1620 
1660 
1740 
1800 



MGVCGYLFLP 
NRVTRVAWUJ 
NHPKTSRVHL 
SEDEYLBIQG 
LQCBASAVPS 
LGHTNASIML 



11 
I 

WKCLVWSLR 
RSTILYAGND 
IVQVSPKIVE 
I TREQSGD YE 
AEFQWYKDDK 
FGPGAVSEVS 



21 
I 

LLFLVPTGVP 
KWCLDPRWL 
ZSSOISINEG 
CSASNDVAAP 
RLIEGKKGVK 
NGTSRRAGCV 



31 

I 

VRSGDATPPK 
LSNTQTQYSI 
NNISLTCIAT 
WRRVKVTVN 
VENRPFLSKL 
WLLPLLVLHL 



41 

I 

AMDNVTVRQG 
EIQNVDVYDE 
GRPEPTVTWR 
YPPYISEAKG 
IPFNVSEHDY 
LLKP 



51 
I 

ESATLRCTID 
GPYTCSVQTD 
HISPKAVGFV 
TGVPVGQKGT 
GNYTCVASNK 



Seq ID NO : 638 DNA sequence 

Nucleic Acid Accession «: NM_012261.1 

Coding sequencet 203., 1045 



GATTTGCTCT 
ACAGAATACG 
CACTCCAGCG 
CCTCATTCGG 
ACTTCGAGTT 
GGAAAATCTC 
TGGGACGACG 
GGCCAGCAAC 
TGAGGTGAAG 
CGCATATGCA 
GGCGACTTGG 
CAAAGACGCA 
CACCCCCGCT 
TGATCCGCAG 
TATCTCAGAT 
GGAAGAAACC 
CGCGATTTAC 
ATCCCAGTAT 
CCAACTGGAT 
CATAGCTACA 
AACCCACGGA 
ATGCTGGGGA 
TGACTCTCCA 
TTGAAAACAT 
TGCTCCCTTG 
TCATGCTCCC 
GTTTAGTGAT 
AAAACGACTA 
GGGGGACCTG 
TTCTCTGGC 



11 
I 

GCCAGCAGCT 
CGCTCCCTCC 
GCGACTTTGA 
GGCACTGCGA 
CTCCTGATGT 
TCAGGCCTTT 
TGTCTCATGG 
TACGTAGATC 
GGCCGCTGTG 
CTCAAAATGC 
AGGCTGAGCA 
GTCAGTGCTG 
GGGAAGTCCT 
AAGACGGTCA 
TTTGTCTTCA 
TTGCCCCTGA 
CACGTCCACC 
AAGCACATGG 
CAGGTAGAAC 
ATCAAACAGG 
AGGGGGAGAC 
GGAGGGGAGG 
AAGAGCAATA 
GCTTCTTTGA 
GACACAGCTG 
TGCAGCAAGA 
TGTCTTGGGA 
ATGTAACTAT 
AAGAATCAAT 



21 
I 

QTCGGTQCCG 
CTCCCCCTTC 
GGGATTCCCT 
GTATGGATCT 
TGTTCCATAC 
CCACTAACCC 
CAGAGTTTGC 
TGATCACAGA 
GCCACAGCCA 
TCTTTGTAAA 
AAGTGCAGTT 
GGAAGCACAC 
ATGAGTGTCA 
CCATGATCCT 
GTGAAGAGCA 
TTTTGGGGCT 
ACAAAATGAC 
GCTAGAGGCC 
AACAAAAGCA 
CCTGGGTATC 
TCTTTCGGAT 
AGGGTCTCAG 
AATGCCACTT 
GGAGGAAACC 
GCTTATCCTA 
CCCCTGAAAG 
ATGTTT CACT 
GCAGAGTTGT 
CTGTGTGAGT 



31 
I 

CGCTCGACAC 
TCTGTCCCCC 
CTCTGGCGGC 
CCAAGGAAGA 
AATGGCTCAA 
TGAAAAAGAT 
AGCCAAATTT 
ACAGGCCGAT 
GTCGGAGCTG 
GGAAAG CCAC 
TGTCTACGAC 
AGCCAACTCG 
AGCTCAACAA 
GTCTGGGGTC 
TAAATGCCCA 
CATCTTGGGC 
TGCCAACCAG 
GTTAGGCAGG 
CTTTTCCATC 
TGAGGCTTGC 
TTGTAGGGTG 
ACAGCTTTCG 
GGAGCTGTAT 
CCTTTAGGTT 
TACAGTTGTC 
TGATTCATGC 
GCTACCCGCA 
TTGGACTTCT 
CTGTTTTTCA 



41 

I 

CGAGTCCTAG 
GCCTCTCGCT 
CTCTGCAGCA 
GGGGTCCCCA 
ATCATGGCAG 
ATATTTGTGG 
ATTGTACCTT 
ATCGCATTGA 
CAAGTGTTCT 
AACATGTCCA 
TCCTCGGAGA 
CACCACCTCT 
ACCATTTCAC 
CACATCCAAC 
GTGGATGAGC 
CTCGTCATCA 
GTGCAGATCC 
CACCCCCTAT 
TTGTACACGA 
TTGGCTTGTG 
AAATGGCAAT 
TGCTCATGGT 
CTGGCCCCAA 
CAGAAGAATA 
AATGCACACA 
TTCTGGCTGG 
TCCAGCGACT 
TCCTGTGCCA 
AAATGAAATA 



Seq ID NO i 639 Protein sequence 
Protein Accession #x NP 0363 93.1 



1 11 

I I 
MDLQGRGVPS IDRLHVLLML 
EFAAKPIVPY DVWASNYVDL 
FVKESHNMSK GPEATWRLSK 
ECQAQQTISL ASSDPQKTVT 
LGLILGLVIM VTLAIYHVHH 



21 31 41 

I I I 

FHTMAQIMAE QEVENLSGLS TNPEKDIFW 
ITEQADIALT RGAEVKGRCG HSQSELQVFW 
VQFVYDSSEK THFKDAVSAG KHTANSHHLS 
MILSAVHIQP FDIISDFVFS EEHKCPVDER 
KMTANQVQIP RDRSQYKHMG 



Seq ID NO: 640 DNA sequence 

Nucleic Acid Accession ft: NM_002993.1 

Coding sequence: 64.. 408 



1 

I 

GGCACGAGCC 
ACTATGAGCC 
GCGCTGCTCG 
GTCTCTGCTG 
CCCAAAACGA 
GTGGTAGCCT 
AAGAAAGTCA 
ACCATGCATC 
CAGTAAGAAT 
GAAGAGTGTG 
CTAATATAGT 
CAATTGACCA 
TGAAGATAAC 
ATTTCGTATG 
ACT CACT CTT 



11 
I 

AGTCTCCGCG 
TCCCGTCCAG 
CGCTGCTGCT 
TGCTGACAGA 
TTGGTAAACT 
CCCTGAAGAA 
TCCAGAAAAT 
ATAAAATTGC 
AAGAAGGAAG 
GGGGAAAGCC 
ATTTCCACTA 
TATTGTGAGC 
TATTGTATTT 
GAAATAATGT 
CTCATAAAAT 



21 
I 

CCTCCACCCA 
CCGCGCGGCC 
CCTGCTGACG 
GCTGCGTTGC 
GCAGGTGTTC 
CGGGAAGCAA 
TTTGGACAGT 
CCAGTCTTCA 
GGTTGGTTTT 
TACGCTTCTC 
TTTACTGTTA 
AAAGAATCAC 
CTATCATACA 
TTTATTAGTG 
AGGAAATATT 



31 
I 

GCTCAGGAAC 
CGTGTCCCGG 
CCGCCGGGGC 
ACTTGTTTAC 
CCCGCAGGCC 
GTTTGTCTGG 
GGAAACAAGA 
GCGGAGCAGT 
TTTCCATTTT 
CCTGAAGTTT 
TTTTACCTGA 
TGGTTATTAG 
TTCCTTAAAG 
TGCTGTTGAG 
TTAGTTCTGT 



41 

I 

CCGCGAACCC 
GTCCTTCGGG 
CCCTCGCCAG 
GCGTTACGCT 
CGCAGTGCTC 
ACCCGGAAGC 
AAAACTGAGT 
TTTCTG GAGA 
CTACATGGAT 
ACAGCTCAGC 
TAAGTTATTG 
TCTTTCAATG 
TCTTACCGAA 
GGAGGTATCC 
TTTCTTGGGG 



51 
I 

CTAGGCGCTC 
CACCCCGGCC 
GCACAGCCGG 
GCATCGACAG 
AACAAGAAGT 
TGCGGGAAAA 
ATGATGTGTG 
CCCGGGGAGC 
GGGTGGATCG 
AGGGACCTGA 
AAACCCACTT 
CTGCCTTGGT 
TGGCCTCTAG 
CTTTTGACAT 
GGGAGCAACT 
TGGTAACACT 
CTCGGGACAG 
TCCTGCTCCC 
GATACACCAA 
TCCATGCTTA 
TATTCTCTCC 
GGCTTGGCTT 
AGTTTAGGGA 
TGGGGTGCTT 
GAATACAACC 
CATTCTGCAT 
GCAGCACCAG 
GGTCCAAGTC 
AAACACACTA 



51 
I 

RENGTTCLMA 
VDRAYALKML 
ALVTPAGXSY 
EQDEETLPLI 



51 

I 

TCTCTTGACC 
CTCCTTGTGC 
CGCTGGTCCT 
GAGAGTAAAC 
CAAGGTGGAA 
CCCTTTTCTA 
AACAAAAAAG 
TCCCTGGACC 
TCCCTACTTT 
TAATGAAGTA 
AACCCTTTGG 
AATATTGAAT 
AAGGCTGTGG 
TGTTGTTCTT 
AATATGTTAC 



60 
120 
180 
240 
300 



60 
120 
180 
'240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



431 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 
TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 
CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCATAATGT 
CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TT CACTAT AG GATGACTATA 
ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 
TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 
AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT ATAAATTCAT 
CATTTASTCC TCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTGTGGATCT 
TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT GTATCACATT CACTATATTA 
AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTG TATTGTCATT 
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 

Seq 10 NO i 641 Protein sequence 
Protein Accession #: NP 002984.1 



PCT/US02/12476 



41 



51 



1 11 21 31 

I I I I 1 I 

MSLPSSRAAR VPGPSGSLCA LrLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 
KTIGKLQVFP AGPQCSKVEV VASLKNGKQV CU3PEAPPLK KVIQKILDSG NKKN 

Seq ID NO: 642 DNA sequence 

Nucleic Acid Accession #: NMJU3271.1 

Coding sequence: 2 7.. 80 9 



TCCGGAGCCA 
CCGGGGGCGT 
TCTGCGCGCG 
AGACTGGCGC 
TGCAGGAGCT 
GGGCCGAGGC 
TCTGGGGCGC 
CTGCAGCGCA 
CCCAGCTTGT 
ACGACGGCCC 
CCGAGCTGTT 
TGGCAGCCCC 
CTGAGGGCGT 
TGCCTGCACG 
CAGAAGTGCC 
TTACCCCGGC 
GATCTGAGC 



11 
I 

GGCTCGCTGG 
CGGCCTTTTG 
GCOGGTAAAG 
TCCTCGCCGC 
GGCGCGGGCG 
GCAGGAGGCT 
CCCCCGCAAC 
GCTCGCTCGC 
CCCCGCGCCC 
CGCGGGCCCG 
GAGGTACTTG 
GCGCCGCCTC 
GCTGGGGGCG 
CCGCCTCTTG 
CCCGCCATCC 
CAGCCAGCCC 



21 

1 

GGCAGCATGG 
GTGCTGCTGC 
GAACCCCGCG 
TTCCGGCGGT 
CTGGCGCATC 
GAGGATCAGC 
TCTGATCCGG 
GCTCTGCTCC 
GTCCCCGCCG 
GATGCTGAGG 
CTGGGACGGA 
CGCCGTGCCG 
CTGCTGCGTG 
CCACCCTGAG 
CGCCACCAGG 
TCTCACCCGA 



31 
I 

CGGGGTCGCC 
TGCTCGGCCT 
GCCTAAGCGC 
CAGTGCCCCG 
TGCTGGAGGC 
AGGCGCGCGT 
CTCTGGGCCT 
GCGCCCGCCT 
CGGCGCTCCG 
AGGCAGGCGA 
TTCTTGCGGG 
CCGACCACGA 
TGAAACGCCT 
CACTGCCCGG 
ACTTCTCCCC 
GGATCCCTAC 



41 
I 

GCTGCTCTGG 
GTTTCGGCCG 
AGCGTCTCCG 
AGGTGAGGCG 
CGAACGTCAG 
CCTGGCGCAG 
GGACGACGAC 
TGACCCTGCC 
ACCCCGGCCC 
CGAGACACCC 
AAGCGCGGAC 
TGTGGGCTCT 
AGAGACCCCG 
ATCCCGTGCA 
GCCAGCACGT 
CCCCTGGCCC 



SI 

I 

GGGCCGCGGG 
CCCCCCGCGC 
CCCTTGGCTG 
GCGGGGGCGG 
GAGCGGGCGC 
CTGCTGCGCG 
CCCGACGCGC 
GCCCTAGCAG 
CCGGTCTACG 
GACGTGGACC 
TCCGAGGGGG 
GAGCTGCCCC 
GCGCCCCAGG 
CCCTGGGACC 
CCAGAGCAAC 
ACAATAACAT 



Seq ID NO: 643 Protein sequence 
Protein Accession #: NP_03 7403.1. 

1 11 21 31 41 51 

I I I I I I 

MAGSPLLWGP RAGGVGLLVL LLLGLPRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 
RSVPRGEAAG AVQELARAIA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 
PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 
EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLQALL 
RVKRLETPAP QVPARRLfcPP 

Seq ID NO: 644 DNA sequence 
Nucleic Acid Accession #: NM_002214 
Coding sequence: 681.. 2990 



1 
I 

CCCAGAGCCG 
CTGCCGACTT 
GTTGGCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
TGTCCCGGAG 
TGGCCGTCGA 
GGCCGTAGGG 
CCGAGCCGCG 
GGCCCCGAGG 
GGGGCGGGCT 
TCTGCCTGCA 
CACTTGTTCT 
CCTGTGCCAG 
TTTCAGGTGG 
GCTCAGTTGA 
TTAATACCCA 
ATTTTATGCT 
ATGTCTCAGC 
CTAGAAAAAT 
AAACAGTTTC 
ACAATTTAGA 
TCACTGAGTT 
AAGGAGGTTT 
AAGAGGCTAA 



11 



GTCTTTGCCC 
CTGCCCACCT 
CTCGCCGGCG 
TCCCCCCCAG 
TTATGCAGCA 
CAGGCTGCGG 
AGGAGGTGCT 
GCCCTGAGAT 
GGGTCCGCCT 
TCGCCCGGGA 
GTTTTG CATT 
AAACGACCGG 
TGGACTGGGC 
GTGCCTTGCG 
ATCAAGAAGT 
TTCAATAGAA 
GGTGACACCA 
GAAAGTTCAT 
ATCAATGCAC 
GGCATTTTTC 
ACCATACATT 
CTGCATGCCT 
TGAGAAAGCA 
TGACGCCATG 
AAGATTGCTG 



21 
I 

TTGCTGGCAT 
GCTGCTCCGC 
GTGGAAGCAA 
TACCCTCCCA 
CTTCGGGCTT 
GAAGCCCCAC 
AGCCCTTGCA 
TCTCGCGGAG 
GCCGAGCGGT 
GCTAGGCCTG 
GGCCGAGCCC 
ATGTGCGGCT 
CGAGGTCCCG 
CAAGGTGAAG 
CTGGGTCCAG 
GAACGTTGTG 
TACCCATCTG 
GGAGAAGTGT 
CCTCTGAAGA 
AATAATATAG 
TCCCGTGACT 
AGCATCCACC 
CCCCATGGAT 
GTTCATAGAC 
CTTCAGGCAG 
CTGGTGATGA 



31 
I 

CCOGAGCTTC 
AGACGGGGCT 
CTGCGCTGAT 
CAGATCCAGC 
TGTTTGGGTT 
CGGCTGGAGA 
GAGCCCTCTC 
ACCGCGGGAC 
GCCCGGGCCC 
CGGAAAACGT 
GCGTCCGGAA 
CGGCCCTGGC 
CCTCGTTCCT 
ACAATAGATG 
AATGTGGATG 
ATATTGTTTC 
TGCATGTTAT 
CTATCCAGCT 
AATATCCTGT 
AAAAATTAAA 
TTCGTCTTGG 
CCGAAAGGAT 
ACATCCATGT 
AGAAGATCTC 
CTGTCTGTGA 
CAGATCAGAC 



41 

I 

CTCCCTTGCC 
GCAAAGCTGC 
TGATGCGCCA 
ATCACCCAGT 
TGATTGTGTT 
GAAACAAAAG 
TCCAGTCGCC 
CCGCCGTGCC 
GCTTACCTGC 
CCTAGCGACA 
GGCAGCCAGG 
TTTTTTTACC 
CTGGGCAGCC 
TGCATCTTCA 
GTGTGTTCAA 
CAATTTAATA 
AATACCCACT 
GCGTCCAGGA 
GGATCTTTAT 
TTCCGTTGGA 
ATTTGGCTCA 
TCATAATCAA 
GCTGTCTTTG 
TGGAAACATA 
AAGTCATATC 
GTCTCATCTC 



51 
I 

AGCCAGGACG 
AACTAATGGT 
CAGACTTTTT 
GAATGTACAT 
TGGCTCTTCG 
CTCTTTTCTT 
GCCGGGCCCT 
GAGCCGGGAG 
ACCGCTTGCT 
CTCGCCCGCG 
CGGCGGGCGC 
GCTGCATTTG 
TGGGTGTTTT 
AATGCAGCAT 
GAGGATTTCA 
AGCAAAGGCT 
GAAAATGAAA 
GCCGAAGCTA 
TATCTTGTTG 
AACGATTTAT 
TACGTTGATA 
TGCAGTGACT 
ACAGAGAACA 
GATACACCAG 
GGATGGCGAA 
GCTCTTGATA 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



432 



WO 02/086443 

GCAAATTGGC AGGCATAGTG GTQCCCAATQ ACGGAAACTG TCATCTGAAA AACAACGTCT 
ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 
ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 
TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 
ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 
ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 
CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 
TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 
GTTTTAATGA AACCGCTAAA ATT CAT AT AC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 
ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 
GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 
ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 
ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 
CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAQAGTG TGAAGCAGGC AGATGCCAAT 
GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 
TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 
GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 
GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 
TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 
CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 
TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 
ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 
TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 
TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 
TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 
AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 
GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 
AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 
ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 
TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 
CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 
TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 
ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 
AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 
TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 
TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 
TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 
GAATGTTAA 

Seq ID NOt 645 Protein sequence 
Protein Accession ft: NP_002205 

1 11 21 31 41 51 

I I I I I I 

MCGSAliAPPT AAFVCLQNDR RGPASFLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 

LGPECGWCVQ EDPISGGSRS ERCDIVSNLI SXGCSVDSIE YPSVHV1IPT ENEINTQVTP 

GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 

SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ C8DYNLDCMP PHGYIHVIiSL TEN I TEFEKA 

VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKIAGIV 

VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI FAVQGKQFHW YKDLLPLLPG 

TIAGEIESKA ANLNNLWEA YQKLISEVKV QVENQVQGIY FKITAICPDG SRKPGMEGCR 

NVTSNDEVLF NVTVTMKKCD VTGGKNYAII KPIGFNBTAK IHIHRNCSCQ CEDNRGPKGK 

CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 

VYGKYCEKDD PSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 

CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQdiHPHNLS QAILDQCKTS 

CALMEQQHYV DQTSECPSSP SYLRIFFIIP IVTFLIGLLK VLIIRQVILQ WNSNKIK9SS 

DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNF 

Seq ID NO t 646 DNA sequence 

Nucleic Acid Accession 8: NM_003318.l 

Coding sequence: 1..2574 



1 
I 

ATGGAATCCG 
AGAGACATTA 
ATTTCTGCTG 
AACCCAGAGG 
GATGCTCTTT 
GATAAATATG 
GCTATTCAAG 
AAATTTGCTT 
AAAAGTAAAC 
GAAATTGCCC 
AAGAATTTAT 
CATTTACAGA 
TTATATGGAG 
CAAACTAACA 
AGCCCAGATT 
ACCTCTAGAT 
TCCTGTGAAT 
TCAGATGAAA 
GAATCAAGTC 
GAGAGTAACC 
GCTGCATCTT 
AAACATACCA 
ACATCTAAAT 



11 
I 

AGGATTTAAG 
AAAATAAGTT 
ATACTACAGA 
ACTGGTTGAG 
TAAATAAATT 
GCCAAAATGA 
AGCCAGATGA 
TTGTTCATAT 
AACTTCTTCA 
TGCGGAATTT 
CAGCATCTAC 
ATAGGAACAA 
AGAACATGCC 
AAACTAAACA 
GTGATGTGAA 
CAGAATGCCG 
TAAGAAATTT 
AGAGTTCTGA 
TTCTAGCTAA 
AGAAACAGTG 
CAAATCACTG 
CTTTTGAGCA 
GGTTTGACCC 



21 
I 

TGGCAGAGAA 
TAAAAATGAA 
TAACTCGGGA 
TTTGTTGCTC 
GATTGGTCGT 
GAGTTTTGCT 
TGCACGTGAC 
ATCTTTTGCA 
AAAAGCTGTA 
AAACCTCCAA 
GGTATTAACT 
CAGTTGTGAT 
ACCACAAGAT 
GTCATGCCCA 
GACAGATGAT 
AGATTTGGTT 
AAAGTCTGTT 
ACTTATTATT 
ATTAGAAGAA 
GCAATCTAAG 
GCAGATTCCG 
ACCTGTCTTT 
AAAATCTATT 



31 
I 

TTGACAATTG 
GACCTTACTG 
ACTGTTAACC 
AAACTAGAGA 
TACAGTCAAG 
AGAATTCAAG 
TACTTTCAAA 
CAATTTGAAC 
GAACGTGGAG 
AAAAAGCAGC 
GCCCAAGAAT 
TCCAGAGGAC 
GCAGAAATAG 
TTTGGAAGAG 
TCAGTTGTAC 
QTGCCTGGAT 
CAAAATAGTC 
ACTGATTCAA 
ACTAAAGAGT 
AGAAAGTCAG 
GAGTTAGCCC 
TCAGTTTCAA 
TGTAAGACAC 



41 
I 

ATTCCATAAT 
ATGAACTAAG 
AAATTATGAT 
AAAACAGTGT 
CAATTGAAGC 
TGAGATTTGC 
TGGCCAGAGC 
TGTCACAAGG 
CAGTACCACT 
TGCTTTCAGA 
CATTTTCCGG 
AGACTACTAA 
GTTACCGGAA 
TCCCAGTTAA 
CTTGTTTTAT 
CTAAACCAAG 
ATTTCAAGGA 
TAACCCTGAA 
ATCAAGAACC 
AGTGTATTAA 
GAAAAGTTAA 
AACAGTCACC 
CAAGCAGCAA 



51 
I 

GAACAAAGTG 
CTTGAATAAA 
GATGGCAAAC 
TCCGCTAAGT 
GCTTCCCCCA 
TGAATTAAAA 
AAACTGCAAG 
TAATGTCAAA 
AGAAATGCTG 
GGAGGAAAAG 
TTCACTTGGG 
AGCCAGGTTT 
TTCATTGAGA 
CCTTCTAAAT 
GAAAAGACAA 
TGGAAATGAT 
ACCTCTGGTG 
GAATAAAACG 
AGAGGTTCCA 
CCAGAATCCT 
TACAGAGCAG 
ACCAATATCA 
TACCTTGGAT 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 1440 

TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCC 1S00 

ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG CATTTOGGTT 1560 

AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG CAAGGTATTT 1620 

CAGGTGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT ATGTGAACTT AGAAGAAGCA 1680 

GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 1740 

CACAGTGATA AGATCATCCQ ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 1800 

GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA ATCCATTGAT 1860 

CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC AATCCATCAA 1920 

CATGGCATTG TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA TGGAATGCTA 1980 

AAGCTAATTG ATTTTGGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG TGTTGTTAAA 2040 

GATTCTCAGG TTGGCACAGT TAATTATATG CCACCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TTGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TT CAGCAG AT AATTAATCAG 2220 

ATTTCTAAAT TACATGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC CGATATTCCA 2280 

GAGAAAGATC TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC TCATCCCTAT GTTCAAATTC AAACTCATCC AGTTAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCCAACTTGT TGGTCTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 
AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA 

Seq ID NO* 647 Protein sequence 
Protein Accession #t NPJJ03309.1 

1 11 21 31 41 51 

1 I I 1 I I 

MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60 

NPEDWLSLLL KLEKNSVPLS DALLNKLIGR YSQAIEALPP DKYGQNESPA RIQVRFAELK 120 

AIQEPDDARD YFQMARANCK KFAPVHISFA QFELSQGNVK KSKQLLQKAV ERGAVPLEML 180 

EIALRNLNLO KKQIiLSEEEK KNLSASTVLT AQESFSGSLQ HLQNRNNSCD SRGQTTKARF 240 

LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVKTDD SWPCFMKRQ 300 

TSRSECRDLV VPGSKPSGND SCELRNLKSV QNSHFKEPLV SDEKSSELII TDS1TLKNKT 360 

ESSLLAKLEE TKEYQEPEVP ESNQKQWQSK RKSECINQNP AASSNHWQIP ELARKVNTEQ 420 

KHTTFEQPVF SVSKQSPPIS TSKWFDPKSI CKTPSSNTLD DYMSCPRTPV VKNDFPPACQ 480 

LSTPYGQPAC FQQQQHQIIiA TPLQNLQVLA SSSANECISV KGRIYSILKQ IGSGGSSKVF 540 

QVLNEKKQIY AIKYVNLEEA DNQTLDSYRN EIAYLNKLQQ HSDKIIRLYD YEITDQYIYM 600 

VMECGNIDLN SWLKKKKSID PWERKSYWKN MLEAVHTIHQ HGIVHSDLKP ANFLIVDGML 660 

KLIDFGIANQ MQPDTTSWK DSQVGTVNYM PPEA1KDMSS SRENGKSKSK ISPKSDVWSL 720 

GCILYYMTYG KTPFQQIINQ ISKLHAIIDP NHEIEFPDIP EKDLQDVLKC CLKRDPKQRI 780 

SIPELLAHPY VQIQTHPVNQ MAKGTTEEMK YVLGQLVGLN SPNSILKAAK TLYEHYSGGE 840 
SHN8SSSKTF EKKRGKK 

Seq IO NO i 648 ONA sequence 
Nucleic Acid Accession #: NM_015507 
Coding sequence: 241.. 1902 

1 11 21 31 41 51 

I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AG AGG AG CG A 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTG CAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAAT CCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTA CCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAQAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTG CTTGTT I860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 192 0 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGGACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
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TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 

Seq ID NO: 649 Protein sequence 
Protein Accession #: NP_056322 

1 11 21 31 41 51 

I I I I I I 

MPLPWSLALP LIiLSWVAGQF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 

CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCPC 120 

LSGHMLMPDA TCVNSRTCAM INCQYSCBDT EEGPQCLCPS SGLRIAPNGR DCLD1DECAS 180 

GKVICPYNRR CVNTFGSYYC KCHIGFELQV ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 

GSFKCKCKQG YKGNGLRCSA IPENSVKEVL. RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300 

PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALXNDIEER 3 60 

SLRGDVFFPK VNEAGEFGLI LVQRKAI.TSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 

WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 

FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTD ATKS I I F EAERGKGKTG EIAVDGVtrLV 540 
SGLCPDSLLS VED 

Seq ID NO: 650 DNA sequence 

Nucleic Acid Accession #: NM_0O3506.1 

Coding sequence: 259.-2379 

1 11 21 31 41 51 

I I 1 i I I 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 60 

TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 120 

CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 

ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240 

ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360 

ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 

GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480 

ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540 

TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600 

ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 

GTAACTTTTG ATCCACACAC AG AAT TTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780 

TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840 

CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900 

TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020 

GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 1080 

CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 

GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200 

TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260 

CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 

GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 1380 

CTGTGCCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440 

CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTCGAATTG GAGTCTTCAG CGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTCGGATGT 1560 

TACGTCTATG AGCAAGTGAA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620 

CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AQAATTGGCT 1680 

TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740 

GGAAGCAAAA AGACATGCAC AGAATGGGCT GGGTTTTTTA AACGAAATCG CAAGAGAGAT 1800 

CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860 

TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920 

TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980 

ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAGAA 2040 

ACATCAATGA GAGAGGTGAA AGCGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 2100 

TGTGGTGAAC CTGCCTCGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160 

GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGCGGA GTGAAGGAAG GATTAGTCCA 2220 

AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340 

AGAAAAGAGC AGGGAGGTGG TTGTCATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400 

CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

TACGTTCTTC TTTTGCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGAGT 2520 

TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 

AAATGTGCAG GTTAATAATA TT TTTTT AAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2 760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820 

ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880 

ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2 940 

GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000 

TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3 060 

GGTGCTTACT CAAAQAGTGT CCACTATTGA TTGTATTATG CTGCTCACTG ATCCTTCTGC 3120 

ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 3180 

GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 3240 

AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 3300 
TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTGT AG 

Seq ID NO: 651 Protein sequence 
Protein Accession #: NP_003497.l 

1 11 21 31 41 51 

I I I I I I 
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MEMPTPLLTC IFIiPLLRGHS LFTCEPITVP RCMKMAYNMT 
FLPLANLECS PNIETFLCKA FVPTCIEQIH WPPCRKLCE 
LECDRLQYCD ETVPVTPDPH TEFLGPQKKT EQVQRDIGPW 
CAPPCPNMYF KSDELEFAKS FIGTVSIFCI* CATLFTPLTF 
YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK 
VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV 
LYDI/DASRYF VhhPhChCVF VGLSLLLAGI ISLNHVRQVI 
SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP 
LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR 
KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY 
KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG 

Seq ZD NO: 652 DNA sequence 

Nucleic Acid Accession #: NM_014791.1 

Coding sequence: 171.. 2126 



PCI7US02/12476 



FFPNLMGHYD 
KVYSDCKKLI 
CPRHLKTSGG 
LIDVRRFRYP 
ACTVtiFMIiLY 
MLLALNKVEG 
QHDGRNQEKL 
CPYQAKAKAR 
RVLQESCEFF 
LGQETLTEIQ 
SVSESARSEG 
GCHSDT 



TTGGCGGGCG 
GCCGCGTCTC 
TCCGCCCCTC 
ATGATGAACT 
AGGTCAAACT 
AAAACACACT 
TGAGACATCA 
TGGTTCTTGA 
TGTCAGAAGA 
ACAGCCAGGG 
ATAAATTAAA 
ATCTACAGAC 
CATATCTTGG 
GTGGATTTCT 
GAAAATATGA 
TGCAGGTGGA 
TGCAAGATTA 
ATGATTGCGT 
TAATTTCACT 
AGGCTCGGGG 
CTACCCCATT 
ATAAAAATTA 
GTGCTGCTAC 
AATCTAAATC 
AAAATGTATA 
CAAAGACTCC 
ACACTACACC 
TAAATTCAAC 
GCTCAGTGGA 
CCAAAGTGTT 
GCAAAAGGAA 
CTACAACTAG 
AGAAGCATGT 
TTGGGAAAGT 
TGGGTATCAG 
ACATCCTATC 
GGTGTGATAC 
CTACCAACTT 
GATATTATTT 
TAATCATGTG 
AATGTAAGCT 
TTGTGAATAT 



11 

I 

GAAGCGGCCA 
TCAGGACAGC 
AGGTTCTTTT 
TCTCAAATAT 
TGCCTGCCAT 
AGGGAGTGAT 
GCATATATGT 
GTACTGCCCT 
GGAGACCCGG 
CTATGCTCAC 
GCTGATTGAC 
ATGCTGTGGG 
ATCAGAGGCA 
ACCATTTGAT 
TGTTCCCAAG 
CCCAAAGAAA 
CAACTATCCT 
AACAGAACTT 
GTGGCAGTAT 
AAAACCAGTT 
CACAGACATC 
TGTGGCGGGA 
TCCCCGAACA 
ATTAACTCCA 
TACTCCTAAG 
AGTTAATAAG 
CTCAAAAGCT 
AGGAACAGAC 
ATTGGATCTC 
TGGGAGCCTT 
GGGTT CTGCC 
ATTAGTGAAT 
TGACTTTGTA 
GACAATGCAA 
GAGGCAGCGG 
TAGCTGCAAG 
AGCCTACATA 
GTTTCTAAAG 
TGTGTATGAA 
GTTTTGTATA 
CTTAACTATG 



21 
f 

CAACCCGGCG 
AGGCCCCTGT 
TCTAATTCCA 
TATGAATTAC 
ATCCTTACTG 
TTGCCCCGGA 
CAACTCTACC 
GGAGGAGAGC 
GTTGTCTTCC 
AGGGACCTCA 
TTTGGTCTCT 
AGTCTGGCTT 
GATGTTTGGA 
GATGATAATG 
TGGCTCTCTC 
CGGATTTCTA 
GTTGAGTGGC 
TCTGTACATC 
GATCACCTCA 
CGTTTAAGGC 
AAGTCAAATA 
TTAATAGACT 
TCACAGTTTA 
GCCTTATGCA 
TCTGCTGTAA 
AACCAGCATA 
AGAAACCAGT 
AAGTTAATGA 
AACCAAGCAC 
GAAAGGGGGT 
AGAGACGGGC 
CCAGATCAAC 
CAAAAGGGTT 
TTTGAATTAG 
CTTAAGGGCG 
GTATAATTGA 
AAGACTGTTA 
AGCTATCTTA 
TCTAAATCAA 
TTAATAATTG 
TCTCTTTGTA 



31 
I 

ATOGAAAAGA 
CCTTCTGTCG 
AATAAACTTG 
ATGAAACTAT 
GAGAGATGGT 
TCAAAACGGA 
ATGTGCTAGA 
TGTTTGACTA 
GTCAGATAGT 
AGCCAGAAAA 
GTGCAAAACC 
ATGCAG CACC 
GCATGGGCAT 
TAATGGCTTT 
CCAGTAGCAT 
TGAAAAATCT 
AAAGCAAGAA 
ACAGAAACAA 
CGGCTACCTA 
TTTCTTCTTT 
ATTGGAGTCT 
ATGATTGGTG 
CCAAGTACTG 
GAACACCTGC 
AGAATGAAGA 
AGAGAGAAAT 
GCCTGAAAGA 
CAGGTGTCAT 
ATATGGAGGA 
TGGATAAGGT 
CCAGAAGACT 
TGTTGAATGA 
ATACACTGAA 
AAGTGTGCCA 
ATGCCTGGGT 
TGGATTCTTC 
TGATCGCTTT 
AGACCAATAT 
GCCCATCTGT 
TTGACTTTCT 
ATGTGTAATT 



41 

I 

TTCTTAGGAA 
GGCGCCGCTC 
CAAGAGGACT 
TGGGACAGGT 
AGCTATAAAA 
GATTGAGGCC 
GACAGCCAAC 
TATAATTTCC 
ATCTGCTGTT 
TTTGCTGTTT 
CAAGGGTAAC 
TGAGTTAATA 
ACTGTTATAT 
ATACAAGAAG 
TCTGCTTCTT 
ATTGAACCAT 
TCCTTTTATT 
CAGGCAAACA 
TCTTCTGCTT 
CTCCTGTGGA 
GGAAGATGTG 
TGAAGATGAT 
GACAGAATCA 
AAATAAATTA 
GTACTTTATG 
ACTCACTACG 
AACTCCAATT 
TAG CCCTGAG 
GACTCCAAAA 
TATCACTGTG 
AAAGCTTCAC 
AATAATGTCT 
GTGTCAAACA 
GCTTCAAAAA 
TTACAAAAGA 
CATCCTGCCG 
GATTTTAAAG 
CTCTTTGTTT 
CATTATGTTA 
TAGATTCACT 
TCTTTCTGAA 



QS I AAVEMEH 
DTFGIRWPEE 
QGYKFLGIDQ 
ERPIIYYSVC 
FFTMAGTVWW 
DNISGVCFVG 
KKFMIRIGVF 
PELALFMIKY 
LKHNSKVKHK 
TSPETSMREV 
RISPKSDITD 



51 
I 

CGCCGTACCA 
AGCCGTGCCC 
ATGAAAGATT 
GGCTTTGCAA 
ATCATGGATA 
TTGAAGAACC 
AAAATATTCA 
CAGGATCGCC 
GCTTATGTGC 
GATGAATATC 
AAGGATTACC 
CAAGGCAAAT 
GTT CTTATGT 
ATTATGAGAG 
CAACAAATGC 
CCCTGGATCA 
CACCTCGATG 
ATGGAGGATT 
CTAGCCAAGA 
CAAGCCAGTG 
ACCGCAAGTG 
TTATCAACAG 
AATGGGGTGG 
AAGAACAAAG 
TTTCCTGAGC 
CCAAATCGTT 
AAAATACCAG 
AGGCGGTGCC 
AGAAAGGGAG 
CTCACCAGGA 
TATAATGTGA 
ATTCTTCCAA 
CAGTCAGATT 
CCGGATGTGG 
TTAGTGGAAG 
GATGAGTGTG 
TTCATTGGAA 
TTAAACAAAA 
CTGTCTTTTT 
TCCATATGTG 
ATAAAACCAT 



Seq ID NO: 653 Protein sequence 
Protein Accession #: NP_055606.1 



1 
I 

MKDYDELLKY 
LKNIiRHQHIC 
AYVHSQGYAH 
QGKSYLGSEA 
QQMLQVDPKK 
MEDLISLWQY 
TASDKNYVAG 
KNKENVYTPK 
KIPVNSTGTD 
LTRSKRXGSA 
QSDFGKVTMQ 



11 

I 

YELHETIGTG 
QLYHVLETAN 
RDLKPENLLF 
DVWSMGILLY 
RISMKNLLNH 
DHLTATYLLL 
LIDYDWCEDD 
SAVKNEEYFM 
KLMTGVISPE 
RDGPRRLKLH 
PEIiEVCQLQK 



21 
I 

GFAKVKLACH 
KIFMVLEYCP 
DEYHKLKLID 
VLMCGFLPFD 
PWIMQDYNYP 
LAKKARGKPV 
LSTGAATPRT 
FPEPKTPVNK 
RRCRSVELDL 
YNVTTTRLVN 
PDWGIRRQR 



31 
I 

ILTGEMVAIK 
GGELFDYIIS 
FGLCAKPKGN 
DENVMALYKK 
VEWQSKNPFI 
RLRLSSFSCG 
SQFTKYWTES 
NQHKREILTT 
NQAHMEETPK 
PDQLLNEIMS 
LKGDAWVYKR 



41 
I 

IMDKNTLGSD 
QDRIjSEEETR 
KDYHLQTCCG 
IMRGKYDVPK 
HU3DDCVTEL 
QASATPFTDI 
NGVBSKSIiTP 
PNRYTTPSKA 
RKGAKVPGSL 
ILPKKHVDFV 
LVEDILSSCK 



51 
I 

IiPRIKTEIEA 
WFRQIVSAV 
SLAYAAPELI 
WLSPSSILLL 
SVHHRNNRQT 
KSNNWSLEDV 
ALCRTPANKL 
RNQCLKETPI 
ERGLDKVITV 
QKGYTLKCQT 
V 



Seq ID NO> 654 DNA sequence 
Nucleic Acid Accession #: NM_000582 
Coding sequence; 88.. 990 



60 
120 
ISO 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
480 
540' 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1B00 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



51 



1 11 21 31 41 

I I I I I I 

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 
AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 



60 
120 



436 



WO 02/086443 

OGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 

CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC AT CTCAGAAG 240 

CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 300 

GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360 

AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 

TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 

TTCACTCCAG TTGTCCCCAC AQTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540 

GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 

GACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 

CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 720 

GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780 

TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840 

CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 

GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 

TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020 

ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 

CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140 

ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 12 00 

CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 

TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 1320 

ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 

TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 

AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 655 Protein sequence 
Protein Accession #: NP_000573 

1 11 21 31 41 51 

I II I I I 

MRIAVICFCIi LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 

PSKSNESHDH MDDMDDEDDD DHVDSQDS ID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 

DFPTDLPATE VFTPWPTVD TYDGRGDSW YQhRSKSKKF RRPDIQYPDA TDEDITSHME 180 

SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240 
NEHSDVIDSQ ELSKVSREFH SHE FHSHEDM LWDPKSKEE DKHUCFRISH ELDSASSEVN 

Seq ID NO i 656 DNA sequence 

Nucleic Acid Accession #: NMJJ03108.1 

Coding sequence: 76.. 1401 



1 11 21 31 41 51 

I 1 I I I I 

GGGGTGGGAG GGGGAGGGGG ACCTCCGCAC GAGACCCAGC GGCCCGGGTT GGAGCGTCCA 60 

GCCCTGCAAC GGATCATGGT GCAGCAGGCG GAGAGCTTGG AAGCGGAGAG CAACCTGCCC 120 

CGGGAGGCGC TGGACACGGA GGAGGGCGAA TTCATGGCTT GCAGCCCGGT GGCCCTGGAC 180 

GAGAGCGACC CAGACTGGTG CAAGACGGCG TCGGGCCACA TCAAGCGGCC GATGAACGCG 240 

TTCATGGTAT GGTCCAAGAT CGAACGCAGG AAGATCATGG AGCAGTCTCC GGACATGCAC 300 

AACGCCGAGA TCTCCAAGAG GCTGGGCAAG CGCTGGAAAA TGCTGAAGGA CAGCGAGAAG 360 

ATCCCGTTCA TCCGGGAGGC GGAGCGGCTG CGGCTCAAGC ACATGGCCGA CTACCCCGAC 420 

TACAAGTACC GGCCCCGGAA AAAGCCCAAA ATGGACCCCT CGGCCAAGCC CAGCGCCAGC 480 

CAGAGCCCAG AGAAGAGCGC GGCCGGCGGC GGCGGCGGGA GCGCGGGCGG AGGCGCGGGC 540 

GGTGCCAAGA CCTCCAAGGG CTCCAGCAAG AAATGCGGCA AGCTCAAGGC CCCCGCGGCC 600 

GCGGGCGCCA AGGCGGGCGC GGGCAAGGCG GCCCAGTCCG GGGACTACGG GGGCGCGGGC 660 

GACGACTACG TGCTGGGCAG CCTGCGCGTG AGCGGCTCGG GCGGCGGCGG CGCGGGCAAG 720 

ACGGTCAAGT GCGTGTTTCT GGATGAGGAC GACGACGACG ACGACGACGA CGACGAGCTG 780 

CAGCTGCAGA TCAAACAGGA GCCGGACGAG GAGGACGAGG AACCACCGCA CCAGCAGCTC 840 

CTGCAGCCGC CGGGGCAGCA GCCGTCGCAG CTGCTGAGAC GCTACAACGT CGCCAAAGTG 900 

CCCGCCAGCC CTACGCTGAG CAGCTCGGCG GAGTCCCCCG AGGGAGCGAG CCTCTACGAC 960 

GAGGTGCGGG CCGGCGCGAC CTCGGGCGCC GGGGGCGGCA GCCGCCTCTA CTACAGCTTC 1020 

AAGAACATCA CCAAGCAGCA CCCGCCGCCG CTCGCGCAGC CCGCGCTGTC GCCCGCGTCC 1080 

TCGCGCTCGG TGTCCACCTC CTCGTCCAGC AGCAGCGGCA GCAGCAGCGG CAGCAGCGGC 1140 

GAGGACGCCG ACGACCTGAT GTTCGACCTG AGCTTGAATT TCTCTCAAAG CGCGCACAGC 1200 

GCCAGCGAGC AGCAGCTGGG GGGCGGCGCG GCGGCCGGGA ACCTGTCCCT GTCGCTGGTG 1260 

GATAAGGATT TGGATTCGTT CAGCGAGGGC AGCCTGGGCT CCCACTTCGA GTTCCCCGAC 1320 

TACTGCACGC CGGAGCTGAG CGAGATGATC GCGGGGGACT GGCTGGAGGC GAACTTCTCC 13 80 

GACCTGGTGT TCACATATTG AAAGGCGCCC GCTGCTCGCT CTTTCTCTCG GAGGGTGCAG 1440 

AGCTGGGTTC CTTGGGAGGA AGTTGTAGTG GTGATGATGA TGATGATGAT AATGATGATG 1500 

ATGATGGTGG TGTTGATGGT GGCGGTGGTA GGGTGGAGGG GAGAGAAGAA GATGCTGATG 1560 

ATATTGATAA GATGTCGTGA CGCAAAGAAA TTGGAAAACA TGATGAAAAT TTTGGTGGAG 1620 

TTAAAGTGAA ATGAGTAGTT TTTAAACATT TTTCCTGTCC TTTTTTTGTC CCCCCTCCCT 1680 

TCCTTTATCG TGTCTCAAGG TAGTTGCATA CCTAGTCTGG AGTTGTGATT ATTTTCCCAA 1740 

AAAATGTGTT TTTGTAATTA CTATTTCTTT TTCCTGAAAT TCGTGATTGC AACAAAGGCA 1800 

GAGGGGGCGG CGCGGCGGAG GGGAGGTAGG ACCCGCTCCG GAAGGCGCTG TTTGAAGCTT 1860 

GTCGGTCTTT GAAGTCTGGA AGACGTCTGC AGAGGACCCT TTTGGCAGCA CAACTGTTAC 1920 

TCTAGGGAGT TGGTGGAGAT ATTTTTTTTT CTTAAGAGAA CTTAAAGAAC TGGTGATTTT 1980 
TTTTTAACAA AAAAAGGG 

Seq ID NO: 657 Protein sequence 
Protein Accession ft: NP_003 099.1 

1 11 21 31 41 51 

MVQQAESLEA ESNLPREALD TEEGEFMACS PVALDESDPD WCKTASGHIK RPMNAFMVWS 60 

KIERRKIMEQ SPDMHNAEIS KRLGKRWKML KDSEKIPFIR EAERLRLKHM ADYPDYKYRP 120 

RKKPKMDPSA KPSASQSPEK SAAGGGGGSA GGGAGGAKTS KGSSKKCGKL KAPAAAGAKA 180 

GAGKAAQSGD YGGAGDDYVL GSLRVSGSGG GGAGKTVKCV FLDEDDDDDD DDDELQLQIK 240 



437 



5 

10 
15 
2Q 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

QEPDEBDEBP PHQQliLQPPG QQPSQLLRRY NVAKVPASPT LSSSAESPEG ASLYDEVRAG 
ATSGAGGGSR LYYSFKNITK QHPPPLAQPA LSPASSRSVS TSSSSSSGSS SGSSGBDADD 
LMFDLSLNFS QSAHSASEQQ LGGOAAAGNL SLSLVDKDLD 3FSEGSLGSH FEPPDYCTPE 
LSEMIAGDWL EANFSDLVFT V 

Seq ID NO: 658 DNA sequence 
Nucleic Acid Accession ff: NM_001719 
Coding eequencet 123 ..1418 



PCT/US02/12476 



300 
360 
420 



1 
I 

GGGCGCAGCG 
CTGCCACCTG 
CGATGCACGT 
CCCTGTTCCT 
GCTTCATCCA 
CCATTTTGGG 
CCATT3TTCAT 
GCCAGGGCTT 
GCCTGCAAQA 
TGGAACATGA 
TTTCCAAGAT 
ACATCCGGGA 
AG CACT TGGG 
AGGAGGGCTG 
GGCACAACCT 
AGTTGGCGGG 
TCTTCAAGGC 
GCCAGAACCQ 
AGAACAGCAG 
GAGACCTGGG 
AGGGGGAGTG 
AGACGCTGGT 
AGCTCAATGC 
ACAGAAACAT 
TTGGGGCCAA 
CTQCCTTTTG 
AAA CAT GAG c 
TCCTACAAGC 
GCCGGGCCAG 
TTATGAGCGC 
GGGCACATTG 
CAATAAAACG 



11 
I 

GGGCCCGTCT 
GGGCGGTGCG 
GCGCTCACTG 
GCTGCGCTCC 
CCGGCGCCTC 
CTTGCCCCAC 
GCTGGACCTG 
CTCCTACCCC 
TAGCCATTTC 
CAAGGAATTC 
CCCAGAAGGG 
ACGCTTCGAC 
CAGGGAATCG 
GCTGGTGTTT 
GGGCCTGCAG 
CCTGATTGGG 
CACGGAGGTC 
CTCCAAGACG 
CAGCGACCAG 
CTGGCAGGAC 
TGCCTTCCCT 
CCACTTCATC 
CATCTCCGTC 
GGTGGTCCGG 
GTTTTTCTGQ 
TGAGACCTTC 
AGCATATGGC 
TGTGCAGGCA 
GTCATTGGCT 
CTACCAGCCA 
GTGTCTGTGC 
AATGAATG 



21 
I 

GCAGCAAGTG 
GGCCCGGAGC 
CGAGCTGCGG 
GCCCTGGCCG 
CGCAGCCAGG 
CGCCCGCGCC 
TACAACGCCA 
TACAAGGCCG 
CTCACCGACG 
TTCCACCCAC 
GAAGCTGTCA 
AATGAGACGT 
GATCTCTTCC 
GACATCACAG 
CTCTCGGTGG 
CGGCACGGGC 
CACTTCCGCA 
CCCAAGAACC 
AGGCAGGCCT 
TGGATCATCG 
CTGAACTCCT 
AACCCGG AAA 
CTCTACTTCG 
G CCTGTGGCT 
ATCCTCCATT 
CCCTCCCTAT 
TTTTGATCAG 
AAACCTAGCA 
GGGAAGTCTC 
GGCCACCCAG 
GAAAGGAAAA 



31 

I 

ACCGACGGCC 
CCGGAGCCCG 
CGCCGCACAG 
ACTTCAGCCT 
AG CGGCGGGA 
CGCACCTCCA 
TGGCGGTGGA 
TCTTCAGTAC 
CCGACATGGT 
GCTACCACCA 
CGGCAGCCGA 
TCCGGATCAG 
TGCTCGACAG 
CCACCAGCAA 
AGACGCTGGA 
CCCAGAACAA 
GCATCCGGTC 
AGGAAGCCCT 
GTAAGAAGCA 
CGCCTGAAGG 
ACATGAACGC 
CGGTGCOCAA 
ATGACAGCTC 
GCCACTAGCT 
GCTCGCCTTG 
CCGCAACTTT 
TTTTTCAGTG 
GGAAAAAAAA 
AGCCATGCAC 
CCGTGGGAGG 
TTGACCCGGA 



Seq ID NOt 659 Protein sequence 
Protein Accession #: NP_001710 



MHVRSLRAAA 
ItiGLPHRPRP 
LQDSHFLTDA 
IRBRFDNETP 
HNLGLOLSVE 
QNRSKTPKNQ 
GECAFPLNSY 
RNMWHACGC 



11 
I 

PHSFVALWAP 
HLQGKHNSAP 
DMVMSPVNIjV 
RISVYQVLQE 
TLDGQSINPK 
EALRMANVAE 
MNATNHAIVQ 
H 



21 
I 

LFLLRSALAD 
MFMLDLYNAM 
EHDKEFFHPR 
HLGRESDLFL 
LAGLIGRHGP 
NSSSDQRQAC 
TLVHFINPET 



31 
I 

PSLDNEVHSS 
AVEEGGGPGG 
YHHREFRFDL 
LDSRTLWA3E 
QNKQPFMVAF 
KKHELYVSFR 
VPKPCCAPTQ 



41 
I 

GGGACGGCCG 
GGTAGCGCGT 
CTTCGTGGCG 
GGACAACGAG 
GATGCAGCGC 
GGGCAAGCAC 
GGAGGGCGGC 
CCAGGGCCCC 
CATGAGCTTC 
TCGAGAGTTC 
ATTCCGGATC 
CGTTTATCAG 
CCGTACCCTC 
CCACTGGGTG 
TGGGCAGAGC 
GCAGCCCTTC 
CACGGGGAGC 
GCGGATGGCC 
CGAGCTGTAT 
CTACGCCGCC 
CACCAACCAC 
GCCCTGCTGT 
CAACGTCATC 
CCTCCGAGAA 
GCCAGGAACC 
AAAGGTGTGA 
GCAGCATCCA 
ACAACGCATA 
GGACTCGTTT 
AAGGGGGCGT 
AGTTCCTGTA 



41 
I 

FIHRRLRSQE 
QGFSYPYKAV 
SKIPEGEAVT 
EGWLVFDITA 
FKATEVHPRS 
DLGWQDWI1A 
LNAISVLYFD 



Seq ID NO: 660 DMA sequence 

Nucleic Acid Accession #s Eos sequence 

Coding sequence: 211. .189S 



GGATCTGAGG 
GGGGCTTGGG 
GAGGAATTAT 
TGATTTTTTT 
GTGCTTTTTC 
CACAGGTTCC 
CTTGTGCTGA 
GAAGGTAATT 
AAAATATCGG 
TTCCGACACT 
GCCAATTATT 
TTCTTTGAAC 
GCTGTGGCTA 
CACATGCACT 
GTAGTCCATG 
CAAAATTCCA 
GTTGTGATGT 
TACCTGCATA 
ATCTTGATAG 
ACTCTGGCTG 
GCACCGATCT 
CTAGCTACCA 
AAACTGGCCA 



11 
I 

GGCGCCCAGT 
AGGCAGCCTG 
CTGATAAAAT 
CCCTCGAAAA 
TTTTCTCTTC 
TTGAACAGCT 
AAGCGAAAGT 
GTTTCCCTGA 
CTGTTCCATG 
GTAACCCCAA 
CAGACTGCCT 
GCCTCTATGT 
TTCTCATCAT 
TATTTGTGTC 
CTCACATAGG 
TTGAGGCAAC 
TTATTTACTT 
ATCTCATCTT 
GCTGGGGGTT. 
ATGCGAGGTG 
TAGCAGCTAT 
AAATCTGGGA 
AATCGACACT 



21 
I 

CACTTCCTCC 
CTCTCCAGTC 
TCCTGGGTTA 
TGACCTTTTT 
TTTTTCTACG 
GGATTCTGAT 
ACAATGTGAA 
ATGGGATGGA 
CCCTCCTTAT 
TGGAACATGG 
TCGCTTTCTG 
AATGTATACC 
TGGTTACTTC 
TTTCATGCTG 
AGTAAAGGAG 
TTCTGTGGAC 
CCTGGCTACA 
TGTGGCTTTC 
TCCAGCAGCA 
CTGGGAACTT 
TGGGCTGAAT 
GACCAATGCA 
GGTCCTGGTC 



31 
I 

ACGTTCTCGT 
CCTATCCACC 
ATATTTTTAA 
ATGCTTCGAA 
ATAAATGAAA 
GGCACCATTA 
CTCAACATCA 
CTCATTTGTT 
ATTTATGACT 
GATTTTATGC 
CAGCCAGATA 
GTTGGCTACT 
AGACGATTGC 
AGAGCTACAA 
CTGGAGTCCC 
AAATCACAAT 
AATTATTATT 
TTTTCGGACA 
TTTGTTGCAG 
AGTGCTGGAG 
TTTATTCTGT 
GTTGGGCATG 
CTAGTCTTTG 



41 
I 

GCTGGGCGGG 
CACAGGTTTT 
AAACGGAGAG 
GCAGTTTGTC 
GCATTTCTTC 
CTATAGAGGA 
CAGCTCAACT 
GGCCCAGAGG 
TCAACCATAA 
ACAGCTTAAA 
TCAGCATAGG 
CCATCTCTTT 
ATTGCACTAG 
GCATCTTTGT 
TAATAATGCA 
ATATCGGGTG 
GGATCCTGGT 
CCAAATACCT 
CATGGGCTGT 
ACATCAAGTG 
TTCTGAATAC 
ACACAAGGAA 
GAGTGCATTA 



51 
I 

CCTGCCCCCT 
AGAGCCGGCG 
CTCTGGGCAC 
GTGCACTCGA 
GAGATCCTCT 
AACTCGGCAC 
GGGCCCGGCQ 
CCTCTGGCCA 
GTCAACCTCG 
CGGTTTGATC 
TACAAGGACT 
GTGCTCCAGG 
TGGGCCTCGG 
GTCAATCCGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACAGCGCA 
AACGTGGCAG 
GTCAGCTTCC 
TACTACTGTG 
GCCATCGTGC 
QCGCCCACGC 
CTGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAGTATTAGG 
ATGAACAAGA 
AAGAAAAATG 
CCAGAGGTAA. 
GGCAAGGGGT 
ATAAATGTCA 



51 
I 

RREMQREILS 
FSTQGPPLAS 
AAEFRIYKDY 
TSNHWWNPR 
IRSTGSKQRS 
PEGYAAYYCE 
DSSNVILKKY 



51 

I 

AGGAGCGGAT 
TTGGGTCGGA 
TTTTTAAAAA 
AACCAGCATA 
AAGAAAAAGG 
GCAGATTGTC 
CCAGGAGGGA 
AACAGTGGGG 
AGGAGTTGCT 
TAAAACATGG 
AAAGCAAGAA 
TGGTTCCTTG 
GAACTATATC 
CAAAGACAGA 
GGATGACCCA 
CAAGATTGCT 
GGAAGGTCTC 
GTGGGGCTTC 
GGCACGAGCA 
GATTTATCAA 
GGTTAGAGTT 
GCAATACAGG 
CATCGTGTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



438 



WO 02/086443 

GTATGCCTGC CTCACTCCTT CACTGGQCTC GGGTGGGAGA TCCGCATGCA CTGTGAGCTC 1440 

TTCTTCAACT CCTTTCAGGG TTTCTTTGTG TCTATCATCT ACTGCTACTG CAATGGAGAG 1500 

GTTCAGGCAG AGGTGAAGAA GATGTGGAGT CGGTGGAATC TCTCCGTGGA CTGGAAAAGG 1560 

ACACCGCCAT GTGGCAGCCG CAGATGCGGC TCAGTGCTCA CCACCGTGAC GCACAGCACC 1620 

AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC TTATCTCTGG CAAAGCTGCC 1680 

AAGATCGCCA GCAGACAGCC TGACAGCCAC ATCACTTTAC CTGGCTATGT CTGGAGTAAC 1740 

TCAGAGCAGG ACTGCCTGCC ACACTCTTTC CACGAGGAGA CCAAGGAAGA TAGTGGGAGG 1B00 

CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA TGGAATCTAA CCCAGACACT I860 
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA 

Seq ID NO: 661 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I 1 I 

MIiRSSLSTSI VLFLFSSFST INESI SSRKR HRFLEQLDSD GTITIEEQIV LVLKAKVQCE 60 

LNITAQLQEG EGNCPPEWDG LICWPRGTVG KISAVPCPPY IYDFNHKGVA FRHCNPNGTW 120 

DPMHSLNKTW ANYSDCLRFL QPDISIGKQE FPERLYVMYT VGYSISFGSL AVAIL.IIGYF 180 

RRLHCTRNYI HMHLFVSFML RATSIFVKDR WHAHIGVKE LESLIMQDDP QNSIEATSVD 240 

KSQYIGCKIA WMFIYFLAT NYYWILVEGL YLHNLIPVAP FSDTKYLWGP ILIGWGFPAA 300 

PVAAWAVARA TLADARCWEL SAGD1KWIYQ APILAAIGLN PILPLNTVRV LATKIWETNA 360 

VGHDTRKQYR KLAKSTLVLV LVFGVHYIVP VCLPHSFTGL GWEIRMHCEL FFNSFQGFFV 420 

SIIYCYCMGE VQAEVKKMWS RWNLSVDWKR TPPCGSRRCG SVLTTVTHST SSQSQVAAST 480 

RMVLISGKAA KIASRQPDSH ITLPGYVWSN SEQDCLPHSP HEETKEDSGR QGDDILMEKF 540 
SRPMESNPDT EGCQGETEDV L 

Seq ID NO i 662 DNA sequence 
Nucleic Acid Accession #: NM_005048 
Coding sequence: 143.. 1795 

1 11 21 31 41 51 

! 1 I I I I 

GGCCGGTGGC CCGGGCCCGA CCACCCCAGC TGCGCGTCGT TACTGGCCAC AAGTTTGCTC 60 

TGGGCCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGGGC TCTGGAGGAG GGTCCCTGCT 120 

TCTTCCTACA GCCGTTCCGG GCATGGCCGG GCTGGGGGCG TCGCTCCACG TCTGGGGTTG 180 

GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGCCCAGCTG GATTCTGATG GCACCATTAC 240 

TATAGAGGAG CAGATTGTCC TTGTGCTGAA .AGCGAAAGTA CAATGTGAAC TCAACATCAC 300 

AGCTCAACTC CAGGAGGGAG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 360 

GCCCAGAGGA ACAGTGGGGA AAATATCGGC TGTTCCATGC CCTCCTTATA TTTATGACTT 420 

CAACCATAAA GGAGTTGCTT TCCGACACTG TAACCCCAAT GGAACATGGG ATTTTATGCA 480 

CAGCTTAAAT AAAACATGGG CCAATTATTC AGACTGCCTT CGCTTTCTGC AGCCAGATAT 540 

CAGCATAGGA AAGCAAGAAT TCTTTGAACG CCTCTATGTA ATGTATACCG TTGGCTACTC 600 

CATCTCTTTT GGTTCCTTGG CTGTGGCTAT TCTCATCATT GGTTACTTCA GACGATTGCA 660 

TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GAGCTACAAG 720 

CATCTTTGTC AAAGACAGAG TAGTCCATGC TCACATAGGA GTAAAGGAGC TGGAGTCCCT 780 

AATAATGCAG GATGACCCAC AAAATTCCAT TGAGGCAACT TCTGTGGACA AATCACAATA 840 

TATCGGGTGC AAGATTGCTG TTGTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 900 

GATCCTGGTG GAAGGTCTCT ACCTGCATAA TCTCATCTTT GTGGCTTTCT TTTCGGACAC 960 

CAAATACCTG TGGGGCTTCA TCTTGATAGG CTGGGGGTTT CCAGCAGCAT TTGTTGCAGC 1020 

ATGGGCTGTG GCACGAGCAA CTCTGGCTGA TGCGAGGTGC TGGGAACTTA GTGCTGGAGA 1080 

CATCAAGTGG ATTTATCAAG CACCGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 1140 

TCTGAATACG GTTAGAGTTC TAGCTACCAA AATCTGGGAG ACCAATGCAG TTGGGCATGA 1200 

CACAAGGAAG CAATACAGGA AACTGGCCAA ATCGACACTG GTCCTGGTCC TAGTCTTTGG 1260 

AGTGCATTAC ATCGTGTTCX3 TATGCCTGCC TCACTCCTTC ACTGGGCTCG GGTGGGAGAT 1320 

CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 1380 

CTGCTACTGC AATGGAGAGG TTCAGGCAGA GGTGAAGAAG ATGTGGAGTC GGTGGAATCT 1440 

CTCCGTGGAC TGGAAAAGGA CACCGCCATG TGGCAGCCGC AGATGCGGCT CAGTGCTCAC 1500 

CACCGTGACG CACAGCACCA GCAGCCAGTC ACAGGTGGCG GCCAGCACAC GCATGGTGCT 1560 

TATCTCTGGC AAAGCTGCCA AGATCGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 1620 

TGGCTATGTC TGGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGGAGAC 1680 

CAAGGAAGAT AGTGGGAGGC AGGGAGATGA TATTCTAATG GAGAAGCCTT CCAGGCCTAT 1740 

GGAATCTAAC CCAGACACTG AAGGATGCCA AGGAGAAACT GAGGATGTTC TCTGAATGGA 1800 

CATTTGTGGC TGACTTTCAT GGGCTGGTCC AATGGCTGGT TGTGTGAGAG GGCTTGGCTG 1860 

ATACTCCTAT GCTTGAGTTC AAAGGCTGAA AATTCAGTTA AGGTGTTACT TAATAATAGT 1920 

TTTTAGGCTC CATGAATTGG CTCCTGTAAA TACTAACGAC ATGAAAATGC AAGTGTCAAT 1980 

GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAATG TATGGTATTT 2040 

GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAGATT CAATTGCTTG 2100 

GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC 2160 

ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220 

ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG GAAGATAAAA 2280 

GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 2340 

TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 2400 

ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460 

TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 2520 

TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 2580 

TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 2 640 
T 

Seq ID NO: 663 Protein sequence 
Protein Accession #t NP_005039 

1 11 21 31 41 51 

1 I I I I I 

MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVI* VIjKAKVQCEL NITAQLQEGE 60 

GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD FMHSLNKTWA 120 

NYSDCLRFLQ PDISIGKQEF FERLYVMYTV GYSISPGSLA VAILIIGYFR RLHCTRNYIH 180 

MHLFVSFMLR ATSIFVKDRV VHAHIGVKEL ESLIMQDDPQ NSIEATSVDK SQYIGCKIAV 240 



439 
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15 
20 
25 
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WO 02/086443 

VMPIYFLATN YYWILVEGLY 
LADARCWELS AGDIKWIYQA 
LAKSTLVLVt, VFGVHYIVFV 
QAEVKKMWSR WNLSVDWKRT 
IASRQPDSHI TLPGYVWSNS 
GCQGETEDVXi 



LHNLIFVAFP SDTKYLWGFI LIGWGFPAAP VAAWAVARAT 
PILAAIGLNF ILFLNTVRVL ATKIWETNAV GHDTRKQYRK 
CLPHSFTGLG WEIRMHCELF FNSFQGFFVS IIYCYCNGEV 
PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 
EQDCLPHSFH EETKEDSGRQ GDDILMEKPS RPMESNPDTE 



PCT/US02/12476 



300 
360 
420 
480 
S40 



Seq ID NO: 664 DNA sequence 
Nucleic Acid Accession Ui NM_012152 
Coding sequence* 43.. 1104 



CTTCTTTAAA 
GACAAGCACA 
GGAACAAAGC 
TCTAATTCTC 
TACCTGTTGG 
ATGTTTAACA 
GGGCTTCTGG 
AGGCACATGT 
CTGCTCATTT 
TGGAATTGCC 
TACCTTGTTT 
CTGCGGATCT 
TCCATCAGCC 
GCGTTTGTGG 
AGGCAGTGTG 
GTCGTGAACC 
ATGATCTGCT 
GTCCTCAGCA 
GTCTGCAATA 
GTCTTAGG 



11 
I 

TTTCTTTCTA 
TGGACTTTTT 
TTGTGATTGT 
TGGTCATCGC 
CTAATTTAGC 
CAGGCCCAGT 
ACAGTAGCTT 
CAATCATGAG 
TGCTTGTCTG 
TCTGCAACAT 
TCTGGACAGT 
ACGTGTACGT 
GCCGGAGGAC 
TATGCTGGAC 
GCGTGCAGCA 
CCATCATCTA 
GCTTCTCTCA 
GGAGTGACAC 
AAAGCACTTC 



21 
I 

GGATGTTCAC 
TTATAATAGG 
TTTGTGTGTT 
GGCAGTGATC 
TGCTGCCGAT 
TTCAAAAACT 
GACTGCTTCC 
GATGCGGGTC 
GGCCATCGCC 
CTCTGCCTGC 
GTCCAACCTC 
CAAGAGGAAA 
ACCCATGAAG 
CCCGGGCCTG 
TGTGAAAAGG 
CTCCTACAAG 
GGAGAACCCA 
AGGCAGCCAG 
CTAAACTCTG 



31 

! 

TTCTTCTCCA 
AGCAACACTG 
GGGACGTTTT 
AAAAACAGAA 
TTCTTCGCTG 
TTGACTGTCA 
CTCACCAACT 
CATAGCAACC 
ATTTTTATGG 
TCTTCCCTGG 
ATGGCCTTCC 
ACCAACGTCT 
CTAATGAAGA 
GTGGTTCTGC 
TGGTTCCTGC 
GACGAGGACA 
GAGAGGCGTC 
TACATAGAGG 
GATGCCTCTC 



41 
I 

CAATGAATGA 
ATACTGTCGA 
TCTGCCTGTT 
AATTTCATTT 
GAATTGCCTA 
ACCGCTGGTT 
TGCTGGTTAT 
TGACCAAAAA 
GGGCGGTCCC 
CCCCCATTTA 
TCATCATGGT 
TGTCTCCGCA 
CGGTGATGAC 
TCCTCGACGG 
TGCTGGCGCT 
TGTATGGCAC 
CCTCTCGCAT 
ATAGTATTAG 
GGCCCACCCA 



SI 
I 

GTGTCACTAT 
TGACTGGACA 
TATTTTTTTT 
CCCCTTCTAC 
TGTATTCCTG 
TCTCCGTCAG 
CGCCGTGGAG 
GAGGGTGACA 
CACACTGGGC 
CAGCAGGAGT 
TGTGGTGTAC 
TACAAGTGGG 
TGTCTTAGGG 
CCTGAACTGC 
GCTCAACTCC 
CATGAAGAAG 
CCCCTCCACA 
CCAAGGTGCA 
GGTGATGACT 



Seq ID NO: 665 Protein sequence 
Protein Accession #: NP_036284 



1 
I 

MNECHYDKHM 
FHFPFYYLLA 
LVIAVERHMS 
PIYSRSYLVP 
VMTVIiGAFW 
YGTMKKMICC 



11 

I " 

DFFYNRSNTD 
NLAAADFFAG 
IMRMRVH SNL 
WTVSNLMAFL 
CWTPGLWLL 
FSQENPERRP 



21 
I 

TVDDWTGTKL 
IAYVFLMFNT 
TKKRVTIiLIL 
IMVWYIiRIY 
LDGLNCRQCG 
SRIPSTVLSR 



31 
I 

VIVLCVGTFF 
GPVSKTLTVN 
LVWAIAIFMG 
VYVKRKTNVL 
VQHVKRWPLL 
SDTGSQYIED 



Seq ID NO: 666 DNA sequence 
Nucleic Acid Accession #i NM_002821 
Coding sequence: 150.. 3362 



AACTCCCGCC 
GCGCTCCGGT 
CCTCAGCTCC 
CCCGCCGGTT 
CCATTGTCTT 
TTCGCTGTGA 
CTGTCCAGGA 
ACCGGCTGCA 
AAGCCCGCAG 
TGAAGCATCC 
ACATTGATGG 
ATGGTCAGAG 
GTCCTGAGCA 
GCAGCCAGAA 
CCCAGGACGT 
AGCCACCCCC 
GCCCCCCACA 
TCCGGCCACG 
TCATCCTGGA 
GGGTGTTTAC 
AGCCCAGCGT 
AGAAGGGCCA 
GCCACGCGGC 
TGCCCTCCTG 
TGGATTGCCT 
TCATCTCAGA 
TGGAGGTGTA 
AGGCGCAAGC 
AGCAGTGCAT 
AG CCCACTAT 
ACGCTGGGAC 
TTGCCTCCAA 
TTATCACCTT 
TGCAGTGCGA 
TCCTGGACCC 



11 
I 

TCGGGACGCC 
GCGTCCGCCT 
TTTTCCTGAG 
GCCTCTGCTC 
CATCAAGCAG 
GGTTGAGGCT 
CACGGAGCGG 
GGACTCTGGC 
TGCCAACGCC 
AGCCTCGGAA 
GCACCCTCGa 
CAACCACACA 
TAGTGGGCTG 
CTTCACCTTG 
GGTAGTAGCG 
GAGCCTGCAG 
CCTCCGCAGA 
CAATGCAGGG 
AGCCACACTT 
AGCTGGCAGC 
GTGGTGGGAG 
CGAGCTGGTQ 
CAACCTGGCT 
GCTGAAGAAG 
GACCCAGGCC 
GGACTCACGG 
TGATGGGACA 
CCGTGTCCAA 
GGAGTTTGAC 
TAAGTGGGAA 
CCTGCATTTT 
CGGGCCGCAG 
CAAAGTGGAA 
GGCCCAGGGG 
CACCAAGCTG 



21 
I 

TCGGGGTCGG 
CCTGTGCCCG 
CCCGCCGCGA 
AGCGTCCTGC 
CCGTCCTCCC 
CCGGGCCCGG 

ACCTTCCAGT 
TCCTTCAACA 
GCTGAGATCC 
CCCACCTACC 
GTCAGCAGCA 
TATTCCTGCT 
AGCATTGCTG 
AGGTATGAGG 
TGGCTCTTTG 
GCCACAGTGT 
ATCTACCGCT 
CACCTAGCAG 
GAGGAGCGTG 
CACGCGGGAG 
TTGGCCAATA 
GGTCAGCGGA 
CCCCAAGACA 
ACACCAAAAC 
TTCGAGGTCT 
TGGTACCGTT 
GTGCTGGAAA 
AAGGAGGCCA 
CGGGCAGATG 
GCCCGGGTGA 
GGCCAGATTC 
CCAGAGCGTA 
GACCCCAAGC 
GGACCCAGGA 



31 
I 

GCTCCGGCTG 
CCGCGGAGCA 
TGGGAGCTGC 
TGCTGCCGCT 
AGGATGCACT 
TACATGTGTA 
AGGG CAGCAG 
GTGTGGCTCG 
TCAAATGGAT 
AGCCACAGAC 
AATGGTTCOG 
AGGAGCGGAA 
GCGCCCACAG 
ATGAAAGCTT 
AGGCCATGTT 
AGGATGAGAC 
TTGCCAACGG 
GCATTGGCCA 
AGATTGAAGA 
TGACCTGCCT 
TCCGGCTGCC 
TTGCTGAAAG 
GACAGGATGT 
GCCAGCTGGA 
CTACAGTTGT 
TCAAGAATGG 
GTATGAGCAG 
AGCTCAAGTT 

GGAGCAGCCT 
CTCGAGATGA 
GTGCCCATGT 
CGACTGTGTA 
CGCTGATTCA 
TGCACATCTT 



41 

I 

CLPIFFSNSL 
RWFLRQGLLD 
AVPTLGWNCL 
SPHTSGSISR 
LALLNSWNP 
SISQGAVCNK 



41 

1 

CGGCTGCTGC 
GTCTGCGGCC 
GCGGGGATCC 
GCTGGGCGGT 
GCAGGGGCGC 
CTGGCTGCTC 
CCTGAGCTTT 
GGATGATGTC 
TGAGGCAGGT 
CCAGGTCACA 
AGATGGGACC 
CCTGACGCTC 
TGCTTTTGGC 
TGCCAGGGTG 
CCATTGCCAG 
TCCCATCACT 
GTCTCTGCTG 
GGGGCAGAGG 
CATGCCGCTA 
TCCCCCCAAG 
CACCCATGGC 
TGATGCTGGT 
CAACATCACT 
GGAGGGCAAA 
CTGGTACAGA 
GACCTTGCGC 
CACCCCAGCC 
CACACCACCA 
TTCAGCCACA 
CCCAGAGTGG 
CGCTGGCAAC 
CCAGCTCACT 
CCAGGGCCAC 
GTGGAAAGGC 
CCAGAATGGC 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



51 
I 

VIAAVIKNRK 
SSLTASLTNL 
CN I SACS SLA 
RRTPMKLMKT 
IIY5YKDEDM 
STS 



51 
I 

TGCGGCGCCC 
CGCCGTGCGC 
CCGGCCAGAC 
ACCCAGACAG 
CGGGCGCTGC 
GATGGGGCCC 
GCAGCTGTGG 
ACTGGAGAAG 
CCTGTGGTCC 
CTTCGTTGCC 
CCCCTTTCTG 
CGGCCAGCTG 
CAGGCTTGCA 
GTGCTGGCAC 
TTCTCAGCCC 
AACCGCAGTC 
CTGACCCAGG 
GGCCCACCCA 
TTTGAGC CAC 
GGTCTGCCAG 
AGGGTCTACC 
GTCTACACCT 
GTGGCCACTG 
CCCGGCTACT 
AACCAGATGC 
ATCAACAGCG 
GGCAGCATCG 
CCCCAGCCAC 
GGCCGAGAGA 
GTGACAGACA 
TACACTTGCA 
GTGGCAGTTT 
ACAGCCCTAC 
AAGGACCGCA 
TCCCTGGTGA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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WO 02/086443 

TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG 
ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA 
AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC 
CCGCTGTGGC CTACATCATT GCCGTGCTGQ GCCTCATGTT 
AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC 
QAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA 
GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG 
TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA 
TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA 
GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG 
GGAAGCTGAA CCACGCCAAC GTGGTGCGGC TCCTGGGGCT 
ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA 
AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC 
GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA 
TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT 
TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG 
GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC 
CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA 
CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC 
GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG 
GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG 
GAGGAGGGAG CCOGCTCAGG ATGGCCTGGG CAGGGGAGGA 
CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG 
TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC 
GGCTGACTTG GAOCCAAACT GGGCGACTAG GGCTTTGAGC 
CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC 
TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC 
AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA 
AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT 
ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA 
CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA 
CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC 
TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC 
GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT 
TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC 

Seq ID NO: 667 Protein sequence 
Protein Accession #t NP_002812 



MGAARGS PAR 
VKVYWIiLDGA 
IKWIEAGPW 
KERNLTLRPA 
EAMPHCQFSA 
CIGQGQRGPP 
VRLPTHGRVY 
SQLiEEGKPGY 
CMSSTPAGSI 
GSSLPEWVTD 
TTVYQGHTAL 
RYTCIAGNSC 
GLMPYCKKRC 
KRHSTSDKMH 
LDFRRELEMF 
PLSTKQKVAL 
YHFRQAWVPL 
AGKARLPQPE 



11 
I 

PRRIiPLLSVL 
PVQDTERRFA 
LKHPASEAEI 
GPEHSGLYSC 
QPPPSLQWLP 
IILEATLHLA 
QKGHELVLAN 
LDCLTQATPK 
EAQARVQVLE 
NAGTLHFARV 
LQCEAQGDPK 
NIKHTEAPLY 
KAKRLQKQPE 
PPRSSLQPIT 
GKLNHANWR 
CTQVALGMEH 
RWMSPEAILE 
GCPSKLYRLM 



21 
I 

LLPLLGGTQT 
QGSSLSFAAV 
QPQTQVTLRC 
CAHSAPGQAC 
EDETPITNRS 
ElEDMPIiFEP 
IAESDAGVYT 
PTWWYRNQM 
KLKFTPPPQP 
TRDDAGNYTC 
PLIQWKGKDR 
WDKPVPEES 
GEEPEMECLN 
TLGKSEFGEV 
LLGLCREAEP 
LSNNRFVHKD 
GDFSTKSDVW 
QRCWALSPKD 



31 
I 

AIVFIKQPSS 
DRLQDSGTFQ 
HIDGHPRPTY 
SSQNFTLSIA 
RPPHLRRATV 
RVFTAGSEER 
CKAANLAGQR 
IilSEDSRFEV 
QQCMEFDKEA 
ZASNGPQGQI 
ILDPTKLGPR 
EGPGSPPPYK 
GGPLQNGQPS 
FIiAKAQGLEE 
HYMVLEYVDL 
LAARNCLVSA 
AFGVLMWEVP 
RPSFSEIASA 



Seq ID NO: 668 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequences 1..1389 



ATGGGCTACC 
ACCCTTGTTT 
GTTGTCAACT 
GGGTTTCCTT 
GTTTTATTGA 
AAAACTTTCG 
ATAGCAATGA 
ATCCCAGGAG 
ACAGTTACCT 
TCCCTCATCT 
TCACTGGGTC 
ATTCAAGCGG 
TACAGTTCTC 
GTGATTTCTG 
TTCACCCAAG 
AGATTTTGTT 
GAGGTAATTG 
ACAGTGATGG 
GTTCTAGAAC 
TGTTATCTGA 
ATGCTTCCCA 



11 
1 

AGAGGCAGGA 
CTGAACATGA 
OGATTATAGG 
TGGGAATATT 
TAAAAGGAGG 
GCTTTCCAGG 
TAAGTTACAA 
TTGATCCTGA 
TTACTCTGCC 
CTACAGGTTT 
CACACATACC 
TCGGGGTTAT 
TAGAAGAACC 
TATTTATCTG 
GGGACTTATT 
ATGGTGTCAC 
CCAATGTGTT 
TCATCACTGT 
TCAATGGTGT 
AACTGTCTGA 
TTGGTGCTGT 



21 
I 

GCCTGTCATC 
GTATAAAGAG 
ATCTGGTATA 
GCTTTTATTC 
GGCCCTCTCT 
GTATCTGCTC 
TATAATAGCT 
AAACGTGTTT 
TTTATCCTTG 
AACAACTCTG 
AAAAACAGAA 
GTCTTTTGCA 
CACAGTAGCT 
TATATTCTTT 
TGAAAATTAC 
TGTCATTTTG 
TTTTGGTGGG 
AGCCACGCTT 
GCTCTGTGCA 
AGAACCAAGG 
GGTGATGGTT 



31 
I 

CCGCCGCAGA 
AAAACCTGTC 
ATAGGATTGC 
TGGGTTT CAT 
GGAACAGATA 
CTCTCTGTTC 
GGAGATACTT 
ATTGGTCGCC 
TACCGAAATA 
ATTCTTGGAA 
GACGCTTGGG 
TTTATTTGCC 
AAGTGGTCCC 
GCTACATGTG 
TGCAGAAATG 
ACATACCCTA 
AATCTTTCAT 
GTGTCATTGC 
ACTCCCCTCA 
ACACACTCCG 
TTTGGATTCG 



CATTGCAGGC 
GCCTGTGCCG 
CATTGGGTTG 
CTACTGCAAG 
AGAGATGGAA 
AGAAGAAGTG 
CACAAGTGAT 
GAGTGAGTTT 
GACCCTGGTA 
GAGGGAGTTG 
GTGCCGGGAG 
GCAGTTCCTG 
CAAGCAGAAG 
CCGCTTTGTG 
GAAGGTGTCT 
CCAGGCCTGG 
TACCAAGTCT 
GATGCCCCAT 
TAGACTTCCT 
GGCCCTCAGC 
CACCGTGGAC 
CATCTCTAGA 
GTGCCCTAGT 
CTCACCCTCA 
TGGGCAGTTT 
CCCAATTT CT 
AACTTTGCCT 
TTCTCAAGTT 
CTAGACCAGG 
CTGACCCAGA 
GATGAAGGAG 
GGGCGGCTTT 
CCTGGAGATG 
TTGTTTTGTT 
TTTTTTA 



41 
I 

QDALQGRRAL 
CVARDDVTGE 
QWFRDGTPLS 
DESPARWLA 
FANGSLLLTQ 
VTCI1PPKGI1P 
RQDVNITVAT 
FKNGTLRINS 
TVPCSATGRE 
RAHVQLTVAV 
MHIFQNGSLV 
MIQTIGLSVG 
AEIQEEVALT 
GVAETLVLVK 
GDLKQFLRIS 
QRQVKVSALG 
THGEMPHGGQ 
LGDSTVDSKP 



41 
I 

GAGATTTAGA 
AGTCTGCTGC 
CTTATTCAAT 
ATGTTACGGA 
CCTACCAGTC 
TTCAGTTTTT 
TGAGCAAAGT 
ACTTCATTAT 
TAGCAAAGCT 
TTGTAATGGC 
TATTTGCAAA 
ACCATAACTC 
GCCTTATCCA 
GATACTTGAC 
ATGACCTGGT 
TGGAATGCTT 
CGGTTTTCCA 
TGATTGATTG 
TTTTTAT CAT 
ATAAGATTAT 
TCATGGCTAT 



AACAGCTGCA 
GAGGAGTCGG 
TCGGTGGGTG 
AAGCGCTGCA 
TGCCTCAACG 
GCCTTGACCA 
AAGATGCACT 
GGGGAGGTGT 
CTTGTGAAGA 
GAGATGTTTG 
GCTGAGCCCC 
AGGATTTCCA 
GTGGCCCTAT 
CATAAGGACT 
GCCCTGGGCC 
GTGCCGCTGC 
GATGTCTGGG 
GGTGGGCAGG 
CAGCCCGAGG 
CCCAAGGACC 
AGCAAGCCGT 
GGGAAGCTCA 
GCAACAGGCA 
TCCTTTGGGA 
CCCCTGCCAC 
GGCCTTCAAC 
GGGGAGGGCT 
CTGGGCACAC 
ATTATAGAGG 
CCCACGTCTT 
TTTTCAGGAG 
TATATGTAAT 
AGGAGGGTGG 
TTTTTGTTTT 



51 
I 

LRCEVEAPGP 
EAR SAN AS FN 
DGQSNHTVSS 
PQDWVARYE 
VRPRNAGIYR 
EPSVWWEHAG 
VPSWLKKPQD 
VEVYDGTWYR 
KPTIKWERAD 
PITFKVEPER 
IHDVAPEDSG 
AAVAYIIAVL 
SLGSGPAATN 
SLQTKDEQQQ 
KSKDEKLKSQ 
LSKDVYNSEY 
ADDEVLADLQ 




TTTGGTCAAT 
GTATCCTTTT 
TTTTCAAAGA 
TGGACTTTCC 
TGGAAAGGTC 
AAGGGCAATT 
GCCCAATGCC 
CTTCTTAGTT 
TATGTCCATC 
ATTTACTGGC 
AACATTTGGA 
TGTGACAAGA 
CATTGTTGTA 
CCTCGGGATA 
TCCATCAGCC 
GTCTTGTGTC 
TACAAATACT 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 



PCTYUS02/12476 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
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1020 
1080 
1140 
1200 
1260 
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WO 02/086443 

CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA 
AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 
TTTCAATGA 

Seq ID NO: 669 Protein sequence 
Protein Accession #: Eos sequence 



1320 
1380 



PCTYUS02/12476 



MGYQRQEPVI 
GFPLGILLLF 
IAMISYNIIA 
SIiISTGLTTL 
YSSLEEPTVA 
RFCYGVTVIL 
VLBLNGVLCA 
QDCTHGQBMP 



11 
I 

PPQRDhDDRE 
WVSYVTDFSL 
GDTIiSKVFQR 
ZLGIVMARAI 
KWSRL1HMSI 
TYPMECFVTR 
TPLIFIIPSA 
YCFPDNFSLT 



21 
1 

TLVSEHBYKE 
VLLIKGGALS 
IPGVDPENVF 
SLGPHIPKTE 
VISVFICIFF 
EVIANVFFGG 
CYLKLSEEPR 
NTSESHVQQT 



31 
I 

KTCQSAALFN 
GTDTYQSLVN 
IGRHFIIGLS 
DAWVFAKPNA 
ATCGYLTFTG 
NLSSVFHIW 
THSDKIMSCV 
TQLSTLNISI 



41 
I 

WNSIIGSGI 
KTFGFPGYLL 
TVTFTLPLSL 
IQAVGVMSFA 
FTQGDtiFENY 
TVMVITVATL 
MLPIGAWMV 
FQ 



51 

I 

IGLPYSMKQA 
LSVIiQFLYPF 
YRNIAKLGKV 
FICHHNSFLV 
CRNDDLVTFG 
VSLLIDCLGI 
FGFVMAITNT 



Seq ID NOi 670 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence i 1..1284 



ATGGGCTACC 
AAGCAAGCTG 
TTTTCCCTTG 
TTGGTCAATA 
TATCCTTTTA 
TTTCAAAGAA 
GGACTTTCCA 
GGAAAGGTCT 
AGGGCAATTT 
CCCAATGCCA 
TTCTTAGTTT 
ATGTCCATCG 
TTTACTGGCT 
ACATTTGGAA 
GTGACAAGAG 
ATTGTTGTAA 
CTCGGGATAG 
CCATCAGCCT 
TCTTGTGTCA 
ACAAATACTC 
TCTCTCACAA 
ATTAGTATCT 



11 
I 

AGAGGCAGGA 
GGTTTCCTTT 
TTTTATTGAT 
AAACTTTCGG 
TAGCAATGAT 
TCCCAGGAGT 
CAGTTACCTT 
CCCTCATCTC 
CACTGGGTCC 
TTCAAGCGGT 
ACAGTTCTCT 
TGATTTCTGT 
TCACCCAAGG 
GATTTTGTTA 
AGGTAATTGC 
CAGTGATGGT 
TTCTAGAACT 
GTTATCTGAA 
TGCTTCCCAT 
AAGACTGCAC 
ATACCTCAGA 
TTCAACTCGA 



21 

1 

GCCTGTCATC 
GGGAATATTG 
AAAAGGAGGG 
CTTTCCAGGG 
AAGTTACAAT 
TGATCCTGAA 
TACTCTGCCT 
TACAGGTTTA 
ACACATACCA 
CGGGGTTATG 
AGAAGAACCC 
ATTTATCTGT 
GGACTTATTT 
TGGTGTCACT 
CAATGTGTTT 
CATCACTGTA 
CAATGGTGTG 
ACTGTCTGAA 
TGGTGCTGTG 
CCATGGGCAQ 
GTCTCATGTT 
GTAA 



31 
I 

CCGCCGCAGA 
CTTTTATTCT 
GCCCTCTCTG 
TATCTGCTCC 
ATAATAGCTG 
AACGTGTTTA 
TTATCCTTGT 
ACAACTCTGA 
AAAACAGAAG 
TCTTTTGCAT 
ACAGTAGCTA 
ATATTCTTTG 
GAAAATTACT 
GTCATTTTGA 
TTTGGTGGGA 
GCCACGCTTG 
CTCTGTGCAA 
GAACCAAGGA 
GTGATGGTTT 
GAAATGTTCT 
CAGCAGACAA 



Seq ID NOt 671 Protein sequence 
Protein Accession #» Eos sequence 



1 
I 

MGYQRQEPVI 
LVNKTFGFPG 
GLSTVTFTLP 
PNAIQAVGVM 
FTGFTQGDLF 
IWTVMVITV 
SCVMLPIGAV 
ISIFQLE 



11 
I 

PPQRGLPYSM 
YLLLSVLQPL 
LSLYRNIAKL 
SFAFICHHNS 
ENYCRNDDLV 
ATLVS LL IDC 
VMVFGFVMAI 



21 
I 

KQAGFPLGIL 
YPFIAMISYN 
GKVSLISTGL 
FLVYS SLEEP 
TFGRFCYGVT 
LGIVLELNGV 
TNTQDCTHGQ 



31 
I 

LLFWVSYVTD 
IIAGDTLSKV 
TTLILGIVMA 
TVAKWSRLIH 
VILTYPMECF 
LCATPLIFII 
EMFYCFPDNF 



41 
1 

GAGGATTGCC 
GGGTTTCATA 
GAACAGATAC 
TCTCTGTTCT 
GAGATACTTT 
TTGGTCGCCA 
ACCGAAATAT 
TTCTTGGAAT 
ACGCTTGGGT 
TTATTTGCCA 
AGTGGTCCCG 
CTACATGTGG 
GCAGAAATGA 
CATACCCTAT 
ATCTTTCATC 
TGTCATTGCT 
CTCCCCTCAT 
CACACTCCGA 
TTGGATTCGT 
ACTGCTTTCC 
CACAACTTTC 



41 

1 

FSLVLLIKGG 
FQRIPGVDPE 
RAISLGPHIP 
MSIVISVFIC 
VTREVIANVF 
PSACYLKLSE 
SLTNTSESHV 



Seq ID NOt 672 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequencer 1..1203 



1 
I 

ATGGGCTACC 
AAAGGAGGGG 
TTTCCAGGGT 
AGTTACAATA 
GATCCTGAAA 
ACTCTGCCTT 
ACAGGTTTAA 
CACATACCAA 
GGGGTTATGT 
GAAGAACCCA 
TTTATCTGTA 
GACTTATTTG 
GGTGTCACTG 
AATGTGTTTT 
ATCACTGTAG 
AATGGTGTGC 
CTGTCTGAAG 
GGTGCTGTGG 
CATGGGCAGG 
TCTCATGTTC 



11 
I 

AGAGGCAGGA 
CCCTCTCTGG 
ATCTGCTCCT 
TAATAGCTGG 
ACGTGTTTAT 
TATCCTTGTA 
CAACTCTGAT 
AAACAGAAGA 
CTTTTGCATT 
CAGTAGCTAA 
TATTCTTTGC 
AAAATTACTG 
TCATTTTGAC 
TTGGTGGGAA 
CCACGCTTGT 
TCTGTGCAAC 
AACCAAGGAC 
TGATGGTTTT 
AAATGTTCTA 
AGCAGACAAC 



21 
I 

GCCTGTCATC 
AACAGATACC 
CTCTGTTCTT 
AGATACTTTG 
TGGTCGCCAC 
CCGAAATATA 
TCTTGGAATT 
CGCTTGGGTA 
TATTTGCCAC 
GTGGTCCCGC 
TACATGTGGA 
CAGAAATGAT 
ATACCCTATG 
TCTTTCATCG 
GTCATTGCTG 
TCCCCTCATT 
ACACTCCGAT 
TGGATTCGTC 
CTGCTTTCCT 
ACAACTTTCT 



31 

I 

CCGCCGCAGT 
TACCAGTCTT 
CAGTTTTTGT 
AGCAAAGTTT 
TTCATTATTG 
GCAAAGCTTG 
GTAATGGCAA 
TTTGCAAAGC 
CATAACTCCT 
CTTATCCATA 
TACTTGACAT 
GACCTGGTAA 
GAATGCTTTG 
GTTTTCCACA 
ATTGATTGCC 
TTTAT CATTC 
AAGATTATGT 
ATGGCTATTA 
GACAATTTCT 
ACTTTAAATA 



41 
I 

TGGTCAATAA 
ATCCTTTTAT 
TTCAAAGAAT 
GACTTTCCAC 
GAAAGGTCTC 
GGGCAATTTC 
CCAATGCCAT 
TCTTAGTTTA 
TGTCCATCGT 
TTACTGGCTT 
CATTTGGAAG 
TGACAAGAGA 
TTGTTGTAAC 
TCGGGATAGT 
CATCAGCCTG 
CTTGTGTCAT 
CAAATACTCA 
CTCTCACAAA 
TTAGTATCTT 



51 
I 

TTTATTGATA 
AACTTTCGGC 
AGCAATGATA 
CCCAGGAGTT 
AGTTACCTTT 
CCTCATCTCT 
ACTGGGTCCA 
TCAAGCGGTC 
CAGTTCT CTA 
GATTTCTGTA 
CACCCAAGGG 
ATTTTGTTAT 
GGTAATTGCC 
AGTGATGGTC 
TCTAGAACTC 
TTATCTGAAA 
GCTTCCCATT 
AGACTGCACC 
TACCTCAGAG 
TCAACTCGAG 



60 
120 
180 
240 
300 
360 
420 



51 

1 

TTATTCAATG 
TGTTACAGAC 
CTACCAGTCT 
TCAGTTTTTG 
GAGCAAAGTT 
CTTCATTATT 
AGCAAAGCTT 
TGTAATGGCA 
ATTTGCAAAG 
CCATAACTCC 
CCTTATCCAT 
ATACTTGACA 
TGACCTGGTA 
GGAATGCTTT 
GGTTTTCCAC 
GATTGATTGC 
TTTTATCATT 
TAAGATTATG 
CATGGCTATT 
TGACAATTTC 
TACTTTAAAT 



51 
I 

ALSGTDTYQS 
NVFIGRHFII 
KTEDAWVFAK 
IFFATCGYLT 
FGGNLSSVFH 
EPRTHSDKIM 
QQTTQLSTLN 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



442 



WO 02/086443 

TAA 



PCT/US02/12476 



Seg ID NO: 673 Protein sequence 
^ Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MGYQRQEPVI PPQPSLVLLI KGGALSGTDT YQSLVNKTFG FPGYLLLSVL QPLYPPIAMI 60 

SYNIIAGDTL SKVFQRIPGV DPENVFIGRH FIIGLSTVTF TIiPLSLYRNI AKLG KVSL 1 3 120 

10 TGLTTLILGI VMARAISLGP HIPKTEDAWV FAKPNAIQAV GVMSPAFXCK HNSFLVYSSL 180 

EEPTVAXWSR LIHMSIVI9V PICIPFATCG YLTFTGFTQG DLFENYCRND DLVTFGRFCY 240 

GVTVILTYPM ECFVTREVIA NVFFGGNLSS VFHIWTVMV ITVATLVSLL IDCLGIVLEL 300 

NGVLCATPLI FIIPSACYLK LSEEPRTHSD KIMSCVMLPI GAWMVFGFV MAITNTQDCT 360 
HGQEMFYCFP DNFSLTNTSE 9HVQQTTQLS TLNI9IFQLE 



15 



Seq ID NO? 674 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence t 1..1140 



20 1 11 21 31 41 51 

I I I i I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGGCTTT 60 

CCAGGGTATC TGCTCCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATGATAAGT 120 

TACAATATAA TAGCTGGAGA TACTTTGAGC AAAGTTTTTC AAAGAATCCC AGGAGTTGAT 180 

25 CCTGAAAACG TGTTTATTGG TCGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 240 

CTGCCTTTAT CCTTGTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 300 

GGTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT GGGTCCACAC 360 

ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA AGCGGTCGGG 420 

GTTATGTCTT TTGCATTTAT TTGCCACCAT AACTCCTTCT TAGTTTACAG TTCTCTAGAA 480 

30 GAACCCACAG TAGCTAAGTG GTCCCGCCTT ATCCATATGT CCATCGTGAT TTCTGTATTT 540 

ATCTGTATAT TCTTTGCTAC ATGTGGATAC TTGACATTTA CTGGCTTCAC CCAAGGGGAC 600 

TTATTTGAAA ATTACTGCAG AAATGATGAC CTGGTAACAT TTGGAAGATT TTGTTATGGT 660 

GTCACTGTCA TTTTGACATA CCCTATGGAA TGCTTTGTGA CAAGAGAGGT AATTGCCAAT 72 0 

GTGTTTTTTG GTGGGAATCT TTCATCGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780 

35 ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTGCCTCG GGATAGTTCT AGAACTCAAT 840 

GGTGTGCTCT GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900 

TCTGAAGAAC CAAGGACACA CTCCGATAAG ATTATGTCTT GTGTCATGCT TCCCATTGGT 960 

GCTGTGGTGA TGGTTTTTGG ATTCGTCATG GCTATTACAA ATACTCAAGA CTGCACCCAT 1020 

GGGCAGGAAA TGTTCTACTG CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080 

40 CATGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA ACTCGAGTAA 

Seq ID NO: 675 Protein sequence 
Protein Accession ft: Eos sequence 

45 1 11 21 31 41 51 

I I I I I. I 

MGYQRQEPVI PPQVNKTFGF PGYLtiLSVLQ PLYPFIAMIS YNI I AGDTLS KVFQRIPGVD 60 

PENVPIGRHF IIGLSTVTFT LPL9LYRNIA KLGKV9LIST GLTTLILGIV MARAISLGPH 120 

IPKTEDAWVP AKPNAIQAVG VMSFAFICHH NSPLVYSSLE EPTVAKWSRL IHMSIVISVP 180 

50 I CI FFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN 240 

VPFGGNLSSV FHIWTVMVI TVATLVSLLI DCLGIVLELN GVIiCATPLIF IIPSACYLXL 300 

SEEPRTHSDK IMSCVMLPIG AWMVFGFVM AITNTQDCTH GQEMFYCFPD NFSLTNTSES 360 
HVQQTTQLST LNISIFQLE 

55 Seq ID NO: 676 DNA sequence 

Nucleic Acid Accession #: NM_006853.1 
Coding sequence: 26.. 874 

1 11 21 31 41 51 

60 1 | | | I I 

AGGAATCTGC GCTCGGGTTC CGCAGATGCA GAGGTTGAGG TGGCTGCGGG ACTGGAAGTC 60 

ATCGGGCAGA GGTCTCACAG CAGCCAAGGA ACCTGGGGCC CGCTCCTCCC CCCTCCAGGC 120 

CATGAGGATT CTGCAGTTAA TCCTGCTTGC TCTGGCAACA GGGCTTGTAG GGGGAGAGAC 180 

CAGGAT CATC AAGGGGTTCG AGTGCAAGCC TCACTCCCAG CCCTGGCAGG CAGCCCTGTT 240 

65 CGAGAAGACG CGGCTACTCT GTGGGGCGAC GCTCATCGCC CCCAGATGGC TCCTGACAGC 300 

AGCCCACTGC CTCAAGCCCC GCTACATAGT TCACCTGGGG CAGCACAACC TCCAGAAGGA 360 

GGAGGGCTGT GAGCAGACCC GGACAGCCAC TGAGTCCTTC CCCCACCCCG GCTTCAACAA 420 

CAGCCTCCCC AACAAAGACC ACCGCAATGA CATCATGCTG GTGAAGATGG CATCGCCAGT 480 

CTCCATCACC TGGGCTGTGC GACCCCTCAC CCTCTCCTCA CGCTGTGTCA CTGCTGGCAC 540 

70 CAGCTGCCTC ATTTCCGGCT GGGGCAGCAC GTCCAGCCCC CAGTTACGCC TGCCTCACAC 600 

CTTGCGATGC GCCAACATCA CCATCATTGA GCACCAGAAG TGTGAGAACG CCTACCCCGG 660 

CAACATCACA GACACCATGG TGTGTGCCAG CGTGCAGGAA GGGGGCAAGG ACTCCTGCCA 720 

GGGTGACTCC GGGGGCCCTC TGGTCTGTAA CCAGTCTCTT CAAGGCATTA TCTCCTGGGG 780 

CCAGGATCCG TGTGCGATCA CCCGAAAGCC TGGTGTCTAC ACGAAAGTCT GCAAATATGT 840 

75 GGACTGGATC CAGGAGACGA TGAAGAACAA TTAGACTGGA CCCACCCACC ACAGCCCATC 900 

ACCCTCCATT TCCACTTGGT GTTTGGTTCC TGTTCACTCT GTTAATAAGA AACCCTAAGC 960 

CAAGACCCTC TACGAACATT CTTTGGGCCT CCTGGACTAC AGGAGATGCT GTCACTTAAT 1020 

AATCAACCTG GGGTTCGAAA TCAGTGAGAC CTGGATTCAA ATTCTGCCTT GAAATATTGT 1080 

GACTCTGGGA ATGACAACAC CTGGTTTGTT CTCTGTTGTA TCCCCAGCCC CAAAGACAGC 1140 

80 TCCTGGCCAT ATATCAAGGT TTCAATAAAT ATTTGCTAAA TGAGTG 

Seq ID NO: 677 Protein sequence 
protein Accession #: NP_006844.1 

85 1 . 11 21 31 41 51 

I I I I I 1 

MRILQLILLA LATGLVGGET RIIKGFECKP HSQPWQAALF EKTRLLCGAT LIAPRWLLTA 60 



443 



WO 02/086443 

AHCLKPRYIV HLGQHNLQKE EGCEQTRTAT ESFPHPGFNN SLPNKDHRND IMLVKMASPV 120 

SSSSuur lssScvtag? sclisgwgst sspqlrlpht lrcanitiie hqkcenaypg 180 

NITDTMVCAS VQEGGKDSCQ GDSGGPLVCN QSLQGIISWG QDPCAITRKP GVYTKVCKYV 240 
DWIQETMKNM 

Seq ID NO i 678 DNA sequence 

Nucleic Acid Accession #: Eob sequence 

Coding sequence: 1..933 

1 11 21 31 41 SI 

ATGTGCAGCA ATGGACGGTG CATCCCGGGC GCCTGGCAGT GTGACGGGCT GCCTGACTGC 60 

TTCGACAAGA GTGATGAGAA GGAGTGCCCC AAGGCTAAGT CGAAATGTGG CCCGACCTTC 120 

TTCCCCTGTG CCAGCGGCAT CCATTGCATC ATTGGTCGCT TCCGGTGCAA TGGGTTTGAG 180 

GACTGTCCCG ATGGCAGCGA TGAAGAGAAC TGCACAGCAA ACCCTCTGCT TTGCTCCACC 240 

G^CCGCTACC ACTGCAAGAA CGGCCTCTGT ATTGACAAGA GCTTCATCTG CGATGGACAG 300 

AAGACAACAG TGATGAGGAA AGCTGTGAAA GTTCTCAAGA ACCCGGCAGT 360 

GGGCAGGTGT TTGTGACTTC AGAGAACCAA CTTGTGTATT ACCCCAGCAT CACCTATGCC 420 

ATCATCGGCA GCTCCGTCAT TTTTGTGCTG GTGGTGGCCC TGCTGGCACT GGTCTTGCAC 480 

£cS£a A^GGAACAA CCT CATG ACG CTGCCCGTGC ACCGGCTGCA GCACCCTCTG 540 

CTGCTGTCCC GCCTGGTGGT CCTGGACCAC CCCCACCACT GCAACGTCAC CTACAACGTC 600 

AATAATGGCA TCCAGTATGT GGCCAGCCAG GCGGAGCAGA ATGCGTCGGA AGTAGGCTCC 660 

CCACCCTCCT ACTCCGAGGC CTTGCTGGAC CAGAGGCCTG CGTGGTATGA CCTTCCTCCA 720 

CCGCCCTACT CTTCTGACAC GGAATCTCTG AACCAAGCCG ACCTGCCCCC CTACCGCTCC 780 

CGGTCCGGGA GTGCCAACAG TCCCAGCTCC CAGGCAGCCA GCAGCCTCCT GAGCGTGGAA 840 

GACACCAGCC ACAGCCCGGG GCAGCCTGGC . CCCCAGGAGG GCACTGCTGA GCCCAGGGAC 900 
TCTGAGCCGA GCCAGGGCAC TGAAGAAGTA TAA 

Seq ID NO» . 679 Protein sequence 
Protein Accession tf: Eos sequence 

LsNGRCIPG IwQCDGLPDC PDKSDEKECP KAKSKCGPTF PPCASGIHCI IGRFRCNGFE 60 

DCPDGSDEEN CTANPLLCST ARYHCKNGLC IDKSPICDGQ NNCQDN SD EE SCESSQEPGS 120 

GQVFVTSENQ LVYYPSITYA IIGSSVIFVL WALLALVLH HQRKRNNLMT LPVHRLQHPV 180 

LLSRLWLDH PHHCNVTYNV NNGIQYVASQ AEQNASEVGS PPSYSEALLD QRPAWYDLPP 240 

PPYSSDTESL NQADLPPYRS RSGSANSASS QAASSLLSVE DTSHSPGQPG PQEGTAEPRD 300 
SEPSQGTEEV 

Seq ID NO: 680 DNA sequence 
Nucleic Acid Accession ft: S78203.1 
Coding sequence i 1 . .2190 

1 11 21 31 41 51 

ATGAATCCTT TCCAGAAAAA TGAGTCCAAG -GAAACTCTTT TTTCACCTGT CTCCATTGAA 60 

GAGGTACCAC CTCGACCACC TAGCCCTCCA AAGAAGCCAT CTCCGACAAT CTGTGGCTCC 120 

AACTATCCAC TGAGCATTGC CTTCATTGTG GTGAATGAAT TCTGCGAGCG CTTTTCCTAT 180 

^TCGAATGA AAGCTGTGCT GATCCTGTAT TTCCTGTATT TCCTGCACTG GAATGAAGAT 240 

ACCTCCACAT CTATATACCA TGCCTTCAGC AGCCTCTGTT ATTTTACTCC CATCCTGGGA 300 

GCAGCCATTG CTGACTCGTG GTTGGGAAAA TTCAAGACAA TCATCTATCT CTCCTTGGTG 360 

TATGTGCTTG GCCATGTGAT CAAGTCCTTG GGTGCCTTAC CAATACTGGG AGGACAAGTG 420 

GTACACACAG TCCTATCATT GATCGGCCTG AGTCTAATAG CTTTGGGGAC AGGAGGCATC 480 

AAACCCTGTG TGGCAGCTTT TGGTGGAGAC CAGTTTGAAG AAAAACATGC AGAGGAACGG 540 

ACTAGATACT TCTCAGTCTT CTACCTGTCC ATCAATGCAG GGAGCTTGAT TTCTACATTT 600 

ATCACACCCA TGCTGAGAGG AGATGTGCAA TGTTTTGGAG AAGACTGCTA TGCATTGGCT 660 

TTTGGAGTTC CAGGACTGCT CATGGTAATT GCACTTGTTG TGTTTGCAAT GGGAAGCAAA 720 

ATATACAATA AACCACCCCC TGAAGGAAAC ATAGTGGCTC AAGTTTTCAA ATGTATCTGG 780 

£^£££tT CCAATCGTTT CAAGAACCGT TCTGGAGACA TTCCAAAGCG ACAGCACTGG 840 

CTAGACTGGG CAGCTGAGAA ATATCCAAAG CAGCTCATTA TGGATGTAAA GGCACTGACC 900 

AGGGTACTAT TCCTTTATAT CCCATTGCCC ATGTTCTGGG CTCTTTTGGA TCAGCAGGGT 960 

TCACGATGGA CTTTGCAAGC CATCAGGATG AATAGGAATT TGGGGTTTTT TGTGCTTCAG 1020 

CCGGACCAGA TGCAGGTTCT AAATCCCTTT CTGGTTCTTA TCTTCATCCC GTTGTTTGAC 1080 

TTTGTCATTT ATCGTCTGGT CTCCAAGTGT GGAATTAACT TCTCATCACT TAGGAAAATG 1140 

GCTGTTGGTA TGATCCTAGC GTGCCTGGCA TTTGCAGTTG OGGCAGCTGT AGAGATAAAA 1200 

ATAAATGAAA TGGCCCCAGC CCAGTCAGGT CCCCAGGAGG TTTTCCTACA AGTCTTGAAT 1260 

CTGGCAGATG ATGAGGTGAA GGTGACAGTG GTGGGAAATG AAAACAATTC TCTGTTGATA 1320 

Sg^C^CA AATCCTTTCA GAAAACACCA CACTATTCCA AACTGCACCT GAAAACAAAA 1380 

AGCCAGGATT ^CACTTCCA CCTGAAATAT CACAATTTGT CTCTCTACAC TGAGCATTCT 1440 

otgSgI Igaactggta cagtcttgtc attcgtgaag atgggaacag TATCTCCAGC 1500 

ATGATGGTAA AGGATACAGA AAGCAAAACA ACCAATGGGA TGACAACCGT GAGGTTTGTT 1560 

AACACTTTGC ATAAAGATGT CAACATCTCC CTGAGTACAG ATACCTCTCT CAATGTTGGT 1620 

SSSSg StG^GTCTGC TTATAGAACT GTGCAAAGAG GAGAATACCC TGCAGTGCAC 1680 

TGTAGAACAG AAGATAAGAA CTTTTCTCTG AATTTGGGTC TTCTAGACTT TGGTGCAGCA 1740 

?A^££g ^A^ACTAA TAACACCAAT CAGGGTCTTC AGGCCTGGAA GATTGAAGAC 1800 

I^Sa^CA ACAAAATGTC CATTGCGTGG CAGCTACCAC AATATGCCCT GGTTACAGCT 1860 

GG^A^S £g^£c?GT CACAGGTCTT GAGTTTTCTT ATTCTCAGGC TCCCTCTAGC 1920 

ATGAAATCTG TGCTCCAGGC AGCTTGGCTA TTGACAATTG CAGTTGGGAA TATCATCGTG 1980 

CTTCTTGTGG CACAGTTCAG TGGCCTGGTA CAGTGGGCCG AATTCATTTT GTTTTCCTGC 2040 

CTCCTGCTGG TGATCTGCCT GATCTTCTCC ATCATGGGCT ACTACTATGT TCCTGTAAAG 2100 

ACAGAGGATA TGCGGGGTCC AGCAGATAAG CACATTCCTC ACATCCAGGG GAACATGATC 2160 
AAACTAGAGA CCAAGAAGAC AAAACTCTGA 

Seq ID NO j 681 Protein sequence 
Protein Accession #* AAB34388.1 



444 



WO 02/086443 



PCT/US02/12476 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



i 
I 

MNPFQKNESK 
YGMKAVLILY 
YVLGHVIKSL 
TRYFSVFYL3 
IYNKPPPEGN 
RVLFLYIPLP 
FVIYRLVSKC 
LADDEVKVTV 
VQEKNWYSLV 
EDYGVSAYRT 
IPANKMSIAW 
LWAQFSQLV 
KLETKKTKL 



11 
I 

ETLPSPVSIE 
FLYPLHWNED 
GALPILGGQV 
INAGSLISTF 
IVAQVFKCIW 
MFWALLDQQG 
G INF S SLR KM 
VGNENNSLLI 
IREDGNSISS 
VQRGEYPAVH 
QLPQYALVTA 
QWAEFILFSC 



21 
I 

EVPPRPPSPP 
TSTSIYHAFS 
VHTVLSLIGL 
ITPMZjRGDVQ 
FAISNRFKNR 
6RWTLQAIRM 
AVGMILACLA 
ESIKSFQKTP 
MMVKDTESKT 
CRTEDKNFSL 
GEVMFSVTGL 
LLLVICLIFS 



31 
I 

KKPSPTICGS 

slcyftpilg 
slialgtggi 
cfgedcyala 
sgdipkrqhw 
nrnlgffvlq 
pavaaaveik 
hysklhlktk 

TNGMTTVRFV 
NLGLLDFGAA 
EFSYSQAPSS 
IMGYYYVPVK 



41 
) 

NYPLSIAFIV 
AAIADSWLGK 
KPCVAAFGGD 
FGVPGLLMVI 
LDWAAEKYPK 
PDQMQVLNPF 
INEMAPAQSG 
SQDFHFHLKY 
NTLHKDVNIS 
YLFVITNNTN 
MKSVLQAAWL 
TEDMR G PAD K 



SI 
! 

VNEFCERFSY 
FKTIIYIiSLV 
QFEEKHABER 
ALWFAMGSK 
QLIMDVKALT 
LVLIFIPLFD 
PQEVFLQVLN 
HNLSLYTEHS 
LSTDTSLNVG 
QGLQAWKIED 
LTIAVGNIIV 
HIPHIQGNMI 



Seq ID NO: 682 DNA Bequence 

Nucleic Acid Accession #: nm_016077.1 

Coding sequence : 128 667 



TCGCTTTGTG 
CGCGATAGAA 
ACTGTAGATG 
CTTGGCTGTT 
GATGCTCCCC 
CTTGGGAGAC 
AAAAGGGAAA 
AAGAAGAAAT 
CAAAGCTCCT 
GACTGTAAGT 
CCTAGGGATT 
TTACTAGGTG 
GATTCTAACA 
AAACCTATTC 



11 
I 

ATTCTTGATC 
ACGTGTTCGC 
CCCTCCAAAT 
GGAGTTGCTT 
AAAAGCAAGA 
AGCGGGGAGT 
GTGGCTGCCC 
CCTGAAATGC 
GATGAAGAAA 
TTAATTCAAG 
GGGCCAGQAC 
GACTTTGATA 
ACAAAAGCTG 
CCATGTTCTA 



21 

1 

CGGAACTTTG 
TTGCCCAGAA 
CCTTGGTTAT 
GTGGCATGTG 
OGAGCAAGAC 
ACAAGATGAT 
AGTGCTCTCA 
TCAAACAATQ 
CCCTGATTGC 
ATGCTGGACG 
CAGCAGACCT 
TGACAACAAC 
AATTTCTTCA 
AAAAAA 



31 
I 

TCACCCAGGA 
GAAGGGAAGG 
GGAATATTTG 
CCTGGGCTGG 
ACACACAGAT 
TCTTGTGGTT 
TGCTGCTGTT 
GGAATACTGT 
ATTATTGGCC 
TACTCAGATT 
AATTGACAAA 
CCCTCCATCA 
CCCAACTTAA 



41 
I 

ACCCCGGAAG 
CGCGAGTGAG 
GCTCATCCCA 
AGCCTTCGAG 
ACTGAAAGTG 
CGAAATGACT 
TCAGCCTACA 
GGCCAGCCCA 
CATGCAAAAA 
GCACCAGGCT 
GTCACTGGTC 
CAAGTGTTTG 
ATGTTCTTGA 



51 

1 

AGGTAGCTCA 
GAAAGGAGGT 
GTACACTCGG 
TATGCTTTGG 
AAGCAAGCAT 
TAAAGATGGG 
AGCAGATTCA 
AGGTGGTGGT 
TGCTGGGACT 
CTCAAACTGT 
ACCTAAAACT 
AAGCCTGTCA 
GATGAAAATA 



Seq ID NO: 683 Protein sequence 
Protein Accession ft: NP_0 57161.1 



11 
I 



21 



31 



41 



51 



MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WJSLRVCFGML PKSKTSKTHT DTESEASILG 
DSGEYKMILV VRNDLKMGKG KVAAQCSHAA VSAYKQIQRR NPEMLKQWEY CGQPKVWKA 
PDEETL I ALL AHAKMLGLTV SLIQDAGRTQ IAPGSQTVLG IGPGPADLID KVTGHLKLY 

Seq ID NO i 684 DNA sequence 

Nucleic Acid Accession #i NMJ>04864.1 

Coding sequence: 26.. 952 



CGGAACGAGG 
TCAGATGCTC 
GGCCGAGGCG 
ATTCCGAGAG 
CTGGGAAGAT 
AGTGCGGCTG 
GGGGCTCCCC 
AAGGTCGTGG 
GCCCGCGCTG 
AT CTTCGTCC 
CCGCAGAGCG 
TCTGCACACG 
ACGGGAGGTG 
CATGCACGCG 
CTGCTGCGTG 
GTCGCTCCAG 
GGTCCTTCCA 
GGGCTCAAGG 
TTATTTATTA 
ACTGTGTATT 
AAAA 



11 
I 

GCAACCTGCA 
CTGGTGTTGC 
AGCCGCGCAA 
TTGCGGAAAC 
TCGAACACCG 
GGATCCGGCG 
GAGGCCTCCC 
GACGTGACAC 
CACCTGCGAC 
GCACGGCCCC 
CGTGCGCGCA 
GTCCGCGCGT 
CAAGTGACCA 
CAGATCAAGA 
CCCGCCAGCT 
ACCTATGATG 
CTGTGCACCT 
TTCCTGAGAC 
TTAATTTATT 
TATTTAAAAC 



21 
I 

CAGCCATGCC 
TGGTGCTCTC 
GTTTCCOGGG 
GCTACGAGGA 
ACCTCGTCCC 
GCCACCTGCA 
GCCTTCACCG 
GACCGCTGCG 
TGTCGCCGCC 
AGCTGGAGTT 
ACGGGGACGA 
CGCTGGAAGA 
TGTGCATCGG 
CGAGCCTGCA 
ACAATCCCAT 
ACTTGTTAGC 
GCGCGGGGGA 
ACCCGATTCC 
GGGGTGACCT 
TCTGGTGATA 



31 

I 

CGGGCAAGAA 
GTGGCTGCCG 
ACCCTCAGAG 
CCTGCTAACC 
GGCCCCTGCA 
CCTGCGTATC 
GGCTCTGTTC 
GCGTCAGCTC 
GCCGTCGCAG 
GCACTTGCGG 
CTGTCCGCTC 
CCTGGGCTGG 
CGCGTGCCCG 
CCGCCTGAAG 
GGTGCTCATT 
CAAAGACTGC 
GGCGACCTCA 
TGCCCAAACA 
TCTTGGGGAC 
AAAATAAAGC 



41 

I 

CTCAGGACGG 
CATGGGGGCG 
TTGCACTCCG 
AGQCTGCGGG 
GTCCGGATAC 
TCTCGGGCCG 
CGGCTGTCCC 
AGCCTTGCAA 
TCGGACCAAC 
CCGCAAGCCG 
GGGCCCGGGC 
GCCGATTGGG 
AGCCAGTTCC 
CCCGACACGG 
CAAAAGACCG 
CACTGCATAT 
GTTGTCCTGC 
GCTGTATTTA 
TCGGGGGCTG 
TGTCTGAACT 



51 
I 

TGAATGGCTC 
CCCTGTCTCT 
AAGACTCCAG 
CCAACCAGAG 
TCACGCCAGA 
CCCTTCCCGA 
CGACGGCGTC 
GACCCCAAGC 
TGCTGGCAGA 
CCAGGGGGCG 
GTTGCTGCCG 
TGCTGTCGCC 
GGGCGGCAAA 
AGCCAGCGCC 
ACACCGGGGT 
GAGCAGTCCT 
CCTGTGGAAT 
TATAAGTCTG 
GTCTGATGGA 
GTTAAAAAAA 



Seq ID NO: 685 Protein sequence 
Protein Accession #« NP 004855.1 



1 11 

I I 
MPGQELRTVN GSQMLLVLLV 
EDLLTRLRAN QSWEDSNTDL 
HRALFRLSPT ASRSWDVTRP 
ELHLRPQAAR GRRRARARNG 
IGACPSQFRA ANMHAQIKTS 
LAKDCHCI 



21 31 41 51 

I I I I 

LSWLPHGGAL SLAEASRASF PGPSELHSED SRFRELRKRY 
VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGLPEASRL 
LRRQLSLARP QAPALKLRLS PPPSQSDQLL AESSSARPQL 
DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 
LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 



60 
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300 
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720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
' 840 
900 
960 
1020 
1080 
1140 
1200 



60 
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180 
240 
300 



Seq ID NO: 686 DNA sequence 
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Nucleic Acid Accession ft: NM_002423.2 
Coding sequence t 48.. 851 

1 11 21 31 41 51 

5 | | | I I i 

ACCAAATCAA CCATAGGTCC AAGAACAATT GTCTCTGGAC GGCAGCTATG CGACTCACCG 60 

TGCTGTGTGC TGTGTGCCTQ CTGCCTGGCA GCCTGGCCCT GCCGCTGCCT . CAGGAGGCGG 120 

GAGGCATGAG TGAGCTACAG TGGGAACAGG CTCAGGACTA TCTCAAGAGA TTTTATCTCT 180 

rt ATGACTCAGA AACAAAAAAT GCCAACAGTT TAGAAGCCAA ACTCAAGGAG ATGCAAAAAT 240 

10 TCTTTGGCCT ACCTATAACT GGAATGTTAA ACTCCCGCGT CATAGAAATA ATGCAGAAGC 300 

CCAGATGTGG AGTGCCAGAT GTTGCAGAAT ACTCACTATT TCCAAATAGC CCAAAATGGA 360 

CTTCCAAAGT GGTCACCTAC AGGATCGTAT CATATACTCG AGACTTACCG CATATTACAG 420 

TGGATCGATT AGTGTCAAAG GCTTTAAACA TGTGGGGCAA AGAGAT CCCC CTGCATTTCA 480 

GGAAAGTTGT ATGGGGAACT GCTGACATCA TGATTGGCTT TGCGCGAGGA GCTCATGGGG 540 

15 . ACTCCTACCC ATTTGATGGG CCAGGAAACA CGCTGGCTCA TGCCTTTGCG CCTGGGACAG 600 

GTCTCGGAGG AGATGCTCAC TTCGATGAGG ATGAACGCTG GACGGATGGT AGCAGTCTAG 660 

GGATTAACTT CCTGTATGCT GCAACTCATG AACTTGGCCA TTCTTTGGGT ATGGGACATT 720 

CCTCTGATCC TAATGCAGTG ATGTATCCAA CCTATGGAAA TGGAGATCCC CAAAATTTTA 780 

AACTTTCCCA GGATGATATT AAAGGCATTC AGAAACTATA TGGAAAGAGA AGTAATTCAA 840 

20 GAAAGAAATA GAAACTTCAG GCAGAACATC CATTCATTCA TTCATTGGAT TGTATATCAT 900 

TGTTGCACAA TCAGAATTGA TAAGCACTGT TCCTCCACTC CATTTAGCAA TTATGTCACC 960 

CTTTTTTATT GCAGTTGGTT TTTGAATGTC TTTCACTCCT TTTATTGGTT AAACTCCTTT 1020 

ATGGTGTGAC TGTGTCTTAT TCCATCTATG AGCTTTGTCA GTGCGCGTAG ATGTCAATAA 1080 
ATGTTACATA CACAAATAAA TAAAATGTTT ATTCCATGGT AAATTTA 



25 



40 



Seq ID NO: 687 Protein sequence 
Protein Accession ft: NP_002414.1 



1 11 21 31 41 51 

30 i j i | I I 

MRLTVI*CAVC LLPGSLALPIi PQEAGGMSEL QWEQAQDYLK RFYLYDSETK NANSLEAKLK 60 

EMQKFFGIiPI TGMLNSRVIE IMQKPRCGVP DVAEYSLFPN SPKWTSKWT YRI V SYTRDL 120 

PHITVDRLVS KAXiNMWGKEI PLHFRKWWG TADIMIGFAR GAHGDSYPFD GPGNTLAHAF 180 

APGTGLGGDA HFDEDERWTD GS SLGINFLY AATHELGHSL GMGHSSDPNA VMYPTYGNGD 240 

35 PQNFKLSQDD IKGIQKLYGK RSNSRKK 



Seq XD NO: 688 DNA sequence 

Nucleic Acid Accession ft: NM_005221.3 

Coding sequence : 1 . . 870 



1 11 21 31 41 51 

I I I I I I 

ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 60 

TTCCAGACGT CCGCAGCTAT GCACCATCCG TCTCAGGAAT CGCCAACTTT GCCCGAGTCT 120 

45 TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACGG CTACTGCTCT 180 

CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA TCACGGCGTG 240 

AACGGCTCCG CCGGGAGCTA CCCAGCCAAA GCTTATGCCG ACT AT AG CT A CGCTAGCTCC 300 

TACCACCAGT ACGGCGGCGC CTACAACCGC GTCCCAAGCG CCACCAACCA GCCAGAGAAA 360 

GAAGTGACCG AGCCCGAGGT GAGAATGGTG AATGGCAAAC CAAAGAAAGT TCGTAAACCC 420 

50 AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 480 

TACCTCGCCT TGCCGGAACG CGCCGAGCTG GCCGCCTCGC TGGGATTGAC ACAAACACAG 540 

GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATGAA AAACGGGGAG 600 

ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCCAATGG CGTGTAACTC GCCGCAGTCT 660 

CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC CGCTCGCTCA GCCACCACCC TCATGCCCAC 720 

55 CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 780 

ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGGCTCCTT ACAGCACCCG 840. 
CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 

Seq ID NO* 689 Protein sequence 
60 Protein Accession ft* NP_005212.1 

1 11 21 31 41 51 

I I I I I I 

MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPHGYCS 60 

65 PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 120 

EVTEPEVRMV NGKPKKVRKP RTIYSSFQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTQ 180 

VKIWFQNKRS KIKKIMKNGE MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 240 
PPTSNQSPAS SYLENSASWY TSAASSINSH LPPPGSLQHP LALASGTLY 
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It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
ncorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 LA method of detecting a lung cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1 A- 16. 

1 2. The method of claim 1, wherein the polynucleotide selectively 

2 hybridizes to a sequence at least 95% identical to a sequence as shown in Tables 1 A- 16. 

1 3. The method of claim 1 , wherein the biological sample is a tissue 

2 sample. 

1 4. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 5. The method of claim 4, wherein the nucleic acids are mRNA. 

1 6. The method of claim 4, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 7. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1 A-16. 

1 8. The method of claim 1, wherein the polynucleotide is labeled. 

1 9. The method of claim 8, wherein the label is a fluorescent label. 

1 10. The method of claim 1, wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 11. The method of claim 1 , wherein the patient is undergoing a therapeutic 

2 regimen to treat lung cancer. 

1 12. The method of claim 1, wherein the patient is suspected of having lung 

2 cancer. 

1 1 3 . A method of monitoring the efficacy of a therapeutic treatment of lung 

2 cancer, the method comprising the steps of: 
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3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 

5 (ii) determining the level of a lung cancer-associated transcript in the 

6 biological sample by contacting the biological sample with a polynucleotide that selectively 

7 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1A-16, 

8 thereby monitoring the efficacy of the therapy. 

1 14. The method of claim 13, further comprising the step of: (iii) comparing 

2 the level of the lung cancer-associated transcript to a level of the lung cancer-associated 

3 transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 

4 treatment. . 

1 15. The method of claim 1 3, wherein the patient is a human. 

1 1 6 . A method of monitoring the efficacy of a therapeutic treatment of lung 

2 cancer, the method comprising the steps of: 

3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 

5 (ii) determining the level of a lung cancer-associated antibody in the biological 

6 sample by contacting the biological sample with a polypeptide encoded by a polynucleotide 

7 that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 

8 Tables 1 A-16, wherein the polypeptide specifically binds to the lung cancer-associated 

9 antibody, thereby monitoring the efficacy of the therapy. 

1 17. The method of claim 16, further comprising the step of: (iii) comparing 

2 the level of the lung cancer-associated antibody to a level of the lung cancer-associated 

3 antibody in a biological sample from the patient prior to, or earlier in, the therapeutic 

4 treatment. 

1 18. The method of claim 1 6, wherein the patient is a human. 

1 1 9. A method of monitoring the efficacy of a therapeutic treatment of lung 

2 cancer, the method comprising the steps of: 

3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 
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5 (ii) determining the level of a lung cancer-associated polypeptide in the 

6 biological sample by contacting the biological sample with an antibody, wherein the antibody 

7 specifically binds to a polypeptide encoded by a polynucleotide that selectively hybridizes to 

8 a sequence at least 80% identical to a sequence as shown in Tables 1 A-16, thereby 

9 monitoring the efficacy of the therapy. 

1 20. The method of claim 19, further comprising the step of: (iii) comparing 

2 the level of the lung cancer-associated polypeptide to a level of the lung cancer-associated 

3 polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic 

4 treatment. 

2 1 . The method of claim 1 9, wherein the patient is a human. 

22. An isolated nucleic acid molecule consisting of a polynucleotide 
sequence as shown in Tables 1 A- 16. 

23. The nucleic acid molecule of claim 22, which is labeled. 

24. The nucleic acid of claim 23, wherein the label is a fluorescent label 

25. An expression vector comprising the nucleic acid of claim 22. 

26. A host cell comprising the expression vector of claim 25. 

27. An isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1A-16. 

28. An antibody that specifically binds a polypeptide of claim 27. 

29. The antibody of claim 28, further conjugated to an effector component. 

30. The antibody of claim 29, wherein the effector component is a 
fluorescent label. 

3 1 . The antibody of claim 29, wherein the effector component is a 
radioisotope or a cytotoxic chemical. 

32. The antibody of claim 29, which is an antibody fragment. 
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1 33 . The antibody of claim 29, which is a humanized antibody 

1 34. A method of detecting a lung cancer cell in a biological sample from a 

2 patient, the method comprising contacting the biological sample with an antibody of claim 

3 28. 

1 35. The method of claim 34, wherein the antibody is further conjugated to 

2 an effector component. 

1 36. The method of claim 35, wherein the effector component is a 

2 fluorescent label. 

1 37. A method of detecting antibodies specific to lung cancer in a patient, 

2 the method comprising contacting a biological sample from the patient with a polypeptide 

3 encoded by a nucleic acid comprises a sequence from Tables 1A-16. 

1 3 8 . A method for identifying a compound that modulates a lung cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a lung cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1 A-16; and 

6 (ii) determining the functional effect of the compound upon the polypeptide. 

1 39. The method of claim 38, wherein the functional effect is a physical 

2 effect. 

1 40. The method of claim 38, wherein the functional effect is a chemical 

2 effect. 

1 41. The method of claim 38, wherein the polypeptide is expressed in a 

2 eukaryotic host cell or cell membrane. 

1 42. The method of claim 38, wherein the functional effect is determined by 

2 measuring ligand binding to the polypeptide. 

1 43. The method of claim 38, wherein the polypeptide is recombinant. 



451 



WO 02/086443 



PCT/US02/12476 



1 44. A method of inhibiting proliferation of a lung cancer-associated cell to 

2 treat lung cancer in a patient, the method comprising the step of administering to the subject a 

3 therapeutically effective amount of a compound identified using the method of claim 38. 

1 45 . The method of claim 44, wherein the compound is an antibody. 

1 46. The method of claim 45, wherein the patient is a human. 

1 47. A drug screening assay comprising the steps of 

2 (i) administering a test compound to a mammal having lung cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1 A- 16 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of lung cancer. 

1 48. The assay of claim 47, wherein the control is a mammal with lung 

2 cancer or a cell therefrom that has not been treated with the test compound. 

1 49. The assay of claim 47, wherein the control is a normal cell or mammal. 

1 50. A method for treating a mammal having lung cancer comprising 

2 administering a compound identified by the assay of claim 47. 

1 51. A pharmaceutical composition for treating a mammal having lung 

2 cancer, the composition comprising a compound identified by the assay of claim 47 and a 

3 physiologically acceptable excipient. 
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